MCP TESTING PLANNED
Record real MCP sessions. Replay them deterministically.
Capture every tool call, freeze the responses, and run them back in CI. No flaky network. No vendor rate limits. Same bytes, every time.
v0.2 · early alpha · MCP record/replay in dev
Manage, version, and release LLM prompts the way you ship code. Git-native specs, signed releases, JUnit-ready replay.
03 — capabilities
MCP TESTING PLANNED
Capture every tool call, freeze the responses, and run them back in CI. No flaky network. No vendor rate limits. Same bytes, every time.
PROMPT REGRESSION
Every prompt is versioned. Every change runs the suite. Pass-rate, latency, and tokens land in the PR before review.
# promptlm.yaml id: customer_support group: support version: 1.0.0 request: vendor: openai model: gpt-4o parameters: { temperature: 0.6, maxTokens: 256 } messages: - role: system content: You are a support assistant. - role: user content: "Summarize: {{ticket}}"
MODEL MOCKING
Inject malformed JSON. Force a refusal. Drop a tool call halfway through. Build the failure modes that production will eventually serve you — and write the tests that catch them.
// TranslationServiceTest.java @EnablePromptWireMock class TranslationServiceTest { @Test void translates( @InjectPrompt(id = "translate-hello") String prompt, @InjectResponse(id = "translate-hello") String response) { // WireMock stubs auto-generated from your prompt repo } }
GET STARTED