Checks pack format

YAML structure, dimensions, optional live execution, and how to share your benchmark.

What is a pack?

A pack is a YAML file that defines cases. Each case has a name, a dimension (quality, safety, latency, or cost), a weight, and either pre-filled baseline/candidate outputs (for heuristic scoring) or an input plus an optional execution block so A2ZAI can call an LLM and score the response.

Required fields

  • name — Pack name (used in the benchmark card and PR comment).
  • description — Short summary of what the pack evaluates.
  • cases — Array of case objects. Each case must have: name, dimension, weight, and either (a) baseline / candidate scores plus baselineOutput / candidateOutput, or (b) input when using execution.
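
Putting the required fields together, a minimal pack using pre-filled outputs might look like the sketch below. The field names come from this page; the case name, scores, and output strings are purely illustrative:

```yaml
name: Support reply checks
description: Evaluates accuracy of refund-policy answers in support replies.
cases:
  - name: refund-window            # illustrative case
    dimension: quality
    weight: 2
    baseline: 62                   # pre-filled baseline score
    candidate: 81                  # pre-filled candidate score
    baselineOutput: "Refunds take 30 days."
    candidateOutput: "Refunds are processed within 14 days of approval."
```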

Dimensions

Every case is tagged with one of four dimensions so the scorecard can show deltas per dimension:

  • quality — Correctness, relevance, and completeness of the response.
  • safety — Policy adherence, no overpromising, safe handling of edge cases.
  • latency — Speed or turnaround (e.g. fewer cycles, concise replies).
  • cost — Token efficiency, concision, or cost-related behavior.
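
For instance, a pack might spread its cases across the four dimensions like this (case names and weights are illustrative):

```yaml
cases:
  - { name: accurate-answer,   dimension: quality, weight: 2 }
  - { name: no-overpromising,  dimension: safety,  weight: 3 }
  - { name: concise-reply,     dimension: latency, weight: 1 }
  - { name: token-budget,      dimension: cost,    weight: 1 }
```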

Scoring rules (per case)

For heuristic scoring you provide baselineOutput and candidateOutput. Checks compares the candidate against:

  • expectedContains — Array of strings; the candidate output should contain these.
  • forbiddenContains — Array of strings; the candidate must not contain these.
  • maxOutputChars / minOutputChars — Length guardrails.
  • threshold — Minimum score (0–100) for the case to pass.
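
The exact scoring formula is not specified on this page, but the way the rule fields interact can be sketched roughly as follows. The function name, starting score, and per-violation deductions below are assumptions for illustration only:

```python
def score_case(output: str, rules: dict) -> int:
    """Hypothetical per-case heuristic: start at 100, deduct per violation.
    The real Checks formula may differ; this only shows how the rule
    fields (expectedContains, forbiddenContains, length guards) interact."""
    score = 100
    for s in rules.get("expectedContains", []):
        if s not in output:                 # required string missing
            score -= 25
    for s in rules.get("forbiddenContains", []):
        if s in output:                     # forbidden string present
            score -= 40
    if "maxOutputChars" in rules and len(output) > rules["maxOutputChars"]:
        score -= 15                         # output too long
    if "minOutputChars" in rules and len(output) < rules["minOutputChars"]:
        score -= 15                         # output too short
    return max(score, 0)

rules = {"expectedContains": ["14 days"], "forbiddenContains": ["guarantee"]}
passed = score_case("Refunds arrive within 14 days.", rules) >= 70  # threshold check → True
```

The threshold then acts as the pass/fail cutoff on whatever score the rules produce.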

When you add an execution block with provider: openai, baselineModel, and candidateModel, Checks runs each case’s input through both models and then applies the same rules to the live outputs.

Execution block (optional)

To run live model comparisons instead of pre-filled outputs, add an execution object:

execution:
  provider: openai
  baselineModel: gpt-4o-mini
  candidateModel: gpt-4.1-mini
  system: Optional system prompt for the assistant.
  temperature: 0
  maxTokens: 140

Each case in the pack must then have an input string (the user prompt). Checks will call the baseline and candidate models with that input and score the responses using expectedContains, forbiddenContains, and length rules.
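
For example, a case under an execution block needs only an input plus the scoring rules. The field names come from this page; the prompt, strings, and numbers are illustrative:

```yaml
cases:
  - name: refund-window
    dimension: quality
    weight: 2
    input: "How long do refunds take?"     # sent as the user prompt
    expectedContains: ["14 days"]
    forbiddenContains: ["guarantee"]
    maxOutputChars: 300
    threshold: 70
```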

Sharing your benchmark

Every run produces a permanent, public URL and an Open Graph image:

  • Benchmark URL — https://a2zai.ai/checks/benchmarks/<slug>. Use it in READMEs, launch posts, and X.
  • Social card image — https://a2zai.ai/checks/benchmarks/<slug>/opengraph-image. On the benchmark page you’ll find copy-paste markdown for a badge and a link.
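
The copy-paste snippet on the benchmark page is authoritative, but given the two URLs above it will resemble a standard markdown image link (illustrative):

```markdown
[![Checks benchmark](https://a2zai.ai/checks/benchmarks/<slug>/opengraph-image)](https://a2zai.ai/checks/benchmarks/<slug>)
```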

You can also compare with a previous run on the same repo and pack: the benchmark page shows “Run history” and a “Compare with” link for each past run.

View benchmark showcase

Next

Use a starter pack from the workbench or paste your own YAML. Connect a repo and run Checks to generate your first benchmark card.

Open workbench →