Compare two models

No YAML needed. Pick a scenario and two models, and get a benchmark card in under a minute.

Runs use OpenAI. You get a public benchmark URL and a card to share. A repo is optional; this run is saved under A2ZAI quick-compare.