GMP Bench

Claude Sonnet 4.6

Anthropic

Overall Score: 98.8% (39 evaluations)
Knowledge QA: 100.0% (35 evals)
Task Completion: 88.3% (4 evals)
Avg Latency: 27.6s (across all evals)

Score by Category