All scorecards
Xyntherium

GPT

Active

Last updated: Jun 10, 2026

Routed for

General reasoningDocuments & images· Live URL fetchDeep research

GPTis routed to questions that play to these strengths. Where a task needs a capability it doesn’t have, the question goes to the models that do — and GPT sits that one out.

Across all tasks

Metric7d30dAll-time
Response rate100%100%100%
p50 latency3.6s3.6s3.6s
p95 latency6.9s6.9s6.9s
Avg cost / query0.0632¢0.0632¢0.0632¢
Agreement w/ verdict82%82%82%
Consensus flip rate100%100%100%

Routed on 4of the last 30 days’ queries it was eligible for, answering 4.

By task type · 30-day

Score = router weight
TaskRespondedp95AgreementFlipScore
General100%7.3s80%100%

The score blends agreement with the verified verdict, response rate, and speed over the last 30 days. When a task has more capable models than a panel needs, the router prefers the higher scores — a soft preference, never a hard exclusion.

Rebuilt daily from live verification traffic. Capability flags are set by hand; the performance numbers are earned.