All scorecards
Xyntherium

Claude

Active

Last updated: Jun 10, 2026

Routed for

General reasoningDocuments & images· Live URL fetchDeep research

Claudeis routed to questions that play to these strengths. Where a task needs a capability it doesn’t have, the question goes to the models that do — and Claude sits that one out.

Across all tasks

Metric7d30dAll-time
Response rate100%100%100%
p50 latency5.1s5.1s5.1s
p95 latency9.1s9.1s9.1s
Avg cost / query0.4283¢0.4283¢0.4283¢
Agreement w/ verdict53%53%53%
Consensus flip rate67%67%67%

Routed on 4of the last 30 days’ queries it was eligible for, answering 4.

By task type · 30-day

Score = router weight
TaskRespondedp95AgreementFlipScore
General100%9.4s67%100%

The score blends agreement with the verified verdict, response rate, and speed over the last 30 days. When a task has more capable models than a panel needs, the router prefers the higher scores — a soft preference, never a hard exclusion.

Rebuilt daily from live verification traffic. Capability flags are set by hand; the performance numbers are earned.