All scorecards
Xyntherium

Gemini

Active

Last updated: Jun 10, 2026

Routed for

General reasoningDocuments & images· Live URL fetchDeep research

Geminiis routed to questions that play to these strengths. Where a task needs a capability it doesn’t have, the question goes to the models that do — and Gemini sits that one out.

Across all tasks

Metric7d30dAll-time
Response rate75%75%75%
p50 latency1.6s1.6s1.6s
p95 latency6.4s6.4s6.4s
Avg cost / query0.0228¢0.0228¢0.0228¢
Agreement w/ verdict44%44%44%
Consensus flip rate50%50%50%

Routed on 4of the last 30 days’ queries it was eligible for, answering 3.

By task type · 30-day

Score = router weight
TaskRespondedp95AgreementFlipScore
General50%7.0s67%

The score blends agreement with the verified verdict, response rate, and speed over the last 30 days. When a task has more capable models than a panel needs, the router prefers the higher scores — a soft preference, never a hard exclusion.

Rebuilt daily from live verification traffic. Capability flags are set by hand; the performance numbers are earned.