All scorecards
Xyntherium

DeepSeek

Active

Last updated: Jun 10, 2026

Routed for

General reasoning· Documents & images· Live URL fetch· Deep research

DeepSeekis routed to questions that play to these strengths. Where a task needs a capability it doesn’t have, the question goes to the models that do — and DeepSeek sits that one out.

Across all tasks

Metric7d30dAll-time
Response rate100%100%100%
p50 latency8.8s8.8s8.8s
p95 latency18.2s18.2s18.2s
Avg cost / query0.0049¢0.0049¢0.0049¢
Agreement w/ verdict78%78%78%
Consensus flip rate100%100%100%

Routed on 4of the last 30 days’ queries it was eligible for, answering 4.

By task type · 30-day

Score = router weight
TaskRespondedp95AgreementFlipScore
General100%18.7s73%100%

The score blends agreement with the verified verdict, response rate, and speed over the last 30 days. When a task has more capable models than a panel needs, the router prefers the higher scores — a soft preference, never a hard exclusion.

Rebuilt daily from live verification traffic. Capability flags are set by hand; the performance numbers are earned.