All scorecards
Xyntherium
DeepSeek
ActiveLast updated: Jun 10, 2026
Routed for
✓ General reasoning· Documents & images· Live URL fetch· Deep research
DeepSeekis routed to questions that play to these strengths. Where a task needs a capability it doesn’t have, the question goes to the models that do — and DeepSeek sits that one out.
Across all tasks
| Metric | 7d | 30d | All-time |
|---|---|---|---|
| Response rate | 100% | 100% | 100% |
| p50 latency | 8.8s | 8.8s | 8.8s |
| p95 latency | 18.2s | 18.2s | 18.2s |
| Avg cost / query | 0.0049¢ | 0.0049¢ | 0.0049¢ |
| Agreement w/ verdict | 78% | 78% | 78% |
| Consensus flip rate | 100% | 100% | 100% |
Routed on 4of the last 30 days’ queries it was eligible for, answering 4.
By task type · 30-day
Score = router weight| Task | Responded | p95 | Agreement | Flip | Score |
|---|---|---|---|---|---|
| General | 100% | 18.7s | 73% | 100% | — |
The score blends agreement with the verified verdict, response rate, and speed over the last 30 days. When a task has more capable models than a panel needs, the router prefers the higher scores — a soft preference, never a hard exclusion.
Rebuilt daily from live verification traffic. Capability flags are set by hand; the performance numbers are earned.
