inference-harness

T

Abiba b2ec4b0572 fix: throughput panel handles streaming-only models gracefully

- Dashboard: when a model has zero non-streaming records, shows
  "streaming only" instead of misleading 0 tok/s
- Dashboard: minimum bar width enforced (6% avg, 4% p50) so
  low-tps models are always visible
- Router: removed inflated streaming tps estimate (prompt tokens
  skewed results for long conversations)

Fixes Dense model appearing to "register nothing" when Mumuni
sends mostly streaming requests.

2026-05-25 19:45:21 +00:00

dashboard

fix: throughput panel handles streaming-only models gracefully

2026-05-25 19:45:21 +00:00

nginx

feat: per-request performance tracking + /metrics/performance endpoint

2026-05-25 16:50:45 +00:00

router

fix: handle single data point in performance percentiles

2026-05-25 17:00:40 +00:00

.gitignore

May 19, 2026: Full harness update