inference-harness

T

Abiba f42747d721 feat: performance analytics panel on dashboard

dashboard/dashboard.py (+61 lines):
- New /api/performance endpoint proxying to router metrics/performance
- Performance Analytics row with 4 panels:
  - Latency distribution (p50/p95/p99 per model) with stacked bars
  - Throughput comparison (avg + p50 tokens/sec per model)
  - Routing effectiveness table by reason
  - Agent performance bars with latency
- 1h/24h window toggle, auto-refresh every 15s
- Color-coded per model (purple=MoE, amber=Dense, green=VLM)

2026-05-25 16:58:15 +00:00

dashboard

feat: performance analytics panel on dashboard

2026-05-25 16:58:15 +00:00

nginx

feat: per-request performance tracking + /metrics/performance endpoint

2026-05-25 16:50:45 +00:00

router

feat: per-request performance tracking + /metrics/performance endpoint

2026-05-25 16:50:45 +00:00

.gitignore

May 19, 2026: Full harness update