f42747d72152eada008371cef66f36acd757c5f4
dashboard/dashboard.py (+61 lines): - New /api/performance endpoint proxying to router metrics/performance - Performance Analytics row with 4 panels: - Latency distribution (p50/p95/p99 per model) with stacked bars - Throughput comparison (avg + p50 tokens/sec per model) - Routing effectiveness table by reason - Agent performance bars with latency - 1h/24h window toggle, auto-refresh every 15s - Color-coded per model (purple=MoE, amber=Dense, green=VLM)
Description
SyslogAI Inference Harness — 3-GPU router, dashboard, LiteLLM proxy
Languages
Python
97.6%
Shell
1.9%
Dockerfile
0.5%