Abiba (pi)
|
8f3b0c6647
|
Router: health check verifies actual llama.cpp endpoint, gpu_decr negative guard, AMD sidecar fixed (sysfs fallback)
|
2026-05-17 01:52:28 +00:00 |
|
Abiba (pi)
|
808c9d3d13
|
Router: 300s timeout, gpu_decr bugfix. Dashboard: Bootstrap 5 modern redesign with KPI stats, equal-height cards, queue ring. Nginx: 600s timeout.
|
2026-05-16 22:12:21 +00:00 |
|
Abiba (pi)
|
9817fe2ef2
|
Dashboard: clean rebuild with Queue Status ring chart, GPU slot indicators, organized layout (GPU/Queue+Model+Agent/Usage/Live)
|
2026-05-16 21:05:19 +00:00 |
|
Abiba (pi)
|
654cdff718
|
Dashboard: GPU slot indicators show active/max concurrent requests. Koonimo API key added. Real-time queuing visibility.
|
2026-05-16 20:43:22 +00:00 |
|
Abiba (pi)
|
bf90e57c5f
|
Load-aware routing: tracks active GPU requests in Redis, distributes overflow when MoE saturated. 6 concurrent requests now spread across all 3 GPUs instead of queuing on one.
|
2026-05-16 20:23:32 +00:00 |
|
Abiba (pi)
|
2db2796e53
|
Dashboard: rename to SyslogAI Harness, GPU bar now shows utilization instead of VRAM
|
2026-05-16 19:26:46 +00:00 |
|
Abiba (pi)
|
ec0f9fac63
|
Fix: clean_unicode now uses chr()-based replacements + ASCII strip to prevent bash heredoc corruption. Emoji and all non-ASCII now fully stripped.
|
2026-05-16 19:12:58 +00:00 |
|
Abiba (pi)
|
3d42ea4767
|
Merge: add Abiba harness code — nginx, LiteLLM, router, dashboard, Redis
|
2026-05-16 18:53:31 +00:00 |
|
Abiba (pi)
|
7b6c6aabe1
|
Initial commit: CT 116 inference harness — nginx, LiteLLM, router, dashboard, Redis
- Complexity-based routing (MoE default, Dense heavy, Gemma light)
- Per-agent API keys with metrics tracking
- Time-series usage graphs (24h/7d/30d)
- Streaming support (SSE passthrough)
- Unicode cleanup (ASCII-only output)
- Vision support (gemma-4-E4B)
- Tier enforcement (starter/professional/enterprise)
- GPU health monitoring via sidecar polling
- Unified dashboard with line graph
|
2026-05-16 18:51:50 +00:00 |
|
mumuni-bot
|
b65ea22765
|
Update Nginx Docker config
|
2026-05-15 21:35:13 +00:00 |
|
mumuni-bot
|
cf7f61650f
|
Add Dockerfile.dashboard
|
2026-05-15 21:34:52 +00:00 |
|
mumuni-bot
|
7d00bbec0e
|
Add Dockerfile.queue
|
2026-05-15 21:34:49 +00:00 |
|
mumuni-bot
|
37f7c95b05
|
Add env example
|
2026-05-15 21:07:34 +00:00 |
|
mumuni-bot
|
a28b3a557d
|
Add Nginx router config
|
2026-05-15 21:07:33 +00:00 |
|
mumuni-bot
|
c42f3a9979
|
Add migration plan
|
2026-05-15 21:07:32 +00:00 |
|
mumuni-bot
|
e1f12c3462
|
Add dashboard
|
2026-05-15 21:07:07 +00:00 |
|
mumuni-bot
|
b55b954967
|
Add queue service
|
2026-05-15 21:07:05 +00:00 |
|
mumuni-bot
|
c85aaa570b
|
Add docker-compose
|
2026-05-15 21:07:05 +00:00 |
|
mumuni-bot
|
43382dac5b
|
Initial commit: README
|
2026-05-15 21:07:03 +00:00 |
|