54a4f26db7230a99712766aeeb51f2d714fa9fb4
Heavy tier keeps MoE as primary (workhorse for requests over 25K tokens). Default tier routes Dense → VLM → MoE to prevent MoE overload: MoE had 5 timeouts in 15 minutes when Default pushed overflow to it.
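The tier routing described in the commit message above could be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the names `ROUTES`, `pick_tier`, `route`, and the `is_available` callback are hypothetical, and the Heavy tier's fallbacks after MoE are an assumption, since the message only states that MoE stays primary there.

```python
from typing import Callable, Dict, List

# From the commit message: the Heavy tier handles requests over 25K tokens.
HEAVY_TOKEN_THRESHOLD = 25_000

# Ordered fallback chains per tier (hypothetical structure).
# The Default order (Dense -> VLM -> MoE) comes from the commit message.
# The Heavy fallbacks after MoE are an assumption for illustration only.
ROUTES: Dict[str, List[str]] = {
    "heavy": ["moe", "dense", "vlm"],
    "default": ["dense", "vlm", "moe"],
}


def pick_tier(token_count: int) -> str:
    """Return the tier name for a request of the given token count."""
    return "heavy" if token_count > HEAVY_TOKEN_THRESHOLD else "default"


def route(token_count: int, is_available: Callable[[str], bool]) -> str:
    """Return the first available backend in the tier's fallback chain."""
    for backend in ROUTES[pick_tier(token_count)]:
        if is_available(backend):
            return backend
    raise RuntimeError("no backend available for this request")
```

With this ordering, a Default-tier request reaches MoE only when both Dense and VLM are unavailable, which is the overflow behavior the commit aims for.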
Description
SyslogAI Inference Harness — 3-GPU router, dashboard, LiteLLM proxy
Languages
Python 97.6%
Shell 1.9%
Dockerfile 0.5%