621a897bec279732448dbe15c78eb31e80c994f2
More conversations now route to VLM as primary. 9B VLM has 262K context window and 88 tok/s average — well suited for moderate conversations. Dense absorbs overflow and heavy reasoning.
Description
SyslogAI Inference Harness — 3-GPU router, dashboard, LiteLLM proxy
Languages
Python
97.6%
Shell
1.9%
Dockerfile
0.5%