May 19, 2026: Full harness update

- Model migration: gemma-4-E4B → qwen3.5-9b-vlm
- Dashboard reorder: Usage Over Time + GPU Metrics to top
- Router counter leak fix (gpu_decr in except handler)
- VLM slot upgrade 1→2
- Redis stale key cleanup
- Automated maintenance cron job
- LiteLLM config update
- GPU router config update
- README update
This commit is contained in:
Abiba
2026-05-19 15:03:34 +00:00
parent 4f032b035c
commit 9c31b5d622
7 changed files with 43 additions and 46 deletions
+2 -2
View File
@@ -11,9 +11,9 @@ model_list:
api_base: http://192.168.68.8:8080/v1
api_key: "not-needed"
- model_name: gemma-4-E4B
- model_name: qwen3.5-9b-vlm
litellm_params:
model: openai/gemma-4-E4B
model: openai/qwen3.5-9b-vlm
api_base: http://192.168.68.110:8080/v1
api_key: "not-needed"