May 19, 2026: Full harness update

- Model migration: gemma-4-E4B → qwen3.5-9b-vlm - Dashboard reorder: Usage Over Time + GPU Metrics to top - Router counter leak fix (gpu_decr in except handler) - VLM slot upgrade 1→2 - Redis stale key cleanup - Automated maintenance cron job - LiteLLM config update - GPU router config update - README update
2026-05-19 15:03:34 +00:00
parent 4f032b035c
commit 9c31b5d622
7 changed files with 43 additions and 46 deletions
@@ -11,9 +11,9 @@ model_list:
      api_base: http://192.168.68.8:8080/v1
      api_key: "not-needed"

-  - model_name: gemma-4-E4B
+  - model_name: qwen3.5-9b-vlm
    litellm_params:
-      model: openai/gemma-4-E4B
+      model: openai/qwen3.5-9b-vlm
      api_base: http://192.168.68.110:8080/v1
      api_key: "not-needed"