fix(compression): remove hardcoded gemini-3-flash-preview as default summary model
Closes #2453 The DEFAULT_CONFIG was hardcoding google/gemini-3-flash-preview as the summary_model for context compression. This caused unexpected OpenRouter charges for users who configured a different provider/model, because the compression task would silently fall back to gemini via OpenRouter even when the user's main model was on a different provider. Fix: change summary_model default to empty string. When empty, call_llm() resolves the model through the standard auto-detection chain (auxiliary.compression config -> env vars -> main provider), which correctly uses the user's configured provider and model. Users who want a dedicated cheap model for compression can still explicitly set compression.summary_model in their config.yaml.
M
Mibayy committed
0698ddb49618646c6a576fe7d8e15d8503604c5a
Parent: 0962cbb
Committed by Teknium <teknium1@gmail.com>
on 3/22/2026, 11:36:36 AM