SIGN IN SIGN UP

fix(compression): remove hardcoded gemini-3-flash-preview as default summary model

Closes #2453

The DEFAULT_CONFIG was hardcoding google/gemini-3-flash-preview as the
summary_model for context compression. This caused unexpected OpenRouter
charges for users who configured a different provider/model, because the
compression task would silently fall back to gemini via OpenRouter even
when the user's main model was on a different provider.

Fix: change summary_model default to empty string. When empty,
call_llm() resolves the model through the standard auto-detection chain
(auxiliary.compression config -> env vars -> main provider), which
correctly uses the user's configured provider and model.

Users who want a dedicated cheap model for compression can still
explicitly set compression.summary_model in their config.yaml.
M
Mibayy committed
0698ddb49618646c6a576fe7d8e15d8503604c5a
Parent: 0962cbb
Committed by Teknium <teknium1@gmail.com> on 3/22/2026, 11:36:36 AM