SIGN IN SIGN UP

model picker: filter out non-chat models (embeddings, speech, image, rerankers)

The /model picker was flooded with hundreds of non-chat models because
OpenAI-compatible profiles (NVIDIA NIM, FPT, Chutes, Groq) and Bedrock
expose their entire catalogs (embeddings, whisper/ASR, TTS, image
upscalers, rerankers, OCR/parse models, etc.).

Add model_name_is_likely_non_chat() heuristic and apply it via
is_listable_model_name plus a final retain pass in
multiprovider_model_routes so the snapshot/catalog used by both
available_models_display and the model routes is filtered at the source.

Conservative, token-boundary aware matching keeps multimodal chat models
(e.g. llama-3.2-90b-vision-instruct, Qwen2.5-VL-Instruct).
J
jeremy committed
d7adeb7a19d9e5f78a330db95801bd0c6411697a
Parent: adcef3a