Add a switch for Instant Apply to use short model for lower context prompts (#497)
* use enum for model * add small proxy endpoint and use when needed * extend endpoint to reduce duplication and imply dependance * use experiment service * rename small -> short * fix model name * fix type * separate endpoint impls for upcoming header changes * remove extra spaces * Update baseline * Full update * Update --------- Co-authored-by: Vritant Bhardwaj <vrtoku@gmail.com> Co-authored-by: Vritant Bhardwaj <vrbhardw@microsoft.com>
R
Rob Lourens committed
4cc98c62a4f08bdd52f44bc33b9128402d489190
Parent: 629c53e
Committed by GitHub <noreply@github.com>
on 8/7/2025, 5:29:28 PM