SIGN IN SIGN UP

Add a switch for Instant Apply to use short model for lower context prompts (#497)

* use enum for model

* add small proxy endpoint and use when needed

* extend endpoint to reduce duplication and imply dependance

* use experiment service

* rename small -> short

* fix model name

* fix type

* separate endpoint impls for upcoming header changes

* remove extra spaces

* Update baseline

* Full update

* Update

---------

Co-authored-by: Vritant Bhardwaj <vrtoku@gmail.com>
Co-authored-by: Vritant Bhardwaj <vrbhardw@microsoft.com>
R
Rob Lourens committed
4cc98c62a4f08bdd52f44bc33b9128402d489190
Parent: 629c53e
Committed by GitHub <noreply@github.com> on 8/7/2025, 5:29:28 PM