🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
docs: add continuous batching page (#41847)
* docs: add continuous batching page
* docs(cb): add `generate_batch` example
* docs(cb): add `opentelemtry` and `serving` section
* feat: add `TODO` note about opentelemetry dependency
* docs(cb): add supported features
* docs(cb): add unsupported features
* docs(cb): add `ContinuousBatchingManager` example
* docs(cb): x reference CB in optimizing inference
Luc Georges committed
22e39dfb319fdba0cd17302c285240f26fb4dcd2
Parent: 63fbd50
Committed by GitHub <[email protected]>
on 11/3/2025, 2:19:30 PM