🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
Fix OWLv2 post_process_object_detection for multiple images (#31082)
* Add test for multiple images
* [run slow] owlv2
* Fix box rescaling
* [run slow] owlv2
Pavel Iakubovskii committed
98e2d48e9af3631e8f8f2070198912fc5d6bc19e
Parent: c31473e
Committed by GitHub
on 5/28/2024, 11:06:06 AM