🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.

GLM-ASR Support (#42875)

* draft only

* update

* rename

* format config

* update config

* update shape

* update

* testing

* new config

* Update modeling_glmasr.py

* draft change

* update encoder config

* draft PR with no embed

* revert

* revert

* Update modular_glmasr.py

* update

* format

* for doc update

* add processor

* rename

* update rotary

* update convert logic

* update

* update

* update

* update

* update

* use //2

* update for pytest

* change modular

* for sglang support

* Update modeling_glmasr.py

* audioflamingo3: use _get_audio_token_length

* clean modular

* integration tests

* Glmasr -> GlmAsr

* audioflamingo3: audio len and default prompt as attributes

* modular update

* fix

* update integration tests

* use auto dtype

* conversion script update

* fix

* correct test values

* make

* test update

* fix

* make

* auto updates

* doc update

* update doc

* fix

* fix import

* doc update

* fix

* fix

* doc update

* make style

* update max_audio_len

* update repo

* make

* make

* make

---------

Co-authored-by: eustlb <94853470+eustlb@users.noreply.github.com>
Co-authored-by: Eustache Le Bihan <eulebihan@gmail.com>
Yuxuan Zhang committed
a7f29523361b2cc12e51c1f5133d95f122f6f45c
Parent: 7bbfc19
Committed by GitHub <noreply@github.com> on 12/24/2025, 3:30:26 PM