Arthur
96d3795cfc
Update model tags and integration references in bug report ( #40881 )
2025-09-15 12:08:29 +02:00
Ákos Hadnagy
9c804f7ec4
Redirect MI355 CI results to dummy dataset ( #40862 )
2025-09-14 18:42:49 +02:00
Ákos Hadnagy
d8f670583e
Change docker image to preview for the MI355 CI ( #40693 )
...
* Change docker image to preview for the MI355 CI
* Use pushed image
2025-09-04 17:23:09 +02:00
Yih-Dar
30a4b8707d
CircleCI docker images cleanup / update / fix ( #40681 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-09-04 10:42:18 +02:00
Yih-Dar
ca9b36a9c1
Avoid night torch CI not run because of irrelevant docker image failing to build ( #40677 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-09-04 09:06:37 +02:00
Ákos Hadnagy
8c60a7c385
Add collated reports job to Nvidia CI ( #40470 )
...
* Add collated reports job to Nvidia CI
* machine_type
* Move collated reports job to model_jobs
* Propagate repo id variable
* assifgn runner_type is self-scheduled-caller
2025-09-02 14:25:22 +02:00
Matt
3c3dac3c12
Add Copilot instructions ( #40432 )
...
* Add copilot-instructions.md
* Fix typo
* Update .github/copilot-instructions.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com >
2025-09-01 14:09:54 +01:00
Yih-Dar
db6821b79c
Allow remi-or to run-slow ( #40590 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-09-01 12:30:53 +02:00
Yih-Dar
821384d5d4
Fix the CI workflow of merge to main ( #40503 )
...
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-08-27 18:35:12 +02:00
ivarflakstad
304225aa15
Collated reports: no need to upload artifact ( #40502 )
...
No need to upload collated reports as gh artifact
2025-08-27 18:31:55 +02:00
Yih-Dar
80f4c0c6a0
CI when PR merged to main ( #40451 )
...
* up
* up
* up
* up
* up
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-08-27 10:56:18 +02:00
Yih-Dar
ff8b88a948
Fix nightly torch CI ( #40469 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-08-26 22:02:15 +02:00
Yih-Dar
74ad608a2b
Not to shock AMD team by the cancelled workflow run notification ❤️ 💖 ( #40467 )
2025-08-26 20:53:24 +02:00
ivarflakstad
6b5eab70e4
Remove working-dir from collated reports job ( #40435 )
2025-08-25 18:14:35 +02:00
ivarflakstad
1a35d07f56
Update collated reports working directory and --path ( #40433 )
2025-08-25 15:18:26 +00:00
Yih-Dar
5d906740d2
Update CI with nightly torch workflow file ( #40306 )
...
* fix nightly ci
* Apply suggestions from code review
Co-authored-by: ivarflakstad <69173633+ivarflakstad@users.noreply.github.com >
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
Co-authored-by: ivarflakstad <69173633+ivarflakstad@users.noreply.github.com >
2025-08-20 16:59:00 +02:00
ivarflakstad
28746cdc7b
Remove MI300 CI ( #40270 )
...
Remove MI300 CI (in history if we need it back)
2025-08-19 08:23:39 +00:00
Ákos Hadnagy
e472efb9ac
Fix benchmark workflow ( #40254 )
...
Correct init_db.sql path
Co-authored-by: Akos Hadnagy <akoshuggingface@mi325x8-123.atl1.do.cpe.ice.amd.com >
2025-08-18 18:14:16 +00:00
ivarflakstad
2fe43376cd
AMD scheduled CI ref env file ( #40243 )
...
* Reference env-file to be used in docker running the CI
* Disable MI300 CI for now
2025-08-18 15:23:27 +02:00
Guillaume LEGENDRE
e446372f76
Create self-scheduled-amd-mi355-caller.yml ( #40134 )
2025-08-14 01:33:45 +02:00
ivarflakstad
ebceef343a
Collated reports ( #40080 )
...
* Add initial collated reports script and job definition
* provide commit hash for this run. Also use hash in generated artifact name. Json formatting
* tidy
* Add option to upload collated reports to hf hub
* Add glob pattern for test report folders
* Fix glob
* Use machine_type as path filter instead of glob. Include machine_type in collated report
2025-08-13 14:48:15 +02:00
Yih-Dar
801e869b67
send some feedback when manually building doc via comment ( #39889 )
...
* fix
* fix
* fix
* Update .github/workflows/pr_build_doc_with_comment.yml
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com >
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com >
2025-08-04 18:20:48 +00:00
Yih-Dar
0d511f7a77
Use comment to build doc on PRs ( #39846 )
...
* try
* try
* try
* try
* try
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-08-04 10:24:45 +02:00
Joao Gante
33aa49df9d
[docs] Ko doc fixes after toc update ( #39660 )
...
* update docs
* doc builder working
* make fixup
2025-07-29 17:05:26 +01:00
Yih-Dar
63b3200779
Use --gpus all in workflow files ( #39752 )
...
gpu all
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-07-29 14:53:33 +02:00
Anton Vlasjuk
a0fa500a3d
[CI] Add Eric to comment slow ci ( #39601 )
...
add to ci
2025-07-28 13:24:00 +00:00
Jitesh Gupta
cbede2969b
Add self-hosted runner scale set workflow for mi325 CI ( #39651 )
2025-07-28 13:32:25 +02:00
Joao Gante
328ca9cf1d
[dependencies] Update datasets pin ( #39500 )
...
* pyarrow pin
* make fixup
* test?
* like this?
* like this?
* like this?
* datasets pin
* comment
2025-07-18 12:05:28 +00:00
Joao Gante
2b819ba4e3
[dependencies] temporary pyarrow pin ( #39496 )
...
* pyarrow pin
* make fixup
* test?
* like this?
* like this?
* like this?
2025-07-18 10:05:40 +00:00
Yih-Dar
161cf3415e
add stevhliu to the list in self-comment-ci.yml ( #39315 )
...
add
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-07-09 19:07:44 +02:00
Yih-Dar
34c16167eb
Don't send new comment if the previous one is less than 30 minutes (unless the content is changed) ( #39170 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-07-07 14:43:50 +02:00
Ilyas Moutawwakil
18e0cae207
Fix many HPU failures in the CI ( #39066 )
...
* more torch.hpu patches
* increase top_k because it results in flaky behavior when Tempreture, TopP and TopK are used together, which ends up killing beams early.
* remove temporal fix
* fix scatter operation when input and src are the same
* trigger
* fix and reduce
* skip finding batch size as it makes the hpu go loco
* fix fsdp (yay all are passing)
* fix checking equal nan values
* style
* remove models list
* order
* rename to cuda_extensions
* Update src/transformers/trainer.py
2025-07-03 11:17:27 +02:00
Yih-Dar
ab59cc27fe
Suggest jobs to use in run-slow ( #39100 )
...
* pr
* pr
* pr
* pr
* pr
* pr
* pr
* pr
* pr
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-07-01 20:19:06 +02:00
Yih-Dar
fe838d6631
Fix missing fsdp & trainer jobs in daily CI ( #39153 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-07-01 18:10:30 +02:00
Yih-Dar
539c6c2fa8
All CI jobs with A10 ( #39119 )
...
all a10
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-06-30 14:23:27 +02:00
ivarflakstad
f367c6337d
Update self-comment-ci.yml user list ( #39014 )
...
add ivarflakstad to self-comment-ci.yml
2025-06-24 20:13:36 +02:00
Ilyas Moutawwakil
984ff89e73
Gaudi3 CI ( #38790 )
2025-06-23 10:56:51 +02:00
Yih-Dar
3d34b92116
Switch to use A10 progressively ( #38936 )
...
* try
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-06-20 16:10:35 +00:00
Quentin Gallouédec
bc68defcac
Update PULL_REQUEST_TEMPLATE.md ( #38770 )
2025-06-12 14:03:33 +02:00
Yih-Dar
7c58336949
[Hotfix] Fix style bot ( #38779 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-06-12 10:20:36 +02:00
Yih-Dar
60d4b35b20
Make style bot trigger CI after push ( #38754 )
...
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-06-11 20:40:04 +02:00
Yih-Dar
6b610d89f1
Revert "Trigger doc-builder job after style bot" ( #38735 )
...
Revert "Trigger doc-builder job after style bot (#38398 )"
This reverts commit 51e0fac29f .
2025-06-11 14:56:39 +02:00
Yih-Dar
5009252a05
Better CI ( #38552 )
...
better CI
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-06-06 17:59:14 +02:00
Luc Georges
ae3733f06e
feat: add repository field to benchmarks table ( #38582 )
...
* feat: add `repository` field to benchmarks table
* fix: remove unwanted `,`
2025-06-04 15:40:52 +02:00
Yih-Dar
51e0fac29f
Trigger doc-builder job after style bot ( #38398 )
...
* update
* update
* update
* update
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-28 17:15:34 +02:00
ivarflakstad
3aab6e95cb
Disable mi210 scheduled CI ( #38411 )
2025-05-28 10:35:41 +02:00
ivarflakstad
b1eae943a2
Change slack channel for mi250 CI ( #38410 )
2025-05-28 09:20:34 +02:00
ivarflakstad
07dd6b2495
Add report_repo_id to mi300 workflow ( #38401 )
2025-05-27 16:35:07 +02:00
Jitesh Gupta
c4e71e8fff
Add AMD MI300 CI caller leveraging self-hosted runner scale set workflow in hf-workflows ( #38132 )
2025-05-26 23:13:02 +02:00
Yih-Dar
eb74cf977b
Use one utils/notification_service.py ( #38379 )
...
* step 1
* step 2
* step 3
* step 4
* step 5
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-26 16:15:29 +02:00