Commit Graph

1162 Commits

Author SHA1 Message Date
Shion Matsumoto
663b6ce3a8 uv for tests-fabric (#21155) 2025-09-03 15:53:04 +02:00
Shion Matsumoto
29e8ce4b8a uv for pytorch tests (#21148)
* swap to uv
* no pip cache
* enable caching. use venv.
* activate environment
* fix uninstall
* replace pip with uv
* activate venv
* --find-links=...
* GLOO_SOCKET_IFNAME

---------

Co-authored-by: Nicki Skafte Detlefsen <skaftenicki@gmail.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Jirka B <j.borovec+github@gmail.com>
2025-09-03 10:02:35 +02:00
Nicki Skafte Detlefsen
5071a04ae5 Add support for deepspeeds exclude_frozen_parameters (#21060)
* add to deepspeed strategies
* add testing
* changelog
* GLOO_SOCKET_IFNAME

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Jirka B <j.borovec+github@gmail.com>
2025-09-02 16:15:38 +02:00
Shion Matsumoto
c3ca8a516f Fix workflow matrix reference (#21145)
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2025-09-02 11:15:42 +02:00
Shion Matsumoto
e760ad5643 Simplify Fabric tests workflow matrix (#21142) 2025-09-01 17:47:03 +02:00
Shion Matsumoto
db77fa7a44 Simplify workflow matrix (#21132)
* simplify
* add config field
2025-09-01 12:45:37 +02:00
dependabot[bot]
9a41f1f68e build(deps): bump google-github-actions/setup-gcloud from 2 to 3 (#21140)
Bumps [google-github-actions/setup-gcloud](https://github.com/google-github-actions/setup-gcloud) from 2 to 3.
- [Release notes](https://github.com/google-github-actions/setup-gcloud/releases)
- [Changelog](https://github.com/google-github-actions/setup-gcloud/blob/main/CHANGELOG.md)
- [Commits](https://github.com/google-github-actions/setup-gcloud/compare/v2...v3)

---
updated-dependencies:
- dependency-name: google-github-actions/setup-gcloud
  dependency-version: '3'
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-01 09:34:49 +02:00
dependabot[bot]
c504088b03 build(deps): bump google-github-actions/auth from 2 to 3 (#21139)
Bumps [google-github-actions/auth](https://github.com/google-github-actions/auth) from 2 to 3.
- [Release notes](https://github.com/google-github-actions/auth/releases)
- [Changelog](https://github.com/google-github-actions/auth/blob/main/CHANGELOG.md)
- [Commits](https://github.com/google-github-actions/auth/compare/v2...v3)

---
updated-dependencies:
- dependency-name: google-github-actions/auth
  dependency-version: '3'
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-01 09:34:31 +02:00
Jirka Borovec
a08b64edda ci: pin also test requirements for minimal setup (#21102)
* ci: pin also test requirements for minimal setup
* onnx >1.12.0
* numpy >1.20.0
* onnxscript >= 0.1.0
* tensorboard >=2.11
* tensorboardX >=2.6

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-08-21 18:07:18 +02:00
Jirka Borovec
8ff43d46d1 switch to lightning_utilities.cli requirements set-oldest (#21077) 2025-08-19 13:15:58 +02:00
dependabot[bot]
5751f8cdbc build(deps): bump actions/checkout from 4 to 5 (#21091)
Bumps [actions/checkout](https://github.com/actions/checkout) from 4 to 5.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](https://github.com/actions/checkout/compare/v4...v5)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-version: '5'
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-18 08:35:33 +02:00
Jirka Borovec
8d1a73475e add GPU tests with minimal req. to be required (#21075) 2025-08-15 14:49:18 +02:00
GdoongMathew
39e24f5d37 Torch-Tensorrt Integration with LightningModule (#20808)
* feat: add `to_tensorrt` in the `LightningModule`.
* refactor: fix `to_tensorrt` impl
* test: add test_torch_tensorrt.py
* add dependency in test requirement
* limit the torch-tensorrt condition again
* update tensorrt version
* update tensorrt source
* update test.txt
* ci: add extra-index
* ci: use find-links instead. works on my computer...
* fix: fix bug in torch-tensorrt 2.8.0
* add find links in ci test.
* chlog

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Jirka B <j.borovec+github@gmail.com>
2025-08-15 10:50:55 +02:00
Jirka Borovec
4ff8ff7d76 chore: bump PyTorch version in dependencies & CI (#21043)
* bump PyTorch version in dependencies & CI

build(deps): update torchvision requirement in /requirements

Updates the requirements on [torchvision](https://github.com/pytorch/vision) to permit the latest version.
- [Release notes](https://github.com/pytorch/vision/releases)
- [Commits](https://github.com/pytorch/vision/compare/v0.16.0...0.23.0)

---
updated-dependencies:
- dependency-name: torchvision
  dependency-version: 0.23.0
  dependency-type: direct:production
...

* RunIf(linux_only=True)
* _xfail_gloo_windows
* FixMe
* flaky

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-08-13 09:18:30 +02:00
Jirka Borovec
f0676260ba docker: simplify the docker name with CUDA (#21001)
Co-authored-by: Bhimraj Yadav <bhimrajyadav977@gmail.com>
2025-08-11 17:26:00 +02:00
dependabot[bot]
14be476820 build(deps): bump actions/download-artifact from 4 to 5 (#21049)
* build(deps): bump actions/download-artifact from 4 to 5

Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 4 to 5.
- [Release notes](https://github.com/actions/download-artifact/releases)
- [Commits](https://github.com/actions/download-artifact/compare/v4...v5)

---
updated-dependencies:
- dependency-name: actions/download-artifact
  dependency-version: '5'
  dependency-type: direct:production
  update-type: version-update:semver-major
...

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2025-08-11 12:09:41 +02:00
dependabot[bot]
daed436539 build(deps): bump Lightning-AI/utilities from 0.15.0 to 0.15.2 (#21050)
* build(deps): bump Lightning-AI/utilities from 0.15.0 to 0.15.2

Bumps [Lightning-AI/utilities](https://github.com/lightning-ai/utilities) from 0.15.0 to 0.15.2.
- [Release notes](https://github.com/lightning-ai/utilities/releases)
- [Changelog](https://github.com/Lightning-AI/utilities/blob/main/CHANGELOG.md)
- [Commits](https://github.com/lightning-ai/utilities/compare/v0.15.0...v0.15.2)

---
updated-dependencies:
- dependency-name: Lightning-AI/utilities
  dependency-version: 0.15.2
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

* Apply suggestions from code review

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2025-08-11 11:54:32 +02:00
Jirka Borovec
20960ec3b6 docker: build images for latest PT 2.8 (#21042)
* docker: build images for latest PT `2.8`
* build release
2025-08-08 17:45:38 +02:00
Deependu
e752cb5409 nitpick: add make command to quickly setup the project on lightning studio (#20996)
* add make command to quickly setup on lightning
* Update Makefile

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2025-08-06 09:38:50 +02:00
dependabot[bot]
95ad9c25d1 build(deps): bump Lightning-AI/utilities from 0.14.3 to 0.15.0 (#21010)
* build(deps): bump Lightning-AI/utilities from 0.14.3 to 0.15.0

Bumps [Lightning-AI/utilities](https://github.com/lightning-ai/utilities) from 0.14.3 to 0.15.0.
- [Release notes](https://github.com/lightning-ai/utilities/releases)
- [Changelog](https://github.com/Lightning-AI/utilities/blob/main/CHANGELOG.md)
- [Commits](https://github.com/lightning-ai/utilities/compare/v0.14.3...v0.15.0)

---
updated-dependencies:
- dependency-name: Lightning-AI/utilities
  dependency-version: 0.15.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

* Apply suggestions from code review

* Apply suggestions from code review

* Apply suggestions from code review

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
Co-authored-by: Deependu <deependujha21@gmail.com>
2025-08-04 20:10:26 +05:30
Jirka Borovec
b36edc4325 docs: updating flaking links (#20980) 2025-07-23 17:22:34 +02:00
Deependu
4e3cf67337 fix: failing markdown link test in ci (#20979)
* fix: failing markdown link test in ci

* Apply suggestions from code review

* Update .github/markdown-links-config.json

* Update .github/markdown-links-config.json

---------

Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2025-07-14 23:58:36 +05:30
Shion Matsumoto
b087c1a553 Add dev env setup guide with uv (#20961) 2025-07-09 13:37:37 +02:00
Jirka Borovec
ca3880a2bb ci: bump sorting group-check (#20955)
* ci: debug sorting group-check

* trigger

* Empty-Commit

* Empty-Commit

* Empty-Commit

* Empty-Commit

* Empty-Commit

* Empty-Commit

* @bump/node20

* Empty-Commit

* Empty-Commit

* Empty-Commit

* Empty-Commit

* Empty-Commit

* Empty-Commit

* Empty-Commit

* dependabot/npm_and_yarn/multi-63fbbd1def

* dependabot/npm_and_yarn/brace-expansion-1.1.12

* dependabot/npm_and_yarn/octokit/webhooks-9.26.3

* dependabot/npm_and_yarn/braces-3.0.3

* dependabot/npm_and_yarn/multi-4e11cecdf1

* dependabot/npm_and_yarn/json5-1.0.2

* dependabot/npm_and_yarn/multi-3e154a7e10

* bump/5.5

* Apply suggestions from code review
2025-07-03 17:24:11 +02:00
Jirka Borovec
02ddc349f8 ci: force pip install with --upgrade-strategy=eager (#20958) 2025-07-02 12:42:05 +02:00
Jirka Borovec
33de2307fc ci: disable TPU testing (#20942) 2025-06-26 10:58:55 +02:00
Jirka Borovec
242d80ff98 test: addressing flaky spawn "process 0 terminated with signal SIGABRT" (#20933) 2025-06-24 18:16:37 +02:00
Jirka Borovec
8bdb89231a fix: update automated checkpoint messages for consistency (#20924) 2025-06-23 16:42:43 +02:00
dependabot[bot]
48fcc3099e build(deps): bump codecov/codecov-action from 4 to 5 (#20927)
Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action) from 4 to 5.
- [Release notes](https://github.com/codecov/codecov-action/releases)
- [Changelog](https://github.com/codecov/codecov-action/blob/main/CHANGELOG.md)
- [Commits](https://github.com/codecov/codecov-action/compare/v4...v5)

---
updated-dependencies:
- dependency-name: codecov/codecov-action
  dependency-version: '5'
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-06-23 08:22:59 +02:00
Jirka Borovec
c85660c0da ci: debug failing run on master (#20845) 2025-06-17 18:25:40 +02:00
Jirka Borovec
2358ed338b debugging flaky test_collective_operations with SIGABRT (#20912) 2025-06-17 15:11:28 +02:00
Jirka Borovec
26e2c2c132 fix check for flaky links in readme (#20910)
* fix check for flaky links in readme

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Update ci-check-md-links.yml

* force-check-all: "yes"

* fix & skip

* Empty-Commit

* fixing

* fixing

* fixing

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-06-16 14:34:33 +02:00
Jirka Borovec
821611b5b3 bump: PyTorch to be latest 2.7.1 (#20877) 2025-06-05 18:50:13 +02:00
Jirka Borovec
7b8ff1de2c drop deprecated dependabot reviewers (#20860) 2025-05-27 15:55:56 +02:00
Jirka Borovec
3b37c3e6eb docker: update building base docker images for last CUDA & py3.10 (#20844)
docker: extend building base docker images for last CUDA
2025-05-19 15:25:20 +02:00
Jirka Borovec
9d93d7734e ci: skip failing run on master (#20843)
* disabled
2025-05-19 14:48:11 +02:00
Jirka Borovec
bef53a8658 docker: extend building base docker images for litGPT (#20842) 2025-05-19 11:53:17 +02:00
Jirka Borovec
1b12c4bdd3 updating issue template (#20720)
* updating issue template

* Update .github/ISSUE_TEMPLATE/1_bug_report.yaml

* Update .github/ISSUE_TEMPLATE/1_bug_report.yaml

* Apply suggestions from code review

* Apply suggestions from code review

---------

Co-authored-by: Ali Alshaarawy <45029495+ali-alshaar7@users.noreply.github.com>
2025-05-08 15:17:25 +01:00
Jirka Borovec
749dc90f1b drop mergify (#20770)
* drop mergify
* label conflist
2025-04-28 14:32:35 +02:00
Jirka Borovec
383c980b0f docs: update repo link preventing flakiness (#20769) 2025-04-28 14:32:02 +02:00
Jirka Borovec
69d8fa9b95 bump: testing latest PT on GPU to 2.7 (#20754) 2025-04-25 20:29:32 +02:00
Jirka Borovec
6cf2581feb update model message (#20753)
* update model message
* nitpick_ignore
2025-04-25 19:49:44 +02:00
William Falcon
1af7f73d4b Release prep 2 2025-04-24 06:05:37 -04:00
Jirka Borovec
f01ff1403f release preparation (#20752)
setting logger's codeower
2025-04-24 06:04:11 -04:00
Jirka Borovec
c88bb407b3 ci: try to supress false failing check for create legacy checkpoint (#20746) 2025-04-22 15:09:35 +02:00
Jirka Borovec
08a04de85a ci: try to suppress false failing check for create legacy checkpoint (#20745) 2025-04-22 11:57:40 +02:00
Jirka Borovec
d57c2a3ff1 ci: bump dead ubuntu 20.04 (#20719) 2025-04-16 10:54:33 +02:00
Tom Hu
02311d03fb feat: add Codecov test results (#20694)
* feat: add Codecov test results
* linter

---------

Co-authored-by: Jirka B <j.borovec+github@gmail.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2025-04-07 13:57:07 +02:00
dependabot[bot]
5e0209ab41 build(deps): bump Lightning-AI/utilities from 0.14.2 to 0.14.3 (#20700)
* build(deps): bump Lightning-AI/utilities from 0.14.2 to 0.14.3

Bumps [Lightning-AI/utilities](https://github.com/lightning-ai/utilities) from 0.14.2 to 0.14.3.
- [Release notes](https://github.com/lightning-ai/utilities/releases)
- [Changelog](https://github.com/Lightning-AI/utilities/blob/main/CHANGELOG.md)
- [Commits](https://github.com/lightning-ai/utilities/compare/v0.14.2...v0.14.3)

---
updated-dependencies:
- dependency-name: Lightning-AI/utilities
  dependency-version: 0.14.3
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

* Apply suggestions from code review

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2025-04-07 13:41:16 +02:00
dependabot[bot]
44eb839880 build(deps): bump Lightning-AI/utilities from 0.14.1 to 0.14.2 (#20667)
* build(deps): bump Lightning-AI/utilities from 0.14.1 to 0.14.2

Bumps [Lightning-AI/utilities](https://github.com/lightning-ai/utilities) from 0.14.1 to 0.14.2.
- [Release notes](https://github.com/lightning-ai/utilities/releases)
- [Changelog](https://github.com/Lightning-AI/utilities/blob/main/CHANGELOG.md)
- [Commits](https://github.com/lightning-ai/utilities/compare/v0.14.1...v0.14.2)

---
updated-dependencies:
- dependency-name: Lightning-AI/utilities
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

* Apply suggestions from code review

---------

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Jirka Borovec <6035284+Borda@users.noreply.github.com>
2025-03-24 21:06:52 +01:00