Skip to content

Actions: deepspeedai/DeepSpeed

nv-lightning-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,594 workflow runs
4,594 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Training multiple models
nv-lightning-v100 #14685: Pull request #7018 synchronize by tjruwase
March 6, 2025 17:06 4m 33s olruwase/zero_multi_models
March 6, 2025 17:06 4m 33s
Update gaudi2 nightly,ci to latest 1.20.0 build
nv-lightning-v100 #14683: Pull request #7093 synchronize by raza-sikander
March 6, 2025 07:43 4m 45s raza-sikander:master
March 6, 2025 07:43 4m 45s
Update gaudi2 nightly,ci to latest 1.20.0 build
nv-lightning-v100 #14681: Pull request #7093 synchronize by raza-sikander
March 6, 2025 07:37 Action required raza-sikander:master
March 6, 2025 07:37 Action required
Update gaudi2 nightly,ci to latest 1.20.0 build
nv-lightning-v100 #14680: Pull request #7093 synchronize by raza-sikander
March 6, 2025 07:36 Action required raza-sikander:master
March 6, 2025 07:36 Action required
[XPU] Support XCCL on deepspeed side
nv-lightning-v100 #14679: Pull request #7113 synchronize by ys950902
March 6, 2025 07:26 4m 27s ys950902:sy/xccl_enable
March 6, 2025 07:26 4m 27s
[XPU] Support XCCL on deepspeed side
nv-lightning-v100 #14678: Pull request #7113 opened by ys950902
March 6, 2025 07:25 Action required ys950902:sy/xccl_enable
March 6, 2025 07:25 Action required
fix keep_module_on_host
nv-lightning-v100 #14677: Pull request #7112 synchronize by inkcherry
March 6, 2025 03:02 4m 30s inkcherry:fix_keep_module_on_host
March 6, 2025 03:02 4m 30s
fix keep_module_on_host
nv-lightning-v100 #14676: Pull request #7112 opened by inkcherry
March 6, 2025 03:00 Action required inkcherry:fix_keep_module_on_host
March 6, 2025 03:00 Action required
Enable torch.autocast with ZeRO
nv-lightning-v100 #14675: Pull request #6993 synchronize by tohtana
March 6, 2025 01:48 4m 29s tohtana/support_autocast
March 6, 2025 01:48 4m 29s
nv-lightning-v100
nv-lightning-v100 #14674: Scheduled
March 6, 2025 00:22 4m 27s master
March 6, 2025 00:22 4m 27s
Update Domino for Llama3
nv-lightning-v100 #14673: Pull request #7084 synchronize by shenzheyu
March 5, 2025 22:59 5m 4s shenzheyu:master
March 5, 2025 22:59 5m 4s
Update Domino for Llama3
nv-lightning-v100 #14672: Pull request #7084 synchronize by shenzheyu
March 5, 2025 22:56 Action required shenzheyu:master
March 5, 2025 22:56 Action required
Enable python 3.11 and 3.12 tests
nv-lightning-v100 #14671: Pull request #7007 synchronize by loadams
March 5, 2025 20:12 4m 22s loadams/reenable-py311-312
March 5, 2025 20:12 4m 22s
Training multiple models
nv-lightning-v100 #14669: Pull request #7018 synchronize by tjruwase
March 5, 2025 16:08 4m 50s olruwase/zero_multi_models
March 5, 2025 16:08 4m 50s
Training multiple models
nv-lightning-v100 #14668: Pull request #7018 synchronize by tjruwase
March 5, 2025 16:08 35s olruwase/zero_multi_models
March 5, 2025 16:08 35s
Enable ZeRO set/get APIs for NVMe offload
nv-lightning-v100 #14667: Pull request #7046 synchronize by loadams
March 5, 2025 15:46 4m 29s olruwase/update_nvme_offload_states
March 5, 2025 15:46 4m 29s
Conditionally quote env vars
nv-lightning-v100 #14666: Pull request #7071 synchronize by loadams
March 5, 2025 15:45 4m 38s saurabhkoshatwar:bugfix/env_export
March 5, 2025 15:45 4m 38s
Variable batch size and LR scheduler
nv-lightning-v100 #14664: Pull request #7104 synchronize by bm-synth
March 5, 2025 09:22 4m 30s bm-synth:variable_batch_size_and_lr_2
March 5, 2025 09:22 4m 30s
Enable torch.autocast with ZeRO
nv-lightning-v100 #14663: Pull request #6993 synchronize by tohtana
March 5, 2025 02:56 4m 18s tohtana/support_autocast
March 5, 2025 02:56 4m 18s
Enable torch.autocast with ZeRO
nv-lightning-v100 #14662: Pull request #6993 synchronize by tohtana
March 5, 2025 02:55 1m 28s tohtana/support_autocast
March 5, 2025 02:55 1m 28s
Update Domino for Llama3
nv-lightning-v100 #14661: Pull request #7084 synchronize by GuanhuaWang
March 5, 2025 01:00 Action required shenzheyu:master
March 5, 2025 01:00 Action required
Update gaudi2 nightly,ci to latest 1.20.0 build
nv-lightning-v100 #14660: Pull request #7093 synchronize by loadams
March 5, 2025 00:29 Action required raza-sikander:master
March 5, 2025 00:29 Action required
nv-lightning-v100
nv-lightning-v100 #14659: Scheduled
March 5, 2025 00:21 9m 43s master
March 5, 2025 00:21 9m 43s