Actions: deepspeedai/DeepSpeed

hpu-gaudi2

1,565 workflow runs

fix keep_module_on_host
hpu-gaudi2 #1818: Pull request #7112 synchronize by inkcherry
March 6, 2025 03:02 59m 49s inkcherry:fix_keep_module_on_host
fix keep_module_on_host
hpu-gaudi2 #1817: Pull request #7112 opened by inkcherry
March 6, 2025 03:00 Action required inkcherry:fix_keep_module_on_host
Enable torch.autocast with ZeRO
hpu-gaudi2 #1816: Pull request #6993 synchronize by tohtana
March 6, 2025 01:48 1h 32m 7s tohtana/support_autocast
hpu-gaudi2
hpu-gaudi2 #1815: Scheduled
March 6, 2025 00:11 2h 10m 4s master
Update Domino for Llama3
hpu-gaudi2 #1814: Pull request #7084 synchronize by shenzheyu
March 5, 2025 22:56 Action required shenzheyu:master
Training multiple models
hpu-gaudi2 #1812: Pull request #7018 synchronize by tjruwase
March 5, 2025 16:08 1h 38m 11s olruwase/zero_multi_models
Training multiple models
hpu-gaudi2 #1811: Pull request #7018 synchronize by tjruwase
March 5, 2025 16:08 24s olruwase/zero_multi_models
Enable torch.autocast with ZeRO
hpu-gaudi2 #1807: Pull request #6993 synchronize by tohtana
March 5, 2025 02:56 57m 54s tohtana/support_autocast
Enable torch.autocast with ZeRO
hpu-gaudi2 #1806: Pull request #6993 synchronize by tohtana
March 5, 2025 02:55 1m 12s tohtana/support_autocast
Update gaudi2 nightly,ci to latest 1.20.0 build
hpu-gaudi2 #1805: Pull request #7093 synchronize by loadams
March 5, 2025 00:29 Action required raza-sikander:master
hpu-gaudi2
hpu-gaudi2 #1804: Scheduled
March 5, 2025 00:11 3m 52s master
Improve overflow handling in ZeRO
hpu-gaudi2 #1802: Pull request #6976 synchronize by tjruwase
March 4, 2025 23:20 5m 12s olruwase/ds_5241
Variable batch size and LR scheduler
hpu-gaudi2 #1800: Pull request #7104 synchronize by bm-synth
March 4, 2025 20:09 Action required bm-synth:variable_batch_size_and_lr_2
Variable batch size and LR scheduler
hpu-gaudi2 #1798: Pull request #7104 synchronize by bm-synth
March 4, 2025 16:52 Action required bm-synth:variable_batch_size_and_lr_2
Improve overflow handling in ZeRO
hpu-gaudi2 #1797: Pull request #6976 synchronize by tjruwase
March 4, 2025 15:36 1h 8m 18s olruwase/ds_5241
Enable ZeRO set/get APIs for NVMe offload
hpu-gaudi2 #1796: Pull request #7046 synchronize by tjruwase
March 4, 2025 11:27 3h 13m 52s olruwase/update_nvme_offload_states
Training multiple models
hpu-gaudi2 #1795: Pull request #7018 synchronize by tjruwase
March 4, 2025 11:27 2h 18m 52s olruwase/zero_multi_models