Skip to content

Actions: microsoft/DeepSpeed

nv-lightning-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
5,058 workflow runs
5,058 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Avoid poisoning process with CUDA calls as soon as importing
nv-lightning-v100 #13697: Pull request #6810 synchronize by loadams
December 10, 2024 23:58 6m 43s HollowMan6:pr
December 10, 2024 23:58 6m 43s
Fix: ensure deepspeed.initialize can only be called once.
nv-lightning-v100 #13696: Pull request #6848 synchronize by loadams
December 10, 2024 22:50 Action required traincheck-team:master
December 10, 2024 22:50 Action required
Fix: ensure deepspeed.initialize can only be called once.
nv-lightning-v100 #13695: Pull request #6848 opened by traincheck-team
December 10, 2024 22:24 Action required traincheck-team:master
December 10, 2024 22:24 Action required
nv-lightning-v100
nv-lightning-v100 #13694: Merge group checks requested
December 10, 2024 20:13 39m 15s
December 10, 2024 20:13 39m 15s
Fix uneven head sequence parallelism bug (#6774)
nv-lightning-v100 #13693: Pull request #6797 synchronize by loadams
December 10, 2024 19:12 6m 37s Eugene29:master
December 10, 2024 19:12 6m 37s
Add MLP/lm_head tp grain size setting.
nv-lightning-v100 #13692: Pull request #6828 synchronize by loadams
December 10, 2024 18:34 5m 54s Yejing-Lai:lyj/tp_grain_size
December 10, 2024 18:34 5m 54s
Avoid poisoning process with CUDA calls as soon as importing
nv-lightning-v100 #13691: Pull request #6810 synchronize by loadams
December 10, 2024 18:31 14m 41s HollowMan6:pr
December 10, 2024 18:31 14m 41s
nv-lightning-v100
nv-lightning-v100 #13690: Merge group checks requested
December 10, 2024 18:31 6m 43s
December 10, 2024 18:31 6m 43s
nv-lightning-v100
nv-lightning-v100 #13689: Merge group checks requested
December 10, 2024 18:07 7m 31s
December 10, 2024 18:07 7m 31s
nv-lightning-v100
nv-lightning-v100 #13688: Merge group checks requested
December 10, 2024 18:03 5m 59s
December 10, 2024 18:03 5m 59s
Use ds-specific module id to avoid conflicts
nv-lightning-v100 #13687: Pull request #6847 opened by tjruwase
December 10, 2024 16:37 11m 3s olruwase/pr_6772
December 10, 2024 16:37 11m 3s
nv-lightning-v100
nv-lightning-v100 #13686: Merge group checks requested
December 10, 2024 16:27 5m 54s
December 10, 2024 16:27 5m 54s
[inf] Add config var to enable keeping module on host
nv-lightning-v100 #13685: Pull request #6846 opened by oelayan7
December 10, 2024 12:13 6m 50s oelayan7:keep_module_on_host
December 10, 2024 12:13 6m 50s
Variable batch size and LR scheduler
nv-lightning-v100 #13684: Pull request #5237 synchronize by bm-synth
December 10, 2024 10:29 Action required bm-synth:variable_batch_size_and_lr
December 10, 2024 10:29 Action required
Fix type error in ZeROOrderedDict
nv-lightning-v100 #13681: Pull request #6794 synchronize by loadams
December 10, 2024 06:16 3m 18s oraluben:patch-1
December 10, 2024 06:16 3m 18s
nv-lightning-v100
nv-lightning-v100 #13679: Merge group checks requested
December 10, 2024 06:15 6m 42s
December 10, 2024 06:15 6m 42s
nv-lightning-v100
nv-lightning-v100 #13678: Merge group checks requested
December 10, 2024 03:19 16m 24s
December 10, 2024 03:19 16m 24s
Unpin pytest-subtests now that 0.14.1 is released
nv-lightning-v100 #13675: Pull request #6844 opened by loadams
December 10, 2024 00:37 1h 38m 3s loadams/unpin-pytest-subtests
December 10, 2024 00:37 1h 38m 3s
nv-lightning-v100
nv-lightning-v100 #13672: Scheduled
December 10, 2024 00:23 28m 53s master
December 10, 2024 00:23 28m 53s
Variable batch size and LR scheduler
nv-lightning-v100 #13670: Pull request #5237 synchronize by loadams
December 10, 2024 00:18 Action required bm-synth:variable_batch_size_and_lr
December 10, 2024 00:18 Action required
Merge LoCo with Zero++
nv-lightning-v100 #13669: Pull request #6730 synchronize by loadams
December 10, 2024 00:18 1h 8m 17s XingyuXie:LoCo-Zero++
December 10, 2024 00:18 1h 8m 17s
Fix uneven head sequence parallelism bug (#6774)
nv-lightning-v100 #13668: Pull request #6797 synchronize by loadams
December 9, 2024 22:52 1h 14m 27s Eugene29:master
December 9, 2024 22:52 1h 14m 27s
Cleanup ops/transformer/inference tests
nv-lightning-v100 #13667: Pull request #6830 synchronize by loadams
December 9, 2024 22:52 56m 50s loadams/transformers-inference
December 9, 2024 22:52 56m 50s