Actions: microsoft/DeepSpeed

nv-lightning-v100

5,271 workflow runs

Change compile for pipeline module torch.compile
nv-lightning-v100 #13842: Pull request #6478 synchronize by loadams
December 20, 2024 05:14 Action required NirSonnenschein:torch_compile_micro_offset_fix
Fix error caused by all_reduce call in domino
nv-lightning-v100 #13841: Pull request #6880 synchronize by tjruwase
December 20, 2024 02:22 1h 52m 55s hongwei/fix_domino_allreduce
Change compile for pipeline module torch.compile
nv-lightning-v100 #13840: Pull request #6478 synchronize by loadams
December 20, 2024 00:56 1h 47m 34s NirSonnenschein:torch_compile_micro_offset_fix
Fix checkpointable_layers Logic
nv-lightning-v100 #13839: Pull request #6881 synchronize by loadams
December 20, 2024 00:55 1h 19m 46s Quentin-Anthony:qanthony/fix-act-recomp
Stage3: Use new torch grad accumulation hooks API
nv-lightning-v100 #13838: Pull request #6773 synchronize by loadams
December 20, 2024 00:55 18m 33s deepcharm:stage3-use-new-grad-acc-api
nv-lightning-v100
nv-lightning-v100 #13837: Scheduled
December 20, 2024 00:21 38m 27s master
Fix error caused by all_reduce call in domino
nv-lightning-v100 #13836: Pull request #6880 synchronize by loadams
December 19, 2024 23:23 26m 29s hongwei/fix_domino_allreduce
Fix checkpointable_layers Logic
nv-lightning-v100 #13835: Pull request #6881 synchronize by loadams
December 19, 2024 20:32 5m 58s Quentin-Anthony:qanthony/fix-act-recomp
Stage3: Use new torch grad accumulation hooks API
nv-lightning-v100 #13834: Pull request #6773 synchronize by loadams
December 19, 2024 20:32 6m 4s deepcharm:stage3-use-new-grad-acc-api
Add the missing view operations from sequence parallel(async).
nv-lightning-v100 #13832: Pull request #6750 synchronize by loadams
December 19, 2024 17:39 Action required inkcherry:ds_overlap_fix
Zero2: avoid graph breaks in torch.compile by using param_idx
nv-lightning-v100 #13831: Pull request #6803 synchronize by loadams
December 19, 2024 17:36 6m 23s nelyahu:zero2_param_idx
Cleanup ops/transformer/inference tests
nv-lightning-v100 #13829: Pull request #6830 synchronize by loadams
December 19, 2024 17:32 6m 5s loadams/transformers-inference
Cleanup ops/transformer/inference tests
nv-lightning-v100 #13828: Pull request #6830 synchronize by loadams
December 19, 2024 17:27 5m 10s loadams/transformers-inference
Cleanup ops/transformer/inference tests
nv-lightning-v100 #13827: Pull request #6830 synchronize by loadams
December 19, 2024 17:25 2m 42s loadams/transformers-inference
hpu_accelerator: use torch.use_deterministic_algorithms
nv-lightning-v100 #13824: Pull request #6897 opened by nelyahu
December 19, 2024 07:23 3m 11s nelyahu:patch-2
nv-lightning-v100
nv-lightning-v100 #13823: Scheduled
December 19, 2024 00:22 1h 47m 16s master
Allow to compile collective for PT > 2.3
nv-lightning-v100 #13822: Pull request #6674 reopened by loadams
December 18, 2024 21:53 2h 31m 55s nelyahu:compile_collectives
Allow to compile collective for PT > 2.3
nv-lightning-v100 #13821: Pull request #6674 synchronize by loadams
December 18, 2024 21:07 39m 26s nelyahu:compile_collectives
Copy #6674: Allow to compile collective for PT > 2.3
nv-lightning-v100 #13820: Pull request #6894 opened by loadams
December 18, 2024 21:01 3h 17m 48s loadams/test-compile-collectives
Fix checkpointable_layers Logic
nv-lightning-v100 #13819: Pull request #6881 synchronize by Quentin-Anthony
December 18, 2024 20:25 2h 2m 14s Quentin-Anthony:qanthony/fix-act-recomp
Fix checkpointable_layers Logic
nv-lightning-v100 #13818: Pull request #6881 synchronize by Quentin-Anthony
December 18, 2024 20:24 Action required Quentin-Anthony:qanthony/fix-act-recomp