Actions: NVIDIA/TransformerEngine

Deploy nightly docs

535 workflow runs

Restore compatibility with Python 3.8 (#1189)
Deploy nightly docs #642: Commit 0c74535 pushed by ptrendx
September 20, 2024 23:05 1m 4s main
Allow downloading of model weights automatically (#1172)
Deploy nightly docs #641: Commit 195d703 pushed by ptrendx
September 20, 2024 20:38 1m 13s main
[PyTorch] Relax the contiguous check for flash attention (#1176)
Deploy nightly docs #640: Commit 0ee5ccd pushed by cyanguwa
September 19, 2024 00:55 1m 22s main
Expose rotary_base as an arg instead of hardcoding (#944)
Deploy nightly docs #639: Commit c0caadb pushed by ptrendx
September 18, 2024 22:31 2m 7s main
[PyTorch] Check network interface name when initializing Userbuffers …
Deploy nightly docs #638: Commit 841634c pushed by denera
September 18, 2024 18:09 1m 8s main
[PyTorch] Port fused optimizer tests to pytest (#1185)
Deploy nightly docs #637: Commit 7e1068b pushed by timmoon10
September 18, 2024 01:22 1m 8s main
Add docs for installing from PyPI (#1184)
Deploy nightly docs #636: Commit eb60b1a pushed by ksivaman
September 17, 2024 19:34 1m 1s main
Allow specifying cmake setup directory (#1186)
Deploy nightly docs #635: Commit 28f95bd pushed by timmoon10
September 17, 2024 18:59 1m 6s main
Changed VERSION to 1.12.0.dev
Deploy nightly docs #634: Commit 528d44b pushed by ptrendx
September 17, 2024 17:19 1m 11s main
[Common] Default CUDA_HOME to /usr/local/cuda when dynamically loadin…
Deploy nightly docs #633: Commit 44fd316 pushed by denera
September 17, 2024 14:58 1m 10s main
[JAX] Context Parallel Attention with All-Gather (#1106)
Deploy nightly docs #632: Commit 9101a78 pushed by mgoldfarb-nvidia
September 17, 2024 14:26 1m 8s main
Update CI users (#1181)
Deploy nightly docs #631: Commit d2d4cf9 pushed by timmoon10
September 16, 2024 20:46 1m 13s main
Add dtensor support for TE optimizers (#1171)
Deploy nightly docs #630: Commit af5daa0 pushed by ksivaman
September 16, 2024 17:08 1m 28s main
[JAX] Fix unit tests to work around cuDNN 9.4 regression of 0 length…
Deploy nightly docs #629: Commit df69965 pushed by denera
September 16, 2024 15:06 1m 18s main
Update CI users (#1180)
Deploy nightly docs #628: Commit c55007b pushed by timmoon10
September 11, 2024 19:13 21s main
[PyTorch] Lower atol/rtol for F16 attention tests (#1157)
Deploy nightly docs #627: Commit e6e0603 pushed by ksivaman
September 11, 2024 14:36 1m 22s main
[PyTorch] Proxy class for low-precision tensor (#1127)
Deploy nightly docs #626: Commit 2d57db8 pushed by ksivaman
September 11, 2024 13:12 1m 3s main
Add a context parallelism implementation with QKVO all-to-all (#1160)
Deploy nightly docs #625: Commit 40dda92 pushed by cyanguwa
September 9, 2024 20:50 1m 13s main
Added Adobe analytics to the documentation (#1162)
Deploy nightly docs #624: Commit 2a9845e pushed by ptrendx
September 9, 2024 18:34 1m 4s main
[PyTorch] Propagate fp8 scale-inverse modification to GroupedLinear
Deploy nightly docs #623: Commit 047a507 pushed by ksivaman
September 9, 2024 14:30 1m 6s main
Revert "[C] Suppress 128-D warning from cudnn-frontend" (#1161)
Deploy nightly docs #622: Commit bdea56f pushed by ksivaman
September 5, 2024 18:40 1m 14s main
[C] Suppress 128-D warning from cudnn-frontend (#1158)
Deploy nightly docs #621: Commit 206c1d9 pushed by ksivaman
September 5, 2024 18:21 2m 5s main
[PyTorch] Implement Fp8 padding and unpadding module (#1129)
Deploy nightly docs #620: Commit 215db88 pushed by phu0ngng
September 5, 2024 18:03 1m 1s main
Added offloading support FP8 attention (#1131)
Deploy nightly docs #619: Commit 454e389 pushed by ksivaman
September 5, 2024 17:54 1m 29s main
[PyTorch] FP8 MHA with RoPE and Miscellaneous Improvements (#1100)
Deploy nightly docs #618: Commit 5fafeb0 pushed by yaox12
September 5, 2024 05:57 1m 10s main