[PyTorch] Add support for cuDNN FusedAttention + THD + CP #885
Commits on May 31, 2024
- add seq_offsets_qkvo for cudnn thd
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: d9ec133
- add seq_offsets_qkvo to AttnFuncWithCP
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: 9211a7c
- fix seq_offsets calculation of cudnn thd
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: 541268b
- Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: eb3f66d
- Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: 9ac207a
- add thd test for cudnn FA with CP
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: 4239df9
- skip GQA/MQA test for cuDNN THD
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: 9b309ab
- make sure seq_offsets are computed with qkv_group of hd_hd_hd while CP>1
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: 4891fac
Commits on Jun 1, 2024
- Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: e886136
- SHA: 6ae7384
- fix attn mask type for cudnn thd with cp
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: 4a38798
- Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: a8807e6
- fix attn_mask_type for cudnn fa with thd
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: 5a6a0a5
Commits on Jun 2, 2024
- SHA: d657950
Commits on Jun 3, 2024
- SHA: 3b8780c
- assert cudnn+thd does not support attn bias
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: b3997ce
- check if attn_mask_type has padding
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: e7b9ea7
- SHA: df2e164
- SHA: 12cd072
Commits on Jun 4, 2024
- change cp test batch size to 2
  Signed-off-by: Xiaowei Ren <[email protected]>
  SHA: cc3eb4f
- SHA: 442f347
Commits on Jun 5, 2024
- SHA: 70042d2
- SHA: d37b399
- SHA: 5302419
- SHA: a6c620d
- Merge branch 'xren/cp_thd' of github.com:xrennvidia/TransformerEngine into xren/cp_thd
  SHA: a98abd4
- SHA: dfe2f03
Commits on Jun 6, 2024
- SHA: a8cebd7
- SHA: f22bc19
Commits on Jun 7, 2024
- SHA: b13e3aa