[PyTorch] Add contiguous check for `te_grouped_gemm` #1146

BeingGod · 2024-08-29T07:13:16Z

Description

The inputs and outputs of `te_grouped_gemm` are of type `std::vector<at::Tensor>`. If any of these tensors is non-contiguous, the result can be silently inaccurate, which is hard to debug. This PR adds a safety check that every tensor is contiguous.
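
For illustration only, here is a minimal Python-side sketch of this kind of check. It is not the code added by this PR, which operates on the `std::vector<at::Tensor>` arguments; the helper name `check_all_contiguous` is hypothetical, and the inputs are assumed to be plain lists of `torch.Tensor`.

```python
import torch


def check_all_contiguous(tensors, name):
    """Raise if any tensor in the list is non-contiguous. No copies are made."""
    for i, t in enumerate(tensors):
        if not t.is_contiguous():
            raise ValueError(
                f"{name}[{i}] is not contiguous "
                f"(shape={tuple(t.shape)}, stride={t.stride()})"
            )


a = torch.randn(4, 8)        # freshly allocated, contiguous
b = torch.randn(8, 4).t()    # transposed view, non-contiguous

check_all_contiguous([a], "A")   # passes silently

try:
    check_all_contiguous([a, b], "A")
    failed = False
    message = ""
except ValueError as err:
    failed = True
    message = str(err)
```

Failing early with the index and strides of the offending tensor points the caller at the exact input that needs fixing, instead of leaving them to trace a wrong result back through the GEMM.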

Why use `is_contiguous` instead of `contiguous`?
If a tensor is non-contiguous, `contiguous` calls `aten::clone` to create a new tensor. This can cause a performance regression whose cause is hard for users to find (they may need to profile).
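
The difference can be observed from Python. The sketch below assumes CPU tensors and uses `data_ptr()` only to show whether new memory was allocated; the variable names are illustrative.

```python
import torch

x = torch.arange(12).reshape(3, 4)   # contiguous
view = x.t()                         # non-contiguous view sharing x's memory

# is_contiguous() only inspects shape and strides; nothing is allocated.
view_is_contiguous = view.is_contiguous()

# contiguous() on a non-contiguous tensor materializes a copy in new memory.
copied = view.contiguous()
copy_allocated = copied.data_ptr() != view.data_ptr()

# contiguous() on an already contiguous tensor does not copy.
same = x.contiguous()
no_copy_when_contiguous = same.data_ptr() == x.data_ptr()
```

Calling `contiguous` unconditionally would therefore hide the extra copy inside the operator, whereas an `is_contiguous` check makes the caller aware of the layout problem.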

Fixes # (issue)

Type of change

Documentation change (change only to the documentation, either a fix or a new content)
Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Infra/Build change
Code refactor

Changes

Please list the changes introduced in this PR:

Add contiguous check for te_grouped_gemm

Checklist:

I have read and followed the contributing guidelines
The functionality is complete
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

timmoon10

This is a good idea. Can you sign your commit to pass the DCO check?

Signed-off-by: beinggod <[email protected]>


BeingGod · 2024-08-30T02:59:27Z

> This is a good idea. Can you sign your commit to pass the DCO check?

Done.

timmoon10 · 2024-08-30T22:31:49Z

/te-ci pytorch


timmoon10 approved these changes Aug 29, 2024


ksivaman approved these changes Aug 29, 2024


[PyTorch] Add contiguous check for grouped gemm

4862d27

Signed-off-by: beinggod <[email protected]>

BeingGod force-pushed the dev/zhangrb/add_contiguous_check_for_grouped_gemm branch from 88fbcb8 to 4862d27 Compare August 30, 2024 02:58

Merge branch 'main' into dev/zhangrb/add_contiguous_check_for_grouped…

226709f

…_gemm

Merge branch 'main' into dev/zhangrb/add_contiguous_check_for_grouped…

17802b4

…_gemm

ksivaman merged commit ddc5774 into NVIDIA:main Sep 3, 2024
13 of 14 checks passed
