[DT][NFC] Internalize transposeNarrowN logic to LayoutAttrInterface Impl #19453

hanhanW · 2024-12-11T07:00:52Z

Whether applying transposition from narrow-N to narrow-M is backend
implementation details, and we do not need to expose it to the type
converter. The encoding itself has enough information, like indexing
maps, narrow dimensions, etc., to infer the shapes and encoding info.
Instead of updating the RankedTensorType and the attached encoding in
type converter, we can just cook the logic in getEncodingInfo methods.
From the encoding, we know that whether it is narrow-N case, and we can
update the MaterializeEncodingInfo correspondingly. The type converter
can infer the transposed tensor type from it. Thus, we can simplify the
logic in the type conversion.

The documentation of transposeNarrowN is moved to
[CPU|GPU]EncodingExternalModels.cpp because all the implementation
locates at the files.

hanhanW · 2024-12-11T07:03:54Z

The PR depends on #19452. It is ready for review.

@bjacob please take a look at my documentation of transposeNarrowN. I moved them [CPU|GPU]EncodingExternalModels.cpp and borrow some words from you. Thanks!

Whether apply transposion from narrow-N to narrow-M is backend implementation details, and we do not need to expose it to the type converter. The encoding itself has enough information, like indexing maps, narrow dimensions, etc., to infer the shapes and encoding info. Instead of updating the RankedTensorTypes and the attached encoding in type converter, we can just cook the logic in `getEncodingInfo` methods. From the encoding, we know that whether it is narrow-N case, and we can update the MaterializeEncodingInfo correspondingly. The type converter can infer the transposed tensor type from it. Thus, we can simplify the logic in the type conversion. The documentation of `transposeNarrowN` is moved to [CPU|GPU]EncodingExternalModels.cpp because all the implementaion locates at the files. Signed-off-by: hanhanW <[email protected]>

hanhanW requested review from bjacob and lialan December 11, 2024 07:03

hanhanW marked this pull request as ready for review December 11, 2024 07:03

hanhanW requested review from antiagainst, qedawkins and MaheshRavishankar as code owners December 11, 2024 07:03

hanhanW force-pushed the data-tiling-cleanups-narrow-n branch from 75e6979 to 00c3e8e Compare December 11, 2024 07:05

hanhanW mentioned this pull request Dec 11, 2024

[DT] Unify encoding materialization pass into a single pass. #19454

Open

hanhanW changed the base branch from users/hanhanW/data-tiling-cleanups-3-upstream-diff to main December 13, 2024 04:45

hanhanW force-pushed the data-tiling-cleanups-narrow-n branch from 00c3e8e to dd92fd3 Compare December 13, 2024 04:46

hanhanW requested review from Max191 and removed request for antiagainst and qedawkins December 13, 2024 04:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DT][NFC] Internalize transposeNarrowN logic to LayoutAttrInterface Impl #19453

[DT][NFC] Internalize transposeNarrowN logic to LayoutAttrInterface Impl #19453

hanhanW commented Dec 11, 2024 •

edited

Loading

hanhanW commented Dec 11, 2024 •

edited

Loading

[DT][NFC] Internalize transposeNarrowN logic to LayoutAttrInterface Impl #19453

Are you sure you want to change the base?

[DT][NFC] Internalize transposeNarrowN logic to LayoutAttrInterface Impl #19453

Conversation

hanhanW commented Dec 11, 2024 • edited Loading

hanhanW commented Dec 11, 2024 • edited Loading

hanhanW commented Dec 11, 2024 •

edited

Loading

hanhanW commented Dec 11, 2024 •

edited

Loading