
Support blocked dot operand layout conversion to linear layout #5423

Closed · binarman opened this issue Dec 13, 2024 · 8 comments · Fixed by #5469

Comments

@binarman (Contributor) commented:
Goal

Support FMA/Blocked dot operand layout in linear layout converter.

DoD

The LinearLayout converter is implemented, the related tests are implemented and passing, and switching to the Linear Layout converter does not break the Python tests.

Existing Linear Layout converters for dot operands

Nvidia MMA dot operand
AMD MFMA dot operand

Both of these converters are fully functional, but I want to refactor the MFMA and WMMA converters to follow the Nvidia MMA style soon. Please try to follow the MMA style.

In-progress PRs

WMMA dot operand

Legacy SharedLayout->dotOperand converter examples

Implementation details

  1. Implement a linear layout converter similar to the WMMA/MFMA ones (I am going to refactor those converters a little soon) in LinearLayoutConversions.cpp; add the appropriate call in the DotOperandEncodingAttr::toLinearLayout function (a rough sketch follows this list).
  2. Add the blocked layout to MemoryOpToLLVM.cpp:isSupportedDotOpLayout to enable the shared->blocked dotOperand conversion with LL; check that all test_core.py tests still pass.
  3. Enable the LL converter in ConvertLayoutOpToLLVM.cpp:transferWithinBlock to enable conversion of the ttg.convert_layout operation with LL. You may need to rework some other places as well; we will clarify this during implementation.
  4. Implement ctest tests in LinearLayoutConversionsTest.cpp and a few lit tests to specifically verify conversion of the ttg.convert_layout operation.
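
For step 1, here is a minimal, hypothetical sketch of what such a converter could look like. The function name fmaDotOperandAToLinearLayout, the restriction to operand A of a 2-D dot, and the way the per-dimension layouts are combined are assumptions made for illustration; the actual converter is the one landing in #5469 and also handles broadcasting, the CTA/CGA layout, operand B, and output-dimension ordering.

```cpp
// Hypothetical sketch only; relies on LinearLayout::identity1D / zeros1D and
// operator* from triton/Tools/LinearLayout.h as used by the existing
// MMA/MFMA converters.
#include "mlir/IR/BuiltinAttributes.h"
#include "triton/Dialect/TritonGPU/IR/Dialect.h"
#include "triton/Tools/LinearLayout.h"

using namespace mlir;
using namespace mlir::triton;
using namespace mlir::triton::gpu;

// Assumes power-of-two sizes and a shape that exactly matches one CTA tile.
LinearLayout fmaDotOperandAToLinearLayout(DotOperandEncodingAttr dotOp,
                                          ArrayRef<int64_t> shape) {
  MLIRContext *ctx = dotOp.getContext();
  auto blocked = mlir::cast<BlockedEncodingAttr>(dotOp.getParent());
  auto sizePerThread = blocked.getSizePerThread();
  auto threadsPerWarp = blocked.getThreadsPerWarp();
  auto warpsPerCTA = blocked.getWarpsPerCTA();

  StringAttr kRegister = StringAttr::get(ctx, "register");
  StringAttr kLane = StringAttr::get(ctx, "lane");
  StringAttr kWarp = StringAttr::get(ctx, "warp");
  StringAttr dim0 = StringAttr::get(ctx, "dim0"); // M
  StringAttr dim1 = StringAttr::get(ctx, "dim1"); // K for operand A

  // K dimension: every thread holds the whole K extent in registers, while
  // lanes and warps are broadcast along K (zero bases), which is what the FMA
  // dot expects from its operands.
  LinearLayout kLayout =
      LinearLayout::identity1D(shape[1], kRegister, dim1) *
      LinearLayout::zeros1D(threadsPerWarp[1], kLane, dim1) *
      LinearLayout::zeros1D(warpsPerCTA[1], kWarp, dim1);

  // Non-K dimension: reuse the parent blocked layout's distribution of
  // registers, lanes, and warps along dim 0.
  LinearLayout mLayout =
      LinearLayout::identity1D(sizePerThread[0], kRegister, dim0) *
      LinearLayout::identity1D(threadsPerWarp[0], kLane, dim0) *
      LinearLayout::identity1D(warpsPerCTA[0], kWarp, dim0);

  // The real implementation must also pick the product order from
  // blocked.getOrder(), replicate registers when the shape is larger than one
  // CTA tile, and transpose the output dims into the canonical order.
  return kLayout * mLayout;
}
```

The essential property this encodes is that an FMA/blocked dot operand keeps the full K extent in each thread's registers; in linear-layout terms, that means identity bases on the register input dimension along K and zero (broadcast) bases for lanes and warps.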
@binarman (Contributor, Author) commented:

+cc @simonidaa
Please consider this task next.

@Jokeren (Contributor) commented Dec 13, 2024:

+@lezcano Maybe you can offload this to AMD? Or is there something you've already done but not pushed yet?

@Jokeren (Contributor) commented Dec 13, 2024:

@binarman btw, on a related note: we plan to completely abandon the getCSwizzleOffset method and instead use allocShape in the memory descriptor to determine whether prefetch has occurred.
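
A minimal sketch of what that check could look like, assuming the shared-memory descriptor type exposes getShape() and getAllocShape() accessors; the accessor names, namespace, and header path are assumptions and may differ between Triton versions.

```cpp
#include "triton/Dialect/TritonGPU/IR/Dialect.h"

// Hypothetical helper: a prefetched operand is a slice of a larger allocation,
// so its view shape is smaller than the allocation shape recorded in the
// descriptor. When the two differ, the lowering must offset into the bigger
// buffer instead of relying on getCSwizzleOffset.
static bool wasSlicedForPrefetch(mlir::triton::gpu::MemDescType memDesc) {
  return memDesc.getShape() != memDesc.getAllocShape();
}
```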

@binarman (Contributor, Author) commented:

Thank you @Jokeren! This is good to know; I will consider removing getCSwizzleOffset from our converters.

@minjang (Contributor) commented Dec 13, 2024:

Very good to hear about this work!

(For my experimental project that uses LL for non-GPUs, yeah, this was a missing feature. I hacked around it badly just to make the unit tests pass, but I'm looking forward to seeing this support.)

@lezcano (Contributor) commented Dec 13, 2024:

I have not started on this, so feel free to take over.

@binarman (Contributor, Author) commented:

@lezcano told me this task is blocking him, so I am going to implement it asap.
@simonidaa I will look for something else for you that is still related to LL.

@binarman (Contributor, Author) commented:

I've made a draft PR: #5469
Along the way I realized that the current FMA implementation does not align with the rest of the layouts, so I decided to rework it. Some tests are currently failing; I will probably finish this work tomorrow.
