[SpecializeDMACode] Properly lower `compute_core_index` #109

zero9178 · 2024-08-15T10:33:53Z

Barriers may be inserted in a loop resulting from the lowering of a scf.forall. The lowering of compute_core_index unfortunately returns num_compute_cores + 1 which is outside the specified range of the operation and leads to such loops and their barriers being skipped by the DMA core. As the pass already assumes non-divergent control flow, we can fix this by specializing compute_core_index to any integer in the defined range of the op.

The op as it is currently used and specified returns the index of compute cores (in our current simulations 0 to exclusive 8) and has unspecified behavior on the DMA core. This will be used in a later PR to properly lower it when doing DMA code specialization.

Barriers may be inserted in a loop resulting from the lowering of a `scf.forall`. The lowering of `compute_core_index` unfortunately returns `num_compute_cores + 1` which is outside the specified range of the operation and leads to such loops and their barriers being skipped by the DMA core. As the pass already assumes non-divergent control flow, we can fix this by specializing `compute_core_index` to any integer in the defined range of the op.

zero9178 added 2 commits August 15, 2024 11:30

Base automatically changed from compute-core-rename to main August 15, 2024 10:46

zero9178 merged commit 03aaad4 into main Aug 15, 2024
1 check passed

zero9178 deleted the dma-compute-core-index-lowering branch August 15, 2024 10:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SpecializeDMACode] Properly lower `compute_core_index` #109

[SpecializeDMACode] Properly lower `compute_core_index` #109

zero9178 commented Aug 15, 2024

[SpecializeDMACode] Properly lower compute_core_index #109

[SpecializeDMACode] Properly lower compute_core_index #109

Conversation

zero9178 commented Aug 15, 2024

[SpecializeDMACode] Properly lower `compute_core_index` #109

[SpecializeDMACode] Properly lower `compute_core_index` #109