Skip to content

Conversation

rishi-yadav
Copy link

@rishi-yadav rishi-yadav commented Oct 8, 2025

Intel XE DPAS Tests - Combined Data Types

Test Category Data Types M Dimensions Template Pattern Tile Configurations
MMA S8/U8 (8 tests) 8×16, 4×16, 2×16, 1×16 XE_DPAS_TT<M, int32_t, int8_t/uint8_t> 64×64×8×16×32 to 8×64×1×16×32
MMA BF16 (4 tests) 8×16, 4×16, 2×16, 1×16 XE_DPAS_TT<M, float, bfloat16_t> 256×256×32×64×32 to 8×64×1×16×16
MMA F16 (4 tests) 8×16, 4×16, 2×16, 1×16 XE_DPAS_TT<M, float, half_t> 64×64×8×16×16 to 8×64×1×16×16
MMA TF32 (4 tests) 8×16, 4×16, 2×16, 1×16 XE_DPAS_TT<M, float, tfloat32_t> 64×64×8×16×32 to 64×64×8×16×16
TiledMMA BF16 (3 tests) 8×16 (all) XE_DPAS_TT<8, float, bfloat16_t> 256×256×32, 128×64×32 (various SG layouts)

Summary by Category

  • Integer Types (S8/U8): 8 tests
  • Floating Point (BF16/F16/TF32): 12 tests
  • Template Validation (TiledMMA): 3 tests

Total: 23 tests PASSED - Complete Intel XE DPAS modern API coverage across 5 data types and multiple tiling strategies.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants