Skip to content

Conversation

@fischeti
Copy link

This PR adds another opgroup SHFL for SIMD shuffling instructions

Operations

Currently only two operations are supported:

  • SHUFFLE: Shuffle the input vector in operand_i[0] with the mask in operand_i[1]
  • SHUFFLE2: Shuffle the input vectors in operand_i[0] and operand_i[2] with the mask in operand_i[1]

The SHUFFLE2 operation is selected by setting the op_mod_i flag.

Verification

The unit was integrated and tested in the snitch cluster for all SIMD formats (FP8, FP16, FP32).

Evaluation

Synthesis results of the snitch cluster showed an overhead of ~3kGE/FPU with all formats enabled. The critical path is not affected, even if no pipeline registers are configured.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants