Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[GPU] Fix accuracy issue in PagedAttention kernel for large prompts (…
…4K/8K tokens) by correcting index calculation in sub_group_broadcast function to ensure accurate data broadcasting within the subgroup
- Loading branch information