[GPU] Optimized operations in the blas kernels with the latest buffer changes.

    Updated the pipeline for both fp32 and fp16.
    SwiGLU, RmsNorm and Concat ops updated.

        **Self evaluation:**
        1. Build test:   [X]Passed [ ]Failed [ ]Skipped
        2. Run test:     [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Niket Agarwal <[email protected]>
niket-agarwal committed Jan 7, 2025
1 parent 32621ff commit f59f070
Showing 5 changed files with 246 additions and 213 deletions.