Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Ampere bfloat-float benchmark #67

Merged

Conversation

aacostadiaz
Copy link
Collaborator

@aacostadiaz aacostadiaz commented May 20, 2024

This PR adds a GEMM example for ampere where the input is bfloat and the output is float.

@aacostadiaz aacostadiaz added the incremental Incremental changes label May 20, 2024
@aacostadiaz aacostadiaz force-pushed the aacosta/nvidia-bfloat-demo branch from 7180fae to fc7e60b Compare May 27, 2024 16:05
@aacostadiaz aacostadiaz changed the title Add Ampere bfloat-float example Add Ampere bfloat-float benchmark May 27, 2024
@aacostadiaz aacostadiaz force-pushed the aacosta/nvidia-bfloat-demo branch 4 times, most recently from 86fba07 to 1953484 Compare May 29, 2024 16:51
@aacostadiaz aacostadiaz removed the incremental Incremental changes label May 30, 2024
@aacostadiaz aacostadiaz force-pushed the aacosta/nvidia-bfloat-demo branch from 1953484 to cb9fe72 Compare May 31, 2024 09:53
@aacostadiaz aacostadiaz force-pushed the aacosta/nvidia-bfloat-demo branch from cb9fe72 to 74cbb6d Compare May 31, 2024 09:53
@mehdi-goli mehdi-goli merged commit 2acd549 into codeplaysoftware:sycl-develop May 31, 2024
3 checks passed
aacostadiaz added a commit that referenced this pull request Jul 16, 2024
* Add generic example runner

* Init d and ref_d with different values

* Move runner to benchmark folder

* Add generic example runner

* Add Ampere half-float example

* Update benchmarks/CMakeLists.txt

Co-authored-by: Mehdi Goli <[email protected]>

* Add Ampere half-float example

* Add Ampere half-float example

* Add Ampere half-float example

* Add Ampere bfloat-float example

---------

Co-authored-by: Mehdi Goli <[email protected]>
aacostadiaz added a commit to aacostadiaz/cutlass-fork that referenced this pull request Aug 6, 2024
* Add generic example runner

* Init d and ref_d with different values

* Move runner to benchmark folder

* Add generic example runner

* Add Ampere half-float example

* Update benchmarks/CMakeLists.txt

Co-authored-by: Mehdi Goli <[email protected]>

* Add Ampere half-float example

* Add Ampere half-float example

* Add Ampere half-float example

* Add Ampere bfloat-float example

---------

Co-authored-by: Mehdi Goli <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants