Implementing optimized 2d rotation transform vertex. #46

Merged 1 commit on Oct 12, 2023

@balancap balancap commented Oct 11, 2023

This PR starts a collection of basic, well-optimized linalg C++ functions
that can be re-used in more complex C++ vertices.

It begins with the `axplusby` and `rotation2d` functions.
Note: still to investigate, additional optimizations to reduce the `rpt` loop
in `axplusby` from 3 cycles to 2 cycles per iteration.
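
For readers unfamiliar with the two operations, here is a minimal plain C++ sketch of what they could compute. It is not the optimized IPU code from this PR: the signatures, the `Rotated` struct, and the sign convention for the rotation are all assumptions made for illustration, and the scalar loop stands in for the hardware `rpt` loop mentioned above.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical reference version of `axplusby`:
// out[i] = a * x[i] + b * y[i] for every element.
// On the IPU this loop body is what the `rpt` loop would execute.
inline std::vector<float> axplusby(float a, float b,
                                   const std::vector<float>& x,
                                   const std::vector<float>& y) {
  assert(x.size() == y.size());
  std::vector<float> out(x.size());
  for (std::size_t i = 0; i < x.size(); ++i) {
    out[i] = a * x[i] + b * y[i];
  }
  return out;
}

// Hypothetical container for the two rotated rows.
struct Rotated {
  std::vector<float> row0;
  std::vector<float> row1;
};

// Hypothetical reference version of `rotation2d`: applies the 2x2 rotation
// [[c, -s], [s, c]] to the pair of rows (x, y), with c = cos(theta) and
// s = sin(theta). The sign convention is an assumption.
inline Rotated rotation2d(float c, float s,
                          const std::vector<float>& x,
                          const std::vector<float>& y) {
  // Each output row is one `axplusby` call over the same inputs.
  return {axplusby(c, -s, x, y), axplusby(s, c, x, y)};
}
```

Under these assumptions a 2d rotation reduces to two `axplusby` passes, so any cycle saved in the `axplusby` inner loop would carry over to `rotation2d`.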

@balancap balancap marked this pull request as draft October 11, 2023 11:16
@balancap balancap force-pushed the ipu-small-matmuls-custom-vertices branch from 5b5ad56 to b266fb1 Compare October 12, 2023 09:28
@balancap balancap force-pushed the ipu-small-matmuls-custom-vertices branch from b266fb1 to 2c90ae8 Compare October 12, 2023 09:44
@balancap balancap changed the title [WIP] Small matmuls IPU vertices. Implementing optimized 2d rotation transform vertex. Oct 12, 2023
@balancap balancap marked this pull request as ready for review October 12, 2023 09:46
@balancap balancap merged commit 5d552e0 into main Oct 12, 2023
6 checks passed
@balancap balancap deleted the ipu-small-matmuls-custom-vertices branch October 12, 2023 10:05