[Transform API] Simple tiling can be tedious to implement #666
Comments
Some options could be:
For posterity, the resulting kernel is this:
and after reduction realization, it becomes
I guess my first question here is what you would like to happen. Duplicating
I would first privatize the temporary
Consider the simple batched matvec example:
This results in a linearization error with:
Notice how the duplication options force us to realize the reduction, which makes this simple transformation quite tedious to implement. We should have some interface that makes this transformation easier, since (at least in the context of einsums) it is a pretty common one.
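For readers without the kernel at hand, here is a rough sketch in plain Python of what tiling a batched matvec amounts to at the loop level. It is not the kernel from this issue and does not use the transform API; the function names, the `tile` parameter, and the sample data are all hypothetical, chosen only for illustration.

```python
def batched_matvec(a, x):
    """Reference: out[b][i] = sum_j a[b][i][j] * x[b][j]."""
    return [
        [sum(row[j] * xb[j] for j in range(len(xb))) for row in ab]
        for ab, xb in zip(a, x)
    ]


def batched_matvec_tiled(a, x, tile=2):
    """Same computation with the i and j loops split into tiles.

    Hypothetical illustration: `tile` is an assumed tile size.
    """
    out = []
    for ab, xb in zip(a, x):
        n_i, n_j = len(ab), len(xb)
        out_b = [0] * n_i
        for i_outer in range(0, n_i, tile):
            for j_outer in range(0, n_j, tile):
                for i in range(i_outer, min(i_outer + tile, n_i)):
                    # The sum over j is written out as an explicit
                    # accumulator, one partial sum per (i, j-tile).
                    acc = 0
                    for j in range(j_outer, min(j_outer + tile, n_j)):
                        acc += ab[i][j] * xb[j]
                    out_b[i] += acc
        out.append(out_b)
    return out


# Small integer inputs so both versions agree exactly.
a = [[[1, 2, 3], [4, 5, 6]], [[7, 8, 9], [1, 0, 1]]]
x = [[1, 1, 1], [2, 0, 1]]
reference = batched_matvec(a, x)
tiled = batched_matvec_tiled(a, x, tile=2)
```

In the tiled version the reduction over `j` can no longer stay a single `sum(...)` expression and has to be spelled out as an accumulator. That is loosely analogous to the forced reduction realization described above, though the correspondence to the library's behavior is an assumption.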