-
Notifications
You must be signed in to change notification settings - Fork 13.4k
metal : initial Metal4 tensor API support #16634
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
|
Any early performance data? |
6271c44 to
6726e53
Compare
|
@jeffbolznv I think the performance using the tensor API is the same as the old simdgroup-based implementation, but I haven't done detailed analysis yet. I don't have hardware yet to test the actual Neural Accelerators that exist in the new chips and if they would be utilized with these changes. |
6726e53 to
57fa815
Compare
|
Looking for volunteers with iPhone 17 or MacBook M5 for testing |
I have an iPhone 17, how can I help? |
TODOs
mul_mm_idkernel