-
Notifications
You must be signed in to change notification settings - Fork 55
Issues: NVIDIA/Fuser
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Feature request: Consider privatization instead of forwarding in fusion segmentation
Segmentation
Issues related to nvFuser Segmentation
#3832
opened Feb 5, 2025 by
naoyam
Feature request: Extend the privatization to improve segmentation
Segmentation
Issues related to nvFuser Segmentation
#3830
opened Feb 5, 2025 by
naoyam
Feature request: Fusing sibling exprs in segmentation
Segmentation
Issues related to nvFuser Segmentation
#3829
opened Feb 5, 2025 by
naoyam
Automate performant Hopper matmul
H100 Perf
improve performance on H100
Matmuls
#3819
opened Feb 4, 2025 by
jacobhinkle
2 of 22 tasks
Reduction scheduler fails to recognize iter domains not captured by reference
bug
Something isn't working
#3811
opened Feb 3, 2025 by
naoyam
Make FusionProfile object not a singleton and allow copying
#3771
opened Jan 28, 2025 by
kshitij12345
pytest benchmark reporting incorrect benchmark time
bug
Something isn't working
Python Benchmarks
#3753
opened Jan 23, 2025 by
jjsjann123
Run the IdModel exact-ness validation as part of the launch-time validation
idmodel
#3752
opened Jan 23, 2025 by
naoyam
TensorDomain::flatten should squeeze broadcast IDs as done by the usual reshape transform
#3691
opened Jan 9, 2025 by
naoyam
Optimize matrix multiplication performance using tma multicast
Matmuls
#3689
opened Jan 9, 2025 by
rdspring1
1 of 7 tasks
Inlining error in Hopper matmul with AxisMapping and grid swizzling
Matmuls
#3671
opened Jan 6, 2025 by
jacobhinkle
pad on broadcast dimensions hitting assert during transform replay
#3660
opened Dec 31, 2024 by
jjsjann123
Feature request: Extend the uop forwarding in the fusion segmenter to include other single-input trivial ops
rope
#3647
opened Dec 25, 2024 by
naoyam
Vectorization analysis returns wrong Vectorization Factor
bug
Something isn't working
#3640
opened Dec 24, 2024 by
jjsjann123
Feature request: Extend the remove broadcast + squeeze pass
rope
#3635
opened Dec 23, 2024 by
naoyam
Performance gap between manual nvfuser definition and
thunder.jit
for rmsnorm
#3629
opened Dec 20, 2024 by
Priya2698
Previous Next
ProTip!
no:milestone will show everything without a milestone.