You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Disabling dataflow-based fusion logic introduced a substantial performance regression. In the models we are tracking, we can see an increase in memory usage.
This will print the peak memory consumption for both runs, on H100 this prints ~71GB and ~67GB after reverting.
Additional info
With this issue, I am not looking to revert the commit, but to track the process of bringing back the dataflow fusion and taking advantage of the moment to brainstorm a bit on it and improve it from the state that it was before being deleted.
IvanYashchuk
changed the title
High memory consumption without fusion rematerialization
High memory consumption without dataflow-based fusion
Feb 19, 2025
⏱️ Perf regression
Disabling dataflow-based fusion logic introduced a substantial performance regression. In the models we are tracking, we can see an increase in memory usage.
To Reproduce
There are a few steps to follow:
This will print the peak memory consumption for both runs, on H100 this prints ~71GB and ~67GB after reverting.
Additional info
With this issue, I am not looking to revert the commit, but to track the process of bringing back the dataflow fusion and taking advantage of the moment to brainstorm a bit on it and improve it from the state that it was before being deleted.
Mentioning PR for tracking #1763
cc @riccardofelluga
The text was updated successfully, but these errors were encountered: