You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I run my code using the gradients_memory() method, which indeed works.
I then try to get the bottleneck tensors it found, by taking the "Checkpoint nodes used" list. I add them manually to a "checkpoints" collection, and then use gradients_collection(). Then, I get OOM.
I don't understand why that is so - if I take the same tensors, shouldn't the behaviour be the same?
The text was updated successfully, but these errors were encountered:
Yes, behavior should be the same. If you notice the code after printing "Checkpoint nodes used" doesn't know whether checkpoints variable was specified manually, or automatically. So I would double-check that Checkpoint nodes used message prints the same message in both cases and debug from there
First, thanks very much for this contribution!
I don't understand why that is so - if I take the same tensors, shouldn't the behaviour be the same?
The text was updated successfully, but these errors were encountered: