You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
hi, I have found the reason in the paper "A Survey on Dropout Methods and Experimental Verification in Recommendation“, which said " To keep training and testing conditions consistent, all the weights need to be multiplied by p during testing, i.e., take W(l)test = pW(l); or it multiplies all outputs by 1/p during training so that the expected output is consistent with testing time. "
Hi!
In the original paper of stochastic depth, The formula is expressed as
Why mention it be a bug? And fix it as
The text was updated successfully, but these errors were encountered: