You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
can it be ensured that the std deviation cannot be negative? this throws an error in line 63 of reward_model.py.
this is an edge case as the reward should never be negative but sometimes the opt problem can generate a flow of -0.00 and we want to be robust to that.
The text was updated successfully, but these errors were encountered:
can it be ensured that the std deviation cannot be negative? this throws an error in
line 63
ofreward_model.py
.this is an edge case as the reward should never be negative but sometimes the opt problem can generate a flow of -0.00 and we want to be robust to that.
The text was updated successfully, but these errors were encountered: