std deviation in reward computation can be negative sometimes #4

siddharth119 · 2021-06-15T21:28:35Z

Can we ensure that the standard deviation is never negative? A negative value throws an error at line 63 of reward_model.py.

This is an edge case: the reward should never be negative, but the optimization problem can sometimes produce a flow of -0.00, and we want to be robust to that.
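One possible guard, as a rough sketch only: the code in reward_model.py is not shown here, so the helper name `sanitize_std` and the tolerance `tol` are hypothetical. The idea is to treat tiny negative values (solver round-off such as -0.00) as zero, and still fail loudly if the value is negative by more than the tolerance, since that would point to a real bug rather than numerical noise.

```python
import math


def sanitize_std(std: float, tol: float = 1e-6) -> float:
    """Hypothetical helper: map round-off negatives to 0.0, reject real negatives.

    `tol` is an assumed threshold; pick it to match the solver's precision.
    """
    if math.isnan(std):
        raise ValueError("std is NaN")
    if std < -tol:
        # Too negative to be round-off: surface it instead of hiding it.
        raise ValueError(f"std is negative beyond tolerance: {std}")
    # Anything in [-tol, 0], including -0.0, becomes a clean positive zero.
    return std if std > 0.0 else 0.0
```

Calling this on the value just before it is used as a standard deviation would make the computation robust to a flow of -0.00 without masking larger errors.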


siddharth119 assigned wgosrich Jun 15, 2021
