Why my train loss equal to nan? #17

bingoohe · 2019-06-09T01:36:10Z

Hi,
When I run dssm_rnn.py, the train loss always shows nan. Change learning rate, no matter what.
I print out the variables in the model, and the variable embedding in the word_embeddings_layer shows nan for the first time.
How to deal with it. Thanks!

InsaneLife · 2019-06-10T07:47:55Z

loss = -tf.reduce_sum(tf.log(hit_prob))
should add a minimal number like
loss = -tf.reduce_sum(tf.log(hit_prob + 1e-8))

bingoohe · 2019-06-11T02:57:25Z

The log function is a reason. There is also a case where I will get 0 when calculating the norm.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why my train loss equal to nan? #17

Why my train loss equal to nan? #17

bingoohe commented Jun 9, 2019

InsaneLife commented Jun 10, 2019

bingoohe commented Jun 11, 2019

Why my train loss equal to nan? #17

Why my train loss equal to nan? #17

Comments

bingoohe commented Jun 9, 2019

InsaneLife commented Jun 10, 2019

bingoohe commented Jun 11, 2019