Commit

fix pg bug
pinard.liu committed Feb 28, 2019
1 parent de58c08 commit 34af750
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion reinforcement-learning/policy_gradient.py
@@ -51,7 +51,7 @@ def create_softmax_network(self):
 labels=self.tf_acts)
 self.loss = tf.reduce_mean(self.neg_log_prob * self.tf_vt) # reward guided loss

-self.train_op = tf.train.AdamOptimizer(LEARNING_RATE).minimize(-self.loss)
+self.train_op = tf.train.AdamOptimizer(LEARNING_RATE).minimize(self.loss)

 def weight_variable(self, shape):
 initial = tf.truncated_normal(shape)
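
The sign matters because `self.loss` is already the mean of the negative log-probability weighted by the return, so descending on it raises the probability of well-rewarded actions, while descending on its negation does the opposite. The following is a minimal NumPy sketch of that effect, not the repository's TensorFlow code: the two-action, state-independent softmax policy, the fixed batch, the plain gradient-descent update, and the names `softmax`, `pg_loss_and_grad`, and `train` are all illustrative assumptions.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = logits - np.max(logits)
    e = np.exp(z)
    return e / e.sum()


def pg_loss_and_grad(logits, actions, returns):
    """Return mean(-log pi(a) * G) and its gradient w.r.t. the logits.

    Hypothetical stand-in for `tf.reduce_mean(neg_log_prob * tf_vt)`,
    using a policy that ignores the state.
    """
    probs = softmax(logits)
    neg_log_prob = -np.log(probs[actions])
    loss = np.mean(neg_log_prob * returns)
    one_hot = np.eye(len(logits))[actions]
    # d(-log pi(a)) / d logits = probs - one_hot(a)
    grad = np.mean((probs - one_hot) * returns[:, None], axis=0)
    return loss, grad


def train(sign, steps=200, lr=0.1):
    """Gradient descent on sign * loss; sign=-1 mimics minimize(-loss)."""
    logits = np.zeros(2)
    actions = np.array([0, 1])       # assumed batch: each action taken once
    returns = np.array([1.0, -1.0])  # action 0 was rewarded, action 1 penalised
    for _ in range(steps):
        _, grad = pg_loss_and_grad(logits, actions, returns)
        logits -= lr * sign * grad
    return softmax(logits)


p_fixed = train(sign=+1.0)  # corresponds to minimize(self.loss)
p_buggy = train(sign=-1.0)  # corresponds to minimize(-self.loss)
print("P(action 0) after minimize(loss): ", p_fixed[0])
print("P(action 0) after minimize(-loss):", p_buggy[0])
```

Under these assumptions, descending on the loss drives the probability of the rewarded action toward 1, and descending on the negated loss drives it toward 0.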

0 comments on commit 34af750
