The identity connection here follows the same design as the residual blocks in ResNets. The residual path provides richer, better-conditioned gradients when the network is deep. Since the dynamics network is unrolled recurrently for 5 steps, the gradient path for the final unroll step can be much deeper (over 10 layers). Consequently, we add the identity connection here.
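The gradient argument can be illustrated with a scalar toy model (this is a hypothetical sketch, not the repo's actual code): if each unroll step has per-step derivative `w`, the gradient of the final state with respect to the initial state is a product of per-step Jacobians, and the identity connection changes each factor from `w` to `1 + w`.

```python
# Toy scalar "dynamics" model: next_state = f(state), where f has
# derivative w. Unrolled K steps, the gradient of the final state
# w.r.t. the initial state is the product of per-step Jacobians.
# (Hypothetical illustration, not the repo's actual code.)

w = 0.1   # per-step derivative of the learned mapping
K = 5     # number of recurrent unroll steps

# Without identity connection: s_{t+1} = f(s_t), Jacobian = w per step.
grad_plain = w ** K

# With identity connection: s_{t+1} = s_t + f(s_t), Jacobian = 1 + w.
grad_residual = (1.0 + w) ** K

print(grad_plain)     # vanishes toward zero
print(grad_residual)  # stays on the order of 1
```

With a small `w`, the plain product vanishes exponentially in the unroll depth, while the residual product stays near 1, which is the usual ResNet argument applied to the recurrent unroll.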
As for empirical results, we find that the identity connection yields a better reward model. We collected some datasets and trained the model to predict the reward via supervised learning on them; the model with the identity connection achieves a lower test error on reward prediction.
Thank you very much for open-sourcing your code.
I'm a little confused about the reason for the identity connection on the state encoding in DynamicsNetwork in model.py:
Why is the identity connection added on the state encoding rather than the action encoding, and what is its empirical impact on the Atari results?
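To make sure I'm reading it right, the pattern I mean looks roughly like the following sketch (function and variable names here are my own, not the repo's exact code):

```python
import numpy as np

# Hedged sketch of the pattern in question: the dynamics network
# concatenates the state encoding with an action plane, applies a
# learned mapping, then adds the *state* encoding back as an
# identity connection. (Names and shapes are assumptions.)

def dynamics_step(state, action_plane, mapping):
    x = np.concatenate([state, action_plane], axis=0)  # stack channels
    out = mapping(x)                                   # learned mapping
    return out + state  # identity connection on the state encoding only

rng = np.random.default_rng(0)
state = rng.standard_normal((4, 3, 3))   # (C, H, W) state encoding
action = np.ones((1, 3, 3))              # broadcast action plane
W = rng.standard_normal((4, 5)) * 0.01   # tiny linear stand-in for convs

mapping = lambda x: np.tensordot(W, x, axes=([1], [0]))  # -> (4, H, W)

next_state = dynamics_step(state, action, mapping)
print(next_state.shape)  # (4, 3, 3)
```

One thing the sketch suggests: the state encoding has the same shape as the output, so it can be added directly, whereas the action plane does not.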
Looking forward to your reply!