You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
GRU with block-diagonal recurrent weights (needs investigation since i don't know what is the block-diagonal variant: https://arxiv.org/abs/1905.12340)
Critic regresses both imagined return and real ones: "The critic replay loss uses the imagination returns $R^t_{\lambda}$ at the start states of the imagination rollouts as on-policy value annotations for the replay trajectory to then compute λ-returns over the replay rewards."
Hi!
Sharing slight change in Dreamer V3 according to their updated(2024/04/17) manuscript
https://arxiv.org/pdf/2301.04104
Also their codes are updated few hours ago
https://github.com/danijar/dreamerv3
It includes change in the optimizer (LaProp), experiments with larger num envs, etc
Just wanted to let you know! Thanks!
The text was updated successfully, but these errors were encountered: