Potential Misalignment in p2e_dv2 and p2e_dv3 Implementations with Original Paper #322

tallance · 2024-10-10T19:13:48Z

I've noticed a potential misalignment in the p2e_dv2 and p2e_dv3 implementations regarding what the ensemble predicts. According to the Plan2Explore paper, the ensemble should predict the image embedding, not the posterior state. The implementation in p2e_dv1 appears aligned with this:

loss -= next_obs_embedding_dist.log_prob(embedded_obs.detach()[1:]).mean()

However, in p2e_dv2 and p2e_dv3, the ensemble instead seems to be trained to predict the next (randomized) posterior state:

loss -= next_obs_embedding_dist.log_prob(posteriors.view(sequence_length, batch_size, -1).detach()[1:]).mean()
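
To make the difference concrete, here is a minimal, framework-free sketch of the two training targets. It is not the repository's code: the unit-variance Gaussian likelihood, the helper names (gaussian_log_prob, ensemble_loss) and the sizes (T, embed_dim, stoch_dim) are all assumptions chosen for illustration. The only point is that the same loss is computed against a different target, and that the ensemble's output size has to change with it.

```python
import math
import random


def gaussian_log_prob(mean, target, std=1.0):
    """Log-density of `target` under an independent Gaussian, summed over features.

    Assumption: a fixed unit standard deviation, used here only for illustration.
    """
    return sum(
        -0.5 * ((t - m) / std) ** 2 - math.log(std) - 0.5 * math.log(2 * math.pi)
        for m, t in zip(mean, target)
    )


def ensemble_loss(predictions, targets):
    """Negative mean log-likelihood of targets[1:] under the predictions.

    Mirrors the `[1:]` slice in the snippets above: the prediction made at
    step t is scored against the target at step t + 1.
    """
    next_targets = targets[1:]
    assert len(predictions) == len(next_targets)
    log_probs = [gaussian_log_prob(p, t) for p, t in zip(predictions, next_targets)]
    return -sum(log_probs) / len(log_probs)


random.seed(0)
T, embed_dim, stoch_dim = 4, 3, 2  # hypothetical sizes

# Stand-ins for the encoder output and the flattened posterior states.
embedded_obs = [[random.gauss(0, 1) for _ in range(embed_dim)] for _ in range(T)]
posteriors = [[random.gauss(0, 1) for _ in range(stoch_dim)] for _ in range(T)]

# p2e_dv1-style target: the ensemble outputs embed_dim features per step.
pred_embed = [[0.0] * embed_dim for _ in range(T - 1)]
loss_embedding = ensemble_loss(pred_embed, embedded_obs)

# p2e_dv2/p2e_dv3-style target: the ensemble outputs stoch_dim features per step.
pred_post = [[0.0] * stoch_dim for _ in range(T - 1)]
loss_posterior = ensemble_loss(pred_post, posteriors)

print(loss_embedding, loss_posterior)
```

In this sketch the two losses differ only in the target passed to ensemble_loss, which is the same difference as between the two lines quoted from the implementations.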

Could this be an intentional modification, or am I missing something about how these predictions should be handled?
