Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Montezuma's revenge - has this been tried using this codebase? #85

Open
sunchipsster1 opened this issue Nov 20, 2022 · 2 comments
Open
Labels

Comments

@sunchipsster1
Copy link

No description provided.

@sunchipsster1 sunchipsster1 changed the title Montezuma Montezuma's revenge - have you tried it? :) Nov 20, 2022
@sunchipsster1
Copy link
Author

Hello! And thank you so much for this wonderful resource :) :)

I am currently working on montezuma's revenge, and have been trying to use your awesome codebase to better understand baselines that have been reported to work for montezuma's (e.g. Rainbow). I really enjoy your codebase because it is written in Pytorch rather than tensorflow or jax.

However, I have been unable to reproduce the reported result in the paper that Montezuma's should learn > 400 reward on rainbow, as I have not been able to get > 0 reward at all for any seeds.

I have been running:
python -u main.py --replay-frequency 1 --architecture canonical --game montezuma_revenge --reward-clip 1 --max-episode-length 1000000 --replay-frequency 16 --target-update int(3.2e4) --learn-start int(100e3)

Have you gotten rainbow to work on Montezuma (get > 0 reward), and what hyperparameters did you use? Thank you so much in advance for your kind help! :)

@sunchipsster1 sunchipsster1 changed the title Montezuma's revenge - have you tried it? :) Montezuma's revenge - has this been tried using this codebase? Nov 20, 2022
@Kaixhin
Copy link
Owner

Kaixhin commented Nov 21, 2022

Back when I did release v1.3, as stated, I was unable to achieve any reward on Montezuma's Revenge (the only other result I couldn't match was on H.E.R.O.). However, there were a few changes to the codebase since, which hopefully might allow learning to happen.

I noticed that you are running with several hyperparameters that are different to the original paper. All you should need is python --game montezuma_revenge (with different seeds). So I would recommend trying that with a few seeds.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants