Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Faster Convergence #51

Open
vwxyzjn opened this issue Jan 27, 2022 · 2 comments
Open

Faster Convergence #51

vwxyzjn opened this issue Jan 27, 2022 · 2 comments

Comments

@vwxyzjn
Copy link
Collaborator

vwxyzjn commented Jan 27, 2022

Training an agent now still takes a long time. The particular experiment in #36 took 4d 9h 11m 14s to finish.

Looking at the reward chart, it appears the agent could achieve 70% of the final performance in just 50M steps (or about 10 hours into training)

image

We should try to optimize based on the 10 hours time computational budget.

@vwxyzjn
Copy link
Collaborator Author

vwxyzjn commented Jan 27, 2022

The bottleneck I think is still largely on the NN side. So one thing worth trying is to reduce the NN size.

Alternatively, I noticed the learning rate annealing, in the end, seems to really help the algorithm converge. So maybe we could also try using a smaller learning rate and just turn off annealing.

Maybe we could tune with the discount factor (we should also visualize the discounted returns (what the agent actually optimized for).

5196CF77-ED9B-43F5-AEF2-C1601A4AAEBC

@vwxyzjn
Copy link
Collaborator Author

vwxyzjn commented Jan 31, 2022

#56 tries to address this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant