feat: implement the sliding tile puzzle env #189
This looks really great! A couple of minor changes, and then we've got to wait and see how well it performs 🔥
Thank you for the PR! Left a few comments on a first review :)
What's the status of this PR? Anything I could help with?
Sorry this took so long, but awesome work @ElshadaiK! I think it's pretty much ready; just some minor comments from my side.
LGTM overall, still investigating why training doesn't work.
0284549 to 72af3fe
Just some small things we need to add here. We also need to add this to mkdocs.yaml and add the GIF.
Co-authored-by: Sasha <[email protected]>
Co-authored-by: Clément Bonnet <[email protected]>
5acc203 to 00d6980
LGTM
Details:
Implements the full SlidingTilePuzzle environment with actor-critic and random networks, as well as documentation.
Notes:
GIFs are still to be updated. Training of an A2C agent is ongoing.
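
For readers unfamiliar with the puzzle, here is a minimal sketch of sliding tile dynamics in plain Python. It is not the code from this PR: the names `step`, `is_solved`, and `MOVES`, the action encoding, and the choice to treat out-of-bounds moves as no-ops are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Board = List[List[int]]  # square grid; 0 marks the empty tile

# Hypothetical action encoding: the direction in which the empty tile moves.
MOVES: Dict[int, Tuple[int, int]] = {
    0: (-1, 0),  # up
    1: (0, 1),   # right
    2: (1, 0),   # down
    3: (0, -1),  # left
}


def find_empty(board: Board) -> Tuple[int, int]:
    """Return the (row, column) position of the empty tile."""
    for r, row in enumerate(board):
        for c, value in enumerate(row):
            if value == 0:
                return r, c
    raise ValueError("board has no empty tile")


def step(board: Board, action: int) -> Board:
    """Return a new board after moving the empty tile.

    Assumption: a move that would leave the grid is a no-op.
    """
    r, c = find_empty(board)
    dr, dc = MOVES[action]
    nr, nc = r + dr, c + dc
    new_board = [row[:] for row in board]  # copy so the input is untouched
    n = len(board)
    if 0 <= nr < n and 0 <= nc < n:
        # Swap the empty tile with its neighbour in the chosen direction.
        new_board[r][c], new_board[nr][nc] = new_board[nr][nc], new_board[r][c]
    return new_board


def is_solved(board: Board) -> bool:
    """Solved means tiles 1..n*n-1 in row-major order with the empty tile last."""
    flat = [value for row in board for value in row]
    return flat == list(range(1, len(flat))) + [0]
```

Under these assumptions, a board one move away from the goal is solved by a single `step` call, and an invalid move returns an unchanged copy of the board.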