Reinforcement Learning Notes

The (introductory) notes included Bandit Algorithms, MDP, Model-free Methods, Value Function Approximation, Policy Optimization. For the state-of-the-art advances, one can refer to paper directly and some excellent blog.

Hope you enjoy your learning.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Reinforcement Learning Notes

Files

README.md

Latest commit

History

README.md

File metadata and controls

Reinforcement Learning Notes