Skip to content

Latest commit

 

History

History
5 lines (3 loc) · 305 Bytes

README.md

File metadata and controls

5 lines (3 loc) · 305 Bytes

Reinforcement Learning Notes

The (introductory) notes included Bandit Algorithms, MDP, Model-free Methods, Value Function Approximation, Policy Optimization. For the state-of-the-art advances, one can refer to paper directly and some excellent blog.

Hope you enjoy your learning.