Exploring and implementing different Reinforcement Learning algorithms to better my understanding, particulary model-free control methods.
Value-Based, Policy-Based and Policy-Optimization Methods, and Actor-Critic Methods
Code Refactoring
SARSA,Q-Learning,DQN, Double DQNREINFORCE,REINFORCE with BaselineAdvantage Actor-Critic for TD(0)/TD(λ)- A2C/A3C
TRPO,PPODDPG,TD3, SAC