Reinforcement Learning

Some practice exercises, kept here as a record.

Q Learning

Rooms practice
Goal: Given a grid world with 6 rooms, find an optimal path to the goal room.
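
As a rough illustration of the idea, here is a minimal tabular Q-learning sketch. The room layout (DOORS), the goal room number, the reward of 100, and the hyperparameters are all assumptions made up for this example; they are not taken from the practice code itself.

```python
import random

# Hypothetical layout: room -> rooms reachable through a door.
# This map is an assumption for illustration only.
DOORS = {
    0: [4],
    1: [3, 5],
    2: [3],
    3: [1, 2, 4],
    4: [0, 3, 5],
    5: [1, 4],
}
GOAL = 5        # assumed goal room
GAMMA = 0.8     # discount factor (assumed)
ALPHA = 1.0     # learning rate; 1.0 is fine because moves are deterministic
EPISODES = 500  # number of random-walk training episodes (assumed)


def train(seed=0):
    """Learn Q[state][action], where an action is the room to move into."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in nxt} for s, nxt in DOORS.items()}
    for _ in range(EPISODES):
        state = rng.randrange(len(DOORS))
        while state != GOAL:
            next_state = rng.choice(DOORS[state])  # purely random exploration
            reward = 100.0 if next_state == GOAL else 0.0
            # The goal is terminal, so nothing is bootstrapped from it.
            best_next = 0.0 if next_state == GOAL else max(q[next_state].values())
            td_error = reward + GAMMA * best_next - q[state][next_state]
            q[state][next_state] += ALPHA * td_error
            state = next_state
    return q


def greedy_path(q, start, max_steps=10):
    """Follow the highest-valued action from start until the goal is reached."""
    path = [start]
    state = start
    while state != GOAL and len(path) <= max_steps:
        state = max(q[state], key=q[state].get)
        path.append(state)
    return path


if __name__ == "__main__":
    q_table = train()
    print(greedy_path(q_table, start=2))
```

Under this assumed layout, the shortest route from room 2 to the goal takes three moves, and the greedy path read off the learned table should match that length.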

DQN (using TF-Agents)

Cartpole practice
Goal: Given a cart-pole system, keep the pole upright by moving the cart left or right.
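
To make the DQN part a little more concrete, here is a small sketch of the one-step TD target that a DQN loss regresses toward. This is plain Python and not TF-Agents code; the function name td_targets and the sample numbers are hypothetical, and a library such as TF-Agents is expected to handle this step internally.

```python
def td_targets(rewards, next_q_values, dones, gamma=0.99):
    """Compute r + gamma * max_a Q(s', a) for each transition in a batch.

    rewards:       list of floats, one per transition
    next_q_values: list of per-action Q-value lists for the next state
                   (for cart-pole: one value for "left", one for "right")
    dones:         list of bools, True if the episode ended on that step
    """
    targets = []
    for reward, q_next, done in zip(rewards, next_q_values, dones):
        # No bootstrapping once the pole has fallen and the episode is over.
        bootstrap = 0.0 if done else gamma * max(q_next)
        targets.append(reward + bootstrap)
    return targets


if __name__ == "__main__":
    # Two made-up transitions; the second one ends the episode.
    print(td_targets([1.0, 1.0], [[0.5, 2.0], [3.0, 1.0]], [False, True], gamma=0.5))
```

In a full agent, the next-state Q-values would typically come from a separate target network, and the online network would be trained to move its prediction for the taken action toward these targets.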

PPO