CS885-RL

This repository is for the Reinforcement Learning course CS885 taught by Prof. Pascal Poupart at the University of Waterloo. It covers planning by dynamic programming (value iteration, policy iteration, and modified policy iteration), Q-learning, three bandit algorithms (epsilon-greedy, Thompson sampling, and UCB), REINFORCE, and model-based reinforcement learning.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
notebook.ipynb		notebook.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS885-RL

About

Releases

Packages

Languages

h-shahidi/cs885-rl

Folders and files

Latest commit

History

Repository files navigation

CS885-RL

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages