Skip to content

h-shahidi/cs885-rl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

CS885-RL

This repository is for the Reinforcement Learning course CS885 taught by Prof. Pascal Poupart at the University of Waterloo. It covers planning by dynamic programming (value iteration, policy iteration, and modified policy iteration), Q-learning, three bandit algorithms (epsilon-greedy, Thompson sampling, and UCB), REINFORCE, and model-based reinforcement learning.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published