KMA-RL1

Based on the CS-234 course

Lecute 1 - Introduction to Reinforcement Learning

Additional Materials

High level introduction: SB chapter 1
Linear algebra review
Probability review
Python tutorial

Lecture discussion

Lecture 2 - Tabular MDP planning

SB chapter 3, 4.1-4.4

Lecture 3 - Tabular RL policy evaluation

SB chapter 5.1, 5.5, 6.1-6.3
David Silver's Lecture 4

Lecture 4 - Q-learning

SB chapter 5.2, 5.4, 6.4-6.5, 6.7

Lecture (4, 5, 6) - RL with function approximation

SB chapter 9.3, 9.6, 9.7

Lecture (7, 8) - Policy Search

Practical part

Practice-related

Recordings

Homeworks

Homework 1 (1, 2, 3)

Submission link
First deadline: 6/03/22 (30 points max)
Second deadline: 13/03/22 (25 points max)
Final deadline: 24/04/22 (20 points max)

Homework 2 (4, 5, 6)

Submission link
First deadline:
Second deadline:
Final deadline: 24/04/22

Homework 3 (7, 8)

Submission link
First deadline:
Second deadline:
Final deadline: 24/04/22

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
homework1		homework1
homework2		homework2
homework3		homework3
lecture1		lecture1
lecture2		lecture2
lecture3		lecture3
lecture4		lecture4
lecture5		lecture5
lecture6		lecture6
lecture7		lecture7
lecture8		lecture8
practice notebooks		practice notebooks
.gitignore		.gitignore
README.md		README.md
SB-RLbook.pdf		SB-RLbook.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KMA-RL1

Lecute 1 - Introduction to Reinforcement Learning

Additional Materials

Lecture discussion

Lecture 2 - Tabular MDP planning

Lecture 3 - Tabular RL policy evaluation

Lecture 4 - Q-learning

Lecture (4, 5, 6) - RL with function approximation

Lecture (7, 8) - Policy Search

Practical part

Practice-related

Recordings

Homeworks

Homework 1 (1, 2, 3)

Homework 2 (4, 5, 6)

Homework 3 (7, 8)

List of papers

About

Releases

Packages

Contributors 2

Languages

fido-ai/KMA-RL1

Folders and files

Latest commit

History

Repository files navigation

KMA-RL1

Lecute 1 - Introduction to Reinforcement Learning

Additional Materials

Lecture discussion

Lecture 2 - Tabular MDP planning

Lecture 3 - Tabular RL policy evaluation

Lecture 4 - Q-learning

Lecture (4, 5, 6) - RL with function approximation

Lecture (7, 8) - Policy Search

Practical part

Practice-related

Recordings

Homeworks

Homework 1 (1, 2, 3)

Homework 2 (4, 5, 6)

Homework 3 (7, 8)

List of papers

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages