Easy21

1 Implementation of Easy21: easy21-implement.py

2 Monte-Carlo Control in Easy21: easy21_mc_control.py

3 TD Learning in Easy21: easy21_sarsa_lambda.py

4 Linear Function Approximation in Easy21: easy21_sarsa_lambda_approx

5 Discussion

What are the pros and cons of bootstrapping in Easy21?
Pros: no need to wait until the end of an episode, accelerate the learning process, decrease the variance
Cons: may introduce bias

Would you expect bootstrapping to help more in blackjack or Easy21 ? Why?
Help more in Easy21, because it takes a longer time on average to finish an episode in easy21 due to the fact that a value of a card can be negative depending on its color.

What are the pros and cons of function approximation in Easy21?
Pros: memory saving, learning speed acceleration
Cons: can only solve the problem approximately since a function approximator cannot represent all the state-action values accurately

How would you modify the function approximator suggested in this section to get better results in Easy21?
Have no idea right now.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
__pycache__		__pycache__
.gitattributes		.gitattributes
Easy21-Johannes.pdf		Easy21-Johannes.pdf
Learning process for lambda 0 and 1 under linear approximation.png		Learning process for lambda 0 and 1 under linear approximation.png
Learning process for lambda 0 and 1.png		Learning process for lambda 0 and 1.png
MSE against lambda under linear approximation.png		MSE against lambda under linear approximation.png
MSE against lambda.png		MSE against lambda.png
README.md		README.md
Value function from MC.png		Value function from MC.png
__init__.py		__init__.py
easy21-implement.py		easy21-implement.py
easy21.py		easy21.py
easy21_mc_control.py		easy21_mc_control.py
easy21_sarsa_lambda.py		easy21_sarsa_lambda.py
easy21_sarsa_lambda_approx.py		easy21_sarsa_lambda_approx.py
test1.png		test1.png
test2.png		test2.png
test3.png		test3.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Easy21

About

Releases

Packages

Languages

Jfhseh/Easy21

Folders and files

Latest commit

History

Repository files navigation

Easy21

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages