zhoupeng1225/Reward-based-learning-agents

Reward-based-learning-agents

Modified versions of the temporal-difference methods Q-learning and SARSA, together with modified versions of the action-selection methods softmax and ϵ-greedy.
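For orientation, here is a minimal sketch of the standard textbook forms of the four techniques named above. It is not the modified versions implemented in this repository. The function names, the nested-list Q-table layout, and the parameters `alpha`, `gamma`, `epsilon`, and `temperature` are all assumptions made for illustration.

```python
import math
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon pick a uniformly random action, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    # Greedy choice; ties resolve to the lowest action index.
    return q_values.index(max(q_values))


def softmax_probs(q_values, temperature=1.0):
    """Boltzmann distribution over actions (hypothetical helper)."""
    # Subtracting the maximum keeps exp() from overflowing.
    highest = max(q_values)
    exps = [math.exp((q - highest) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_action(q_values, temperature=1.0, rng=random):
    """Sample an action index in proportion to its softmax probability."""
    probs = softmax_probs(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


def q_learning_update(Q, s, a, r, s_next, alpha, gamma):
    """Off-policy TD update: bootstrap from the best action in s_next."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha, gamma):
    """On-policy TD update: bootstrap from the action actually taken in s_next."""
    target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])
```

The two updates differ only in the bootstrap term: Q-learning uses the maximum value in the next state, while SARSA uses the value of the next action chosen by the behaviour policy.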

Run the z5443641.py file to process the raw data with the different methods.
