A repository of Reinforcement Learning (RL) algorithms tested in different simulation environments.
To install the latest stable version:
$ pip install rl-algorithms
For a specific version:
$ pip install rl-algorithms==0.0.1
To install the latest version available on GitHub:
$ pip install git+https://github.com/blurry-mood/RL-algorithms
- Install the desired environment, for example Robotic Warehouse:
$ cd environments
$ sh warehouse_bot.sh
- Run a simulation, for instance one using a Q-Learning agent:
$ cd simulations/qlearning
$ python qlearning.py
- On-Policy Monte Carlo
- Q-Learning
- SARSA
- n-step SARSA
- Deep Q-Learning (DQN)
- Deep Q-Learning with SARSA update rule (DSN)
- REINFORCE
- Actor-Critic
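To make the difference between two of these concrete, here is an illustrative sketch of the standard tabular Q-Learning and SARSA update rules (textbook formulas, not this package's internals):

from collections import defaultdict

# Q[s][a] is the estimated value of taking action a in state s.
def make_q_table(actions):
    return defaultdict(lambda: {a: 0.0 for a in actions})

def q_learning_update(Q, s, a, r, s_next, alpha, gamma):
    # Off-policy: bootstrap from the greedy (max-valued) next action.
    target = r + gamma * max(Q[s_next].values())
    Q[s][a] += alpha * (target - Q[s][a])

def sarsa_update(Q, s, a, r, s_next, a_next, alpha, gamma):
    # On-policy: bootstrap from the action the agent actually takes next.
    target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])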
Every algorithm is implemented as a subclass of Agent. Each subclass must implement a handful of methods, namely save, load, take_action, update, and decode_state.
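As a sketch of that interface (method names from above; the signatures are inferred from the usage script further down, so treat them as assumptions rather than the package's exact API):

# Sketch of the Agent interface; signatures may differ from the package's exact API.
class AgentSketch:
    def save(self, path):             # persist the agent's parameters to disk
        ...
    def load(self, path):             # restore parameters saved earlier
        ...
    def take_action(self, state):     # map the (decoded) state to an action
        ...
    def update(self, state, reward):  # learn from the observed transition
        ...
    def decode_state(self, state):    # environment-specific; user-implemented
        ...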
Each algorithm re-implements all of these methods except decode_state, which is left for the user to implement based on the target environment. The take_action method provides a description of what decode_state should look like (its inputs and outputs). In other words, for each environment, a class extending the desired algorithm class must reimplement decode_state.
Here's a concrete example of how to use the package:

from rl_algorithms import QLearning

class MyAgent(QLearning):
    def decode_state(self, state):
        # Convert the raw environment state into a hashable key
        # that the tabular agent can index on.
        return tuple(state)

# 10 discrete actions; alpha, gamma, and eps are the usual learning rate,
# discount factor, and exploration rate.
qlearning = MyAgent(actions=list(range(10)), alpha=1e-2, gamma=0.85, eps=0.2)
For more examples, check the files inside the simulations/ folder.
The base agents (algorithms) are implemented so they are ready to use off the shelf; the same methods are called in the same order regardless of the algorithm.
Here's a script that illustrates the idea:
"""
After defining the agent class & instance (for e.g. named qlearning)
"""
qlearning.load('qlearning_minihack') # load agent
for episode in range(10):
state = env.reset()
env.render(state)
n = 0
done = False
qlearning.start_episode() # initialize agent for a new episode
while not done:
n += 1
action = qlearning.take_action(state) # take action based on state
state, reward, done, info = env.step(action)
qlearning.update(state, reward) # learn from reward
env.render(state)
qlearning.end_episode() # update agent's internal logic
qlearning.save('qlearning_minihack') # save agent in Hard drive
The only thing that changes from one algorithm to another is the inputs and outputs of each method.
Also, some algorithms don't require calling every method; for Q-Learning, start_episode and end_episode can safely be skipped.
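For instance, a stripped-down Q-Learning training loop using only the remaining methods might look like this (same assumed env and qlearning instance as above):

for episode in range(10):
    state = env.reset()
    done = False
    while not done:
        action = qlearning.take_action(state)         # epsilon-greedy action selection
        state, reward, done, info = env.step(action)
        qlearning.update(state, reward)               # Q-Learning needs no episode hooks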
RL-Algorithms is a growing package. If you encounter a bug or would like to request a feature, please feel free to open an issue on GitHub.