Skip to content

A simple stochastic OpenAI environment for training RL agents

Notifications You must be signed in to change notification settings

phddamianboborzi/banana-gym

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repository contains a PIP package which is an OpenAI environment for simulating an enironment in which bananas get sold.

Installation

Install the OpenAI gym.

Then install this package via

pip install -e .

Usage

import gym
import gym_banana

env = gym.make('Banana-v0')

See https://github.com/matthiasplappert/keras-rl/tree/master/examples for some examples.

The Environment

Imagine you are selling bananas. One at a time. And the bananas get bad pretty quickly. Let's say in 3 days. The probability that I will sell the banana is given by

$$p(x) = (1+e)/(1. + e^(x+1))$$

where x-1 is my profit. This x-1 is my reward. If I don't sell the banana, the agent gets a reward of -1 (the price of the banana).

About

A simple stochastic OpenAI environment for training RL agents

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 95.3%
  • Makefile 4.7%