BioGym 🐾

BioGym is a spatio-temporal wildlife management reinforcement learning (RL) environment built on the Gymnasium library (formerly OpenAI Gym).

It was created as part of my Master's thesis (TDT4900), which I wrote in spring 2023 at NTNU under the supervision of Keith L. Downing. My thesis is available here.

I presented my thesis at the NorwAI Innovate Conference 2023, where I also had a poster, which you can see here.

Master's thesis 🎓

My thesis focused on Deep Reinforcement Learning for Spatio-Temporal Wildlife Management, and my research goal was to:

Explore the use of different deep reinforcement learning (DRL) algorithms on the task of spatio-temporal wildlife management, with the aim of maintaining a diverse and stable ecosystem.

To do this, I created a spatio-temporal wildlife management simulation based on a tri-trophic predator-prey model. The simulation was then wrapped as a Gymnasium environment so that it follows the library's standard API. This makes it possible to "plug-and-play" with RL algorithms from Stable Baselines3, and makes it easier for anyone interested to work with the environment.
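The exact equations of the tri-trophic model are given in the thesis, not in this README. As a rough illustration of what one update of such a food chain can look like, here is a minimal sketch assuming a Lotka-Volterra-style chain (plant, herbivore, carnivore) integrated with a simple Euler step. The function name `food_chain_step` and every rate below are hypothetical illustration values, and the sketch leaves out the spatial grid entirely.

```python
def food_chain_step(state, dt=0.1, growth=0.5, capacity=100.0,
                    graze=0.01, hunt=0.01, efficiency=0.5,
                    death_herbivore=0.1, death_carnivore=0.1):
    """Advance a (plant, herbivore, carnivore) tuple by one Euler step.

    All parameters are made-up values for illustration, not the
    parameters used in the thesis.
    """
    plant, herbivore, carnivore = state

    # Plants grow logistically and are eaten by herbivores.
    d_plant = growth * plant * (1 - plant / capacity) - graze * plant * herbivore
    # Herbivores gain from grazing, lose to predation and natural death.
    d_herbivore = (efficiency * graze * plant * herbivore
                   - hunt * herbivore * carnivore
                   - death_herbivore * herbivore)
    # Carnivores gain from hunting and lose to natural death.
    d_carnivore = (efficiency * hunt * herbivore * carnivore
                   - death_carnivore * carnivore)

    # Populations cannot go below zero.
    return (
        max(0.0, plant + dt * d_plant),
        max(0.0, herbivore + dt * d_herbivore),
        max(0.0, carnivore + dt * d_carnivore),
    )


# Roll the model forward a few steps from an arbitrary starting point.
state = (50.0, 10.0, 2.0)
for _ in range(3):
    state = food_chain_step(state)
```

In a Gymnasium environment, an update like this would typically run inside `step()`, after the agent's action has been applied to the populations.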

I trained and tested three different RL algorithms (DQN, A2C and PPO) on the environment, which gives a higher reward for more biodiversity. Different metrics were used to measure biodiversity. At each timestep, the RL agent could remove or add population of one of the three species.
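To make the action and reward idea concrete, here is a small sketch under stated assumptions. It assumes a discrete action space with six actions (add or remove for each of three species) and uses the Shannon diversity index as one possible biodiversity measure. The README does not say which metrics or action encoding BioGym actually uses, so `decode_action`, `apply_action`, `shannon_index`, the species names and the `amount` parameter are all hypothetical.

```python
import math

# Hypothetical species labels for a tri-trophic chain.
SPECIES = ("plant", "herbivore", "carnivore")


def decode_action(action):
    """Map a discrete action 0..5 to (species index, +1 add / -1 remove)."""
    if not 0 <= action < 2 * len(SPECIES):
        raise ValueError(f"action must be in 0..{2 * len(SPECIES) - 1}")
    species, kind = divmod(action, 2)
    return species, (1 if kind == 0 else -1)


def apply_action(populations, action, amount=1.0):
    """Return a new population list with `amount` added or removed."""
    species, sign = decode_action(action)
    updated = list(populations)
    # Removing more than is present empties the population instead of going negative.
    updated[species] = max(0.0, updated[species] + sign * amount)
    return updated


def shannon_index(populations):
    """Shannon diversity: higher when populations are more evenly spread."""
    total = sum(populations)
    if total <= 0:
        return 0.0
    return -sum((n / total) * math.log(n / total) for n in populations if n > 0)


# Example: remove 2 units of the third species, then score the result.
populations = apply_action([5.0, 5.0, 5.0], action=5, amount=2.0)
reward = shannon_index(populations)
```

With a reward like this, an ecosystem dominated by a single species scores 0, while three equal populations score the maximum of ln 3.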

Overview

Below is an overview of the system designed to investigate the applicability of DRL on a spatio-temporal wildlife management simulation:

Results

While all RL algorithms were able to improve with training, PPO stood out with stable and steadily increasing performance. Below is one of the results gathered when training the algorithms on BioGym:

DRL algorithm performance on a 2x2 grid with 10x action multiplier - Equal training time

Interestingly, the RL algorithms employed quite different policies. Some would always remove population of one species, while others both removed and added populations. In my thesis I hypothesize how the policy each algorithm learns is connected to the algorithm's design.

For details on the research background and goal, related work, research process, and results, please take a look at my thesis.


AnmolS99/BioGym

Folders and files

Latest commit

History

Repository files navigation

BioGym 🐾

Master's thesis 🎓

Overview

Results

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages