Skip to content

Latest commit

 

History

History
11 lines (5 loc) · 609 Bytes

README.md

File metadata and controls

11 lines (5 loc) · 609 Bytes

A Multi-step Minimax Q-learning Algorithm

This repository contains the code for the new multi-step algorithm proposed for solving two-player zero-sum stochastic games. This repository can be utilized to reproduce the results provided in the paper titled "A Multi-step Minimax Q-learning algorithm for two-player zero-sum Markov games."

Requirements: nashpy package

Acknowledgments

This code is based on and adapted from Two-Player-SOR. Many thanks to the original authors for their contributions.