This project combines the Proximal Policy Optimization (PPO) algorithm with the Transformer architecture to solve reinforcement learning tasks. The goal is to leverage the power of the Transformer's attention mechanism and the stability of PPO to train agents capable of achieving high-performance results on various environments.
- Clone the repository:
git clone https://github.com/datvodinh10/ppo-transformer.git
- Install requirement:
pip install -r requirements.txt
Open main.ipynb -> Run All