Skip to content

Latest commit

 

History

History
32 lines (21 loc) · 910 Bytes

README.md

File metadata and controls

32 lines (21 loc) · 910 Bytes

Open In Colab

Proximal Policy Optimization + Transformer

This project combines the Proximal Policy Optimization (PPO) algorithm with the Transformer architecture to solve reinforcement learning tasks. The goal is to leverage the power of the Transformer's attention mechanism and the stability of PPO to train agents capable of achieving high-performance results on various environments.

Graph

Forward

Backward

Installation

  1. Clone the repository:
git clone https://github.com/datvodinh10/ppo-transformer.git
  1. Install requirement:
pip install -r requirements.txt

Run

Open main.ipynb -> Run All