This example reproduces the DDPG (Deep Deterministic Policy Gradient) deep reinforcement learning algorithm with PARL and reaches the same level of performance as reported in the paper on the classic MuJoCo environments.
Please see here to learn more about the MuJoCo environments.
- python3.5+
- paddlepaddle>=1.0.0
- parl
- gym
- tqdm
- mujoco-py>=1.50.1.0
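
Before training, it can help to confirm that the dependencies listed above are importable. The following is a minimal sketch, not part of this repository: the helper names `find_missing` and `python_ok` are hypothetical, and the mapping from package names to import names (`paddle`, `mujoco_py`, and so on) is an assumption about a typical installation. It checks only that each module can be located, not the version constraints.

```python
import importlib.util
import sys

# Assumed mapping from package name to import name; adjust if your
# installation exposes different module names.
REQUIRED_MODULES = {
    "paddlepaddle": "paddle",
    "parl": "parl",
    "gym": "gym",
    "tqdm": "tqdm",
    "mujoco-py": "mujoco_py",
}


def find_missing(modules):
    """Return the package names whose module cannot be located."""
    missing = []
    for package, module in modules.items():
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(module) is None:
            missing.append(package)
    return missing


def python_ok(version_info=sys.version_info):
    """Return True when the interpreter is Python 3.5 or newer."""
    return tuple(version_info[:2]) >= (3, 5)


if __name__ == "__main__":
    print("Python >= 3.5:", python_ok())
    print("Missing packages:", find_missing(REQUIRED_MODULES) or "none")
```

Any package reported as missing would need to be installed before running the training script.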
# To train an agent on the HalfCheetah-v2 environment
python train.py
# To train on a different environment
# python train.py --env [ENV_NAME]
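
The commands above imply that `--env` is optional and falls back to HalfCheetah-v2. As an illustration only, a script could handle such a flag with `argparse` roughly as follows; this is a sketch under that assumption, and the actual `train.py` may parse its arguments differently. The function name `parse_args` is hypothetical, and `Hopper-v2` is used merely as an example of another environment name.

```python
import argparse


def parse_args(argv=None):
    """Parse command-line options; --env defaults to HalfCheetah-v2."""
    parser = argparse.ArgumentParser(description="Train a DDPG agent.")
    parser.add_argument(
        "--env",
        default="HalfCheetah-v2",
        help="Name of the MuJoCo environment to train on.",
    )
    # argv=None makes argparse read sys.argv[1:], as a real script would.
    return parser.parse_args(argv)


# Passing an explicit list mimics the two command lines shown above.
default_args = parse_args([])
custom_args = parse_args(["--env", "Hopper-v2"])
print("Default environment:", default_args.env)
print("Selected environment:", custom_args.env)
```

A real training script would then typically pass the chosen name to `gym.make` to build the environment.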