Based on PARL, the MADDPG algorithm of deep reinforcement learning has been reproduced.
A simple multi-agent particle world based on gym. Please see here to install and know more about the environment.
Mean episode reward (every 1000 episodes) in training process (totally 25000 episodes).
simple |
simple_adversary |
simple_push |
simple_reference |
simple_speaker_listener |
simple_spread |
simple_tag |
simple_world_comm |
Display after 25000 episodes.
simple |
simple_adversary |
simple_push |
simple_reference |
simple_speaker_listener |
simple_spread |
simple_tag |
simple_world_comm |
- python3.5+
- paddlepaddle>=1.6.1
- parl
- multiagent-particle-envs
- gym==0.10.5
# To train an agent for simple_speaker_listener scenario
python train.py
# To train for other scenario, model is automatically saved every 1000 episodes
# python train.py --env [ENV_NAME]
# To show animation effects after training
# python train.py --env [ENV_NAME] --show --restore