Policy Based RL

Algorithms Implemented
- REINFORCE
- Actor-Critic
- Advantage Actor-Critic (A2C)
- Asynchronous Advantage Actor-Critic (A3C)

Comparison
- REINFORCE
- Actor-Critic
- A2C