Skip to content

Deep Reinforcement Learning and Artificial Intelligence in Robotics

Notifications You must be signed in to change notification settings

OakLake/DeepRL-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI-Learning

Learning and Artificial Intelligence in Robotics

Contents:

  • Fixed Topology Neural Network Search.
  • Q-Learning.
  • Deep RL Policy Network.
  • Deep Q-Learning (DQN).
  • Deep Q-Learning (DQN) +[target network, reward clipping, frame skipping].
  • Deep Deterministic Policy Gradient (DDPG).
  • Deep Neuroevolution. [working, no results to show!]

Results:

Random Neural Network

MountainCar-v0:
  • Using random weight search for fixed topology neural network.


Q-Learning

CartPole-v1:
  • Tabular Q-Learning.


Deep Q-Learning

LunarLander-v2:
  • Deep Q Learning with frame skipping(repeat same action for 3 frames), target network updated at (epsiode%2==0) & reward clipping(-1,1). landing at epsiode 720:

    paremeters, for below: - refresh target net every 10 episodes. - skip 3 frames. - minibatch size 32. - at episode 460.


Deep RL Policy

LunarLander-v2:


Deep Deterministic Policy Gradient (DDPG).

InvertedPendulum-v2:

Pendulum-v0:

InvertedDoublePendulum-v2:

Link: https://youtu.be/fXbqDDaJDvg

Reacher-v2:

--------

Interesting Resouces:

Releases

No releases published

Packages

No packages published

Languages