Deep-Reinforcement-Learning-using-Capsule-Network

Capsule Network

A Capsule Neural Network (CapsNet) is a type of artificial neural network (ANN) designed to better model hierarchical relationships. The approach attempts to more closely mimic biological neural organization. The idea is to add structures called capsules to a convolutional neural network (CNN), and to reuse the output of several of those capsules to form representations for higher-order capsules that are more stable with respect to various perturbations. The output of a capsule is a vector consisting of the probability of an observation and a pose for that observation. This is similar to the output used for classification with localization in CNNs.

Among other benefits, CapsNets address the "Picasso problem" in image recognition: images that have all the right parts but not in the correct spatial relationship (e.g., in a "face", the positions of the mouth and one eye are switched). For image recognition, CapsNets exploit the fact that while viewpoint changes have nonlinear effects at the pixel level, they have linear effects at the part/object level. This can be compared to inverting the rendering of an object made of multiple parts.

https://medium.com/ai%C2%B3-theory-practice-business/understanding-hintons-capsule-networks-part-i-intuition-b4b559d1159b
https://towardsdatascience.com/a-simple-and-intuitive-explanation-of-hintons-capsule-networks-b59792ad46b1
https://www.youtube.com/watch?v=rTawFwUvnLE (Geoffrey Hinton's talk on Capsule Neural Networks)
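
To make the "probability plus pose" vector output concrete, here is a minimal sketch of the squash nonlinearity described in the dynamic routing paper (arXiv:1710.09829). It rescales a capsule's raw output so that its length lies between 0 and 1 and can be read as a probability, while its direction, which encodes the pose, is unchanged. The function names, the pure-Python list representation, and the `eps` stabiliser are assumptions made for illustration and are not taken from the notebooks in this repository.

```python
import math


def squash(s, eps=1e-9):
    """Rescale vector s to a length in [0, 1) without changing its direction."""
    sq_norm = sum(x * x for x in s)
    norm = math.sqrt(sq_norm)
    # Squash formula: |s|^2 / (1 + |s|^2) * s / |s|; eps avoids division by zero.
    scale = sq_norm / (1.0 + sq_norm) / (norm + eps)
    return [scale * x for x in s]


def length(v):
    """Euclidean length, read as the probability that the entity is present."""
    return math.sqrt(sum(x * x for x in v))


weak = squash([0.1, 0.0, 0.1])    # short input: length stays near 0
strong = squash([3.0, 0.0, 4.0])  # long input: length approaches 1
print(length(weak), length(strong))
```

Short input vectors are shrunk toward zero and long ones saturate just below one, so the length behaves like a presence probability.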

Dynamic Routing

The fact that the output of a capsule is a vector makes it possible to use a powerful dynamic routing mechanism to ensure that the output of the capsule gets sent to an appropriate parent in the layer above.

https://arxiv.org/abs/1710.09829
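
The routing-by-agreement procedure from the paper linked above can be sketched as follows. This is a simplified, pure-Python illustration and not the code used in this repository. It assumes the prediction vectors `u_hat[i][j]` (child capsule `i`'s prediction for parent capsule `j`) have already been computed, and the names `route`, `u_hat`, and `num_iterations` are hypothetical.

```python
import math


def squash(s, eps=1e-9):
    """Rescale vector s to a length in [0, 1) without changing its direction."""
    sq_norm = sum(x * x for x in s)
    scale = sq_norm / (1.0 + sq_norm) / (math.sqrt(sq_norm) + eps)
    return [scale * x for x in s]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route(u_hat, num_iterations=3):
    """Routing by agreement; u_hat[i][j] is child i's prediction for parent j."""
    n_child, n_parent, dim = len(u_hat), len(u_hat[0]), len(u_hat[0][0])
    b = [[0.0] * n_parent for _ in range(n_child)]  # routing logits
    for _ in range(num_iterations):
        # Each child distributes its output over the parents.
        c = [softmax(row) for row in b]
        v = []
        for j in range(n_parent):
            # Parent input: coupling-weighted sum of the children's predictions.
            s_j = [sum(c[i][j] * u_hat[i][j][d] for i in range(n_child))
                   for d in range(dim)]
            v.append(squash(s_j))
        # Raise the logit wherever a prediction agrees with the parent output.
        for i in range(n_child):
            for j in range(n_parent):
                b[i][j] += sum(p * q for p, q in zip(u_hat[i][j], v[j]))
    return v, c


# Three children agree about parent 0 and disagree about parent 1.
u_hat = [
    [[1.0, 0.0], [1.0, 0.0]],
    [[0.9, 0.1], [-1.0, 0.0]],
    [[1.0, -0.1], [0.0, 1.0]],
]
v, c = route(u_hat)
print(c)
```

In this toy input the children's predictions for parent 0 nearly coincide, so the coupling coefficients shift toward parent 0 and its output vector becomes the longer of the two.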

Deep Q-Learning

Deep Q-Learning is a branch of Reinforcement Learning that uses a neural network, such as a CNN, as the function approximator. We have implemented Deep Q-Learning for a Pong game application, in which the network is trained to play against an AI opponent. The code for running the algorithm, for both single-machine and distributed implementations and for various model configurations, has been uploaded to this repository. The environment is built with Pygame.
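
As a rough illustration of the pieces a Deep Q-Learning agent typically combines, the sketch below shows epsilon-greedy action selection, the one-step Q-learning target, and a small replay buffer. The network itself is omitted: the plain lists of Q-values stand in for the output of a CNN or capsule network. All names here (`epsilon_greedy`, `td_target`, `ReplayBuffer`) and the default `gamma` are assumptions for illustration and do not come from `pong_fun.py` or the notebooks.

```python
import random
from collections import deque


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def td_target(reward, next_q_values, done, gamma=0.99):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a')."""
    if done:
        return reward  # no bootstrap value after a terminal state
    return reward + gamma * max(next_q_values)


class ReplayBuffer:
    """Fixed-size store of transitions; the oldest entries are dropped first."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size, rng=random):
        # Uniform sampling without replacement breaks up correlated frames.
        return rng.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


# Hypothetical Q-values for three Pong actions (e.g. stay, up, down).
action = epsilon_greedy([0.1, 0.7, 0.2], epsilon=0.0)
target = td_target(reward=1.0, next_q_values=[0.5, 2.0], done=False, gamma=0.9)
print(action, target)
```

During training, the network's prediction for the chosen action would be regressed toward `td_target` on mini-batches drawn from the buffer.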

CNN_Deep-Q
Capsule Network for Catcher Game
Capsule Network for Flappy Bird
Capsule__Deep_Q-learning(DCQL)
experiments_notebook_files
CATCHER_capsule_net.ipynb
CapsNet_RL_1024.ipynb
MatCap_RL_1024.ipynb
README.md
catcher_cnn.ipynb
pong_fun.py
small_caps_net_working_2.ipynb

dolaram/Deep-Reinforcement-Learning-using-Capsule-Network
