epsilon-greedy-exploration

Here are 12 public repositories matching this topic...

junthbasnet / Playing-Pong-with-Deep-Reinforcement-Learning

🏓 A deep learning model that learns control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards in the RL Pong environment.

reinforcement-learning deep-reinforcement-learning artificial-intelligence deep-q-network pong-game epsilon-greedy-exploration credit-assignment-problem

Updated Mar 28, 2020
Python

addy1997 / RL-Algorithms

This repository contains RL algorithms implemented in Python.

reinforcement-learning q-learning sarsa hacktoberfest expected-sarsa monte-carlo-methods rl-algorithms q-learning-vs-sarsa hacktoberfest2020 epsilon-greedy-exploration double-sarsa double-expected-sarsa gradient-bandits

Updated Oct 18, 2020
Jupyter Notebook

Jash-2000 / Pole-Balance-Control-Algorithms

Various model-based and model-free algorithms, both intelligent and naive, developed for the beam balance environment in OpenAI Gym.

deep-reinforcement-learning epsilon-greedy-exploration boltzman-policy-reward variational-pid-controller

Updated Mar 29, 2021
Jupyter Notebook

Anjali001 / Reinforcement-Learning

reinforcement-learning policy-gradient reinforce greedy-algorithm td-learning sarsa-learning td-lambda exploration-exploitation epsilon-greedy-exploration ucb-algorithm

Updated May 16, 2022
Jupyter Notebook

matakshay / DeepRL-for-Delayed-Rewards

Deep RL for Temporal Credit Assignment in decision processes with delayed rewards

deep-neural-networks monte-carlo deep-reinforcement-learning q-learning pytorch reinforcement-learning-algorithms sarsa markov-decision-processes multi-layer-perceptron temporal-differencing-learning node2vec state-representation-learning graph-neural-networks graph-representation-learning pytorch-geometric model-free-rl epsilon-greedy-exploration delayed-rewards episodic-rewards temporal-credit-assignment

Updated Jun 18, 2022
Jupyter Notebook

alxndrTL / RL-essais-cliniques

reinforcement-learning clinical-trials multi-armed-bandit exploration-exploitation epsilon-greedy-exploration ucb-algorithm essais-cliniques

Updated Sep 6, 2022

namhainguyen2803 / Blackjack-Using-ReinforcementLearning

A mini project built during the 3 days of the Tet holiday in 2023.

game reinforcement-learning blackjack-game epsilon-greedy-exploration monte-carlo-every-visit

Updated Jan 26, 2023
Python

HrayrMuradyan / A-B-Testing

A simulation of the Epsilon-Greedy and Thompson Sampling algorithms for Bayesian A/B testing. The project shows how both algorithms find the optimal bandit and approximate the rewards of each bandit, given the true reward. Visualizations demonstrate the learning process and convergence.

thompson-sampling epsilon-greedy ab-testing abtesting a-b-testing epsilon-greedy-exploration bayesian-ab-testing

Updated Apr 13, 2023
Jupyter Notebook
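
For readers new to the topic, the epsilon-greedy bandit idea mentioned in the entry above can be sketched roughly as follows. This is a minimal illustration and not code from that repository: the function name `epsilon_greedy_bandit`, the Gaussian reward model with unit variance, and all parameter values are assumptions chosen for the example.

```python
import random


def epsilon_greedy_bandit(true_means, epsilon=0.1, steps=5000, seed=0):
    """Run epsilon-greedy on a bandit with Gaussian rewards (assumed model)."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Hypothetical reward model: Gaussian noise around the true mean.
        reward = rng.gauss(true_means[arm], 1.0)

        # Incremental update of the sample mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


estimates, counts = epsilon_greedy_bandit([1.0, 2.0, 3.0])
print("estimated means:", [round(e, 2) for e in estimates])
print("pull counts:", counts)
```

With a small `epsilon`, most pulls should go to the arm with the highest true mean, while the occasional random pull keeps the estimates of the other arms from going stale.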

OscarHuangWind / Preference-Guided-DQN-Atari

[TNNLS] PGDQN: A generalized and efficient preference-guided epsilon-greedy policy equipped DQN for Atari and Autonomous Driving

pytorch dqn atari autonomous-driving epsilon-greedy-exploration

Updated Oct 9, 2023
Python

takud1 / dqn-lunar-lander

A Deep Q-Network reinforcement learning model trained to safely land a lunar lander in the Farama Gymnasium

deep-reinforcement-learning deep-q-learning epsilon-greedy-exploration

Updated Dec 24, 2023
Python

Daksh2060 / gridworld-reinforcement-learning

This project implements Value Iteration and Q-Learning algorithms to solve a variety of gridworld mazes and puzzles. It provides pre-defined policies that can be customized by adjusting parameters, and it optimizes policies through iterative reinforcement learning. It also gives the agent exploration capabilities with epsilon-greedy Q-Learning.

python reinforcement-learning gridworld-environment q-learning-algorithm epsilon-greedy-exploration

Updated Apr 25, 2024
Python
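
As a rough illustration of epsilon-greedy Q-Learning as named in the entry above, here is a minimal sketch on a tiny one-dimensional corridor. It is not taken from that repository: the corridor environment, the function name `q_learning_corridor`, the reward of 1.0 at the goal, and the hyperparameters are all assumptions made for the example.

```python
import random


def q_learning_corridor(n_states=5, episodes=500, alpha=0.5,
                        gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with epsilon-greedy action selection.

    Hypothetical environment: states 0..n_states-1 in a row, the agent
    starts at 0, and the rightmost state is a terminal goal.
    Actions: 0 = move left, 1 = move right.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1

    for _ in range(episodes):
        state = 0
        while state != goal:
            if rng.random() < epsilon:
                # Explore: random action.
                action = rng.randrange(2)
            else:
                # Exploit: greedy action, breaking ties at random so the
                # untrained agent does not keep walking into the left wall.
                best = max(q[state])
                action = rng.choice([a for a in (0, 1) if q[state][a] == best])

            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0

            # Q-learning update; the goal row of the table stays at zero,
            # so the target reduces to the reward on the final step.
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state

    return q


q_table = q_learning_corridor()
# Greedy action in every non-terminal state after training.
greedy_policy = [max((0, 1), key=lambda a: q_table[s][a])
                 for s in range(len(q_table) - 1)]
print("greedy policy:", greedy_policy)
```

In this setup the learned greedy policy should move right in every non-terminal state, since moving left only delays the discounted goal reward.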

dd-jero / Multi-Lane-Autonomous-Driving-Based-on-Deep-Reinforcement-Learning-Considering-Obs-TrafficSig

Research on multi-lane autonomous driving based on deep reinforcement learning, taking obstacles and traffic signals into account

python unity deep-reinforcement-learning deep-q-network prioritized-experience-replay ml-agents noisy-networks virtual-environment epsilon-greedy-exploration

Updated Jun 18, 2024
ASP.NET
