Multi-Armed Bandits for Optimizing New Peers in Peer-to-Peer Networks

Idea

Consider the setting of a peer-to-peer network wherein a new peer joins with the intent to be brought "up to speed" with the rest of the network as soon as possible. However, the new peer does not know the network speeds of its seeds, just how much data it receives over time when it chooses a peer and receives data from them for one time step. The reward is how many bytes received in that time slot.

We want to be careful about defining the reward, because we want the agent to choose the peer that is transmitting the fastest. However, consider that network speeds may change, and the optimal seed to leech from will not always be the best.

Various algorithms will be considered, starting with epsilon-greedy and UCB (upper confidence bound). More here.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
docs		docs
report		report
src		src
.editorconfig		.editorconfig
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Armed Bandits for Optimizing New Peers in Peer-to-Peer Networks

Idea

About

Contributors 2

Languages

oscarsandford/network-bandit

Folders and files

Latest commit

History

Repository files navigation

Multi-Armed Bandits for Optimizing New Peers in Peer-to-Peer Networks

Idea

About

Resources

Stars

Watchers

Forks

Contributors 2

Languages