EC

Episodic control algorithms:

  • Model-Free Episodic Control (MFEC)
  • Neural Episodic Control (NEC)

Default options for MFEC:

python main.py --kernel mean

Default options for NEC:

python main.py --algorithm NEC \
               --key-size 128 \
               --num-neighbours 50 \
               --dictionary-capacity 500000 \
               --episodic-multi-step 100 \
               --epsilon-final 0.001 \
               --discount 0.99 \
               --learn-start 50000

This codebase was used in Memory-efficient episodic control reinforcement learning with dynamic online k-means and Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax Episodic Control, both presented at the Workshop on Biological and Artificial Reinforcement Learning at NeurIPS 2019. The source code for these papers can be found in this repo here, but it is not as organised as this codebase.

Notes

Notes about this codebase, based on conversations with Robert Kirk:

  • This uses brute-force kNN search, as opposed to the approximate kNN search used in the original works (see the first sketch after this list). Without knowing the full details of DeepMind's k-d tree, especially how they might update the tree with new data, I chose to stick to the exact version to be safe.
  • As in the original works, this uses a hash of the state in order to detect exact state matches.
  • Key/value gradients for the DND are taken from the first instance of each updated key, but averaging would make more sense (see the second sketch after this list).
  • The gradients for the keys and values in the DND are not applied properly (effectively, the gradients are zero) - this is a bug! Despite the bug, this codebase can still reproduce the results from the NEC paper, which suggests those updates may not be important.
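
For reference, here is a minimal sketch of what a brute-force DND lookup with an exact-match hash check might look like, assuming PyTorch; the function and variable names are illustrative, not the ones used in this codebase:

import torch

def dnd_lookup(keys, values, query, state_hash, hash_to_index, num_neighbours=50, delta=1e-3):
    # keys: (capacity, key_size) stored keys; values: (capacity,) stored returns
    # hash_to_index: dict mapping a hash of the raw state to the slot holding its value
    if state_hash in hash_to_index:
        # Exact state match: return the stored value directly, no kNN needed
        return values[hash_to_index[state_hash]]
    # Brute-force kNN: compute distances from the query to every stored key
    dists = torch.cdist(query.unsqueeze(0), keys).squeeze(0)
    knn_dists, idx = dists.topk(num_neighbours, largest=False)
    # Inverse-distance kernel in the style of NEC, normalised to sum to 1
    weights = 1.0 / (knn_dists ** 2 + delta)
    weights = weights / weights.sum()
    return (weights * values[idx]).sum()

For MFEC with the mean kernel, the weighted sum would simply be replaced by values[idx].mean().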
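
And a minimal sketch of what averaging the per-occurrence gradients for each DND slot (rather than keeping only the first instance) could look like, again with illustrative names:

import torch

def average_grads_per_slot(grads, slot_idx, num_slots):
    # grads: (N, key_size) gradients, one per retrieved occurrence of a key
    # slot_idx: (N,) long tensor giving the DND slot each gradient belongs to (slots may repeat)
    summed = torch.zeros(num_slots, grads.size(1))
    summed.index_add_(0, slot_idx, grads)
    counts = torch.zeros(num_slots)
    counts.index_add_(0, slot_idx, torch.ones(slot_idx.size(0)))
    # Average over all occurrences of each slot; slots with no gradient stay zero
    return summed / counts.clamp(min=1).unsqueeze(1)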

Acknowledgements

  • Alex Pritzel for implementation details
  • @RobertKirk for debugging
