Beauty sleep
A language model (LM) with a revisor block that learns to predict the next embedding weights.
The idea is for the model to learn what it is about to learn, with the aim of improving its representations.
For now, this has turned out to be more of a training framework :)
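To make "predicting the next embedding weights" concrete, here is a toy sketch in plain Python. It is not the architecture used in this repo: the revisor here is a single scalar `w`, the "training step" is a made-up pull toward fixed targets, and the names `sgd_step` and `train_revisor` are hypothetical. It only illustrates the general idea, assuming the revisor sees the current weights and the most recent update and is trained against the weights that actually come out of the next step.

```python
import random


def sgd_step(emb, target, lr):
    """Toy training step (assumption): move each embedding weight toward a fixed target."""
    return [
        [e - lr * (e - t) for e, t in zip(row, target_row)]
        for row, target_row in zip(emb, target)
    ]


def train_revisor(steps=50, lr=0.1, revisor_lr=0.5, seed=0):
    """Fit a scalar revisor w so that emb + w * last_update predicts the next embeddings."""
    rng = random.Random(seed)
    vocab, dim = 4, 3
    emb = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(vocab)]
    target = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(vocab)]

    w = 0.0  # hypothetical revisor parameter
    prev, emb = emb, sgd_step(emb, target, lr)

    for _ in range(steps):
        nxt = sgd_step(emb, target, lr)  # the weights the revisor should have predicted
        grad, scale = 0.0, 1e-12
        for r in range(vocab):
            for c in range(dim):
                last_update = emb[r][c] - prev[r][c]
                predicted = emb[r][c] + w * last_update
                error = predicted - nxt[r][c]
                grad += error * last_update
                scale += last_update * last_update
        # Normalized gradient step on the squared prediction error,
        # so the step size does not vanish as the updates shrink.
        w -= revisor_lr * grad / scale
        prev, emb = emb, nxt
    return w


if __name__ == "__main__":
    print(train_revisor())
```

In this toy setup each update is `(1 - lr)` times the previous one, so the learned `w` should approach `1 - lr`. A real revisor block would be a learned module inside the LM rather than a single scalar.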
Lots of it was borrowed from
python -m pip install -r requirements.txt
python3 train --config_path configs/small.yaml