Skip to content

Pull requests: allenai/OLMo

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Figure for plotting Pareto frontier (Flops x Perf)
#753 opened Nov 27, 2024 by kyleclo Loading…
Update README.md
#751 opened Nov 26, 2024 by revbucket Loading…
Adds support for converting from safetensors
#740 opened Oct 23, 2024 by soldni Loading…
Create an eval-only script for existing ckpts
#736 opened Oct 20, 2024 by liujch1998 Loading…
Add regression tests for training
#730 opened Oct 7, 2024 by 2015aroras Loading…
Docs model ladder
#708 opened Aug 19, 2024 by IanMagnusson Draft
Add OLMoE checkpoints and run config
#707 opened Aug 19, 2024 by 2015aroras Loading…
DNM: Loss issue checkpoint with refine1b setups
#682 opened Jul 31, 2024 by undfined Loading…
[wip] Kylel/readme
#681 opened Jul 31, 2024 by kyleclo Draft
Ladder 1xC
#677 opened Jul 27, 2024 by AkshitaB Loading…
Alternative evals
#675 opened Jul 23, 2024 by AkshitaB Loading…
1 task done
MoE
#639 opened Jun 30, 2024 by Muennighoff Loading…
muP implementation
#637 opened Jun 28, 2024 by AkshitaB Loading…
Unit tests
#635 opened Jun 26, 2024 by AkshitaB Loading…
Config for Amberish experiments at 1B
#621 opened Jun 12, 2024 by drschwenk Loading…
Normal baselines
#618 opened Jun 12, 2024 by AkshitaB Loading…
added git ref to the config keys
#617 opened Jun 11, 2024 by drschwenk Loading…
Optionally load trainer state
#573 opened May 13, 2024 by Muennighoff Loading…
Reverse weight decay
#567 opened May 3, 2024 by AkshitaB Loading…
1 task done
Add reorder cache for beam search
#526 opened Mar 26, 2024 by cshaib Loading…
Add scripts for Dave
#516 opened Mar 21, 2024 by epwalsh Draft
Scripts for QKV experiments
#510 opened Mar 20, 2024 by AkshitaB Loading…
hf_olmo: support flash attn 2
#471 opened Feb 29, 2024 by wade3han Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.