Skip to content

Pull requests: axolotl-ai-cloud/axolotl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

make sure padding is labeled as -100 for pretraining
#2227 opened Dec 30, 2024 by winglian Loading…
skip over rows in pretraining dataset
#2223 opened Dec 26, 2024 by winglian Loading…
support for custom lr groups for non-embedding modules
#2213 opened Dec 22, 2024 by winglian Loading…
KD trainer w/ logprobs
#2202 opened Dec 19, 2024 by winglian Draft
refactor trainer to prevent circular dependencies later
#2200 opened Dec 18, 2024 by winglian Loading…
use 2.5.1 docker images as latest tag as it seems stable
#2198 opened Dec 18, 2024 by winglian Loading…
convert-diff-transformer CLI command / codepath
#2197 opened Dec 17, 2024 by djsaunde Draft
5 of 7 tasks
[Fixing #2149] load_from_disk for rl tpye training
#2193 opened Dec 15, 2024 by leeparkuky Loading…
2 of 3 tasks
perform flakey patched tests in individual runner hold don't merge this yet
#2185 opened Dec 13, 2024 by winglian Loading…
rebased hymba multipack support
#2178 opened Dec 11, 2024 by bursteratom Loading…
[feature] sweeps
#2171 opened Dec 10, 2024 by winglian Loading…
Multimodal integration - pixtral/llava/qwen2-vl
#2170 opened Dec 10, 2024 by bursteratom Loading…
Fix: RL base feature parity
#2133 opened Dec 6, 2024 by NanoCode012 Draft
5 tasks
refactor(optimizer): use optimizer_cls_and_kwargs for custom optim
#2012 opened Nov 4, 2024 by NanoCode012 Loading…
3 of 6 tasks
add soap optimizer support
#1978 opened Oct 17, 2024 by winglian Loading…
shampoo optim support
#1919 opened Sep 18, 2024 by winglian Loading…
multipack support for phi moe
#1870 opened Aug 26, 2024 by winglian Loading…
semi-weekly 8bit lora zero3 check hold don't merge this yet
#1852 opened Aug 22, 2024 by winglian Loading…
add q-galore optimizer
#1752 opened Jul 14, 2024 by winglian Loading…
Implements SPPO Alignment Algoritm
#1735 opened Jul 11, 2024 by kaykyr Loading…
1 of 3 tasks
ProTip! Updated in the last three days: updated:>2024-12-27.