-
-
Notifications
You must be signed in to change notification settings - Fork 861
Issues: axolotl-ai-cloud/axolotl
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
qwen_25
chat template not working on main
bug
#1998
opened Oct 27, 2024 by
fblgit
6 of 8 tasks
RuntimeError: CUDA error: an illegal memory access was encountered [When Running SFT on Qwen2.5]
bug
Something isn't working
#1991
opened Oct 23, 2024 by
Malikeh97
6 of 8 tasks
Deepspeed zero3 training is loading models to GPUs (on init) instead of RAM
bug
Something isn't working
#1983
opened Oct 19, 2024 by
RameshArvind
6 of 8 tasks
Support for Sequence / Context Parallelism
enhancement
New feature or request
#1972
opened Oct 15, 2024 by
dwzhu-pku
5 tasks done
Flash attention and multipack failing for qwen and mistral
bug
Something isn't working
#1966
opened Oct 12, 2024 by
tiger241
6 of 8 tasks
Add resize_token_embeddings feature
enhancement
New feature or request
#1965
opened Oct 11, 2024 by
ccdv-ai
5 tasks done
Should New feature or request
tokenizer_legacy
be default as false
?
enhancement
#1955
opened Oct 10, 2024 by
tongyx361
5 tasks done
Llama will not save properly
bug
Something isn't working
#1947
opened Oct 6, 2024 by
mfirth-truffle
6 of 8 tasks
Feature Request: Adding dataset deduplication process
enhancement
New feature or request
#1946
opened Oct 5, 2024 by
Weyaxi
5 tasks done
fix_untrained_tokens doesn't work with zero-3
bug
Something isn't working
#1944
opened Oct 4, 2024 by
winglian
6 of 8 tasks
ML Flow Checkpointing
bug
Something isn't working
#1938
opened Oct 2, 2024 by
wanderingweights
6 of 8 tasks
Cannot install on Google Colab
bug
Something isn't working
#1933
opened Sep 27, 2024 by
benjamin-marie
5 of 8 tasks
Using two 8xH100 nodes to train. encounter error bf16 requested, but AMP is not supported on this GPU. Requires Ampere series or above.
bug
Something isn't working
#1924
opened Sep 23, 2024 by
michaellin99999
6 of 8 tasks
mistrall small support
enhancement
New feature or request
#1922
opened Sep 21, 2024 by
win4r
5 tasks done
Gemma 2 chat template inserts eos_token after every chat turn
bug
Something isn't working
#1921
opened Sep 20, 2024 by
Nero10578
6 of 8 tasks
Different training losses when flash_attention is on/off
bug
Something isn't working
#1918
opened Sep 18, 2024 by
zhangchen-xu
6 of 8 tasks
Add Support for Loading a Specific Dataset Revision
enhancement
New feature or request
#1911
opened Sep 12, 2024 by
thomascleberg
5 tasks done
Running Example on Free T4 GPU through Google Colab
bug
Something isn't working
#1905
opened Sep 8, 2024 by
hammad93
6 of 8 tasks
pretrain doesn't work on json\jsonl
bug
Something isn't working
#1895
opened Sep 5, 2024 by
SicariusSicariiStuff
6 of 8 tasks
Llama 3.1 liger example is not working
bug
Something isn't working
#1892
opened Sep 4, 2024 by
Stealthwriter
6 of 8 tasks
Preprocess --debug does not show newline \n token if previous string is ">" but shows if I add any other letter in the role fastchat
bug
Something isn't working
#1890
opened Sep 3, 2024 by
Nero10578
7 of 8 tasks
Training with a large json dataset (>650K) throw error:pyarrow.lib.ArrowInvalid: offset overflow while concatenating arrays
bug
Something isn't working
#1888
opened Sep 3, 2024 by
bofei5675
6 of 8 tasks
Load existing LORA and continue training it
enhancement
New feature or request
#1887
opened Sep 1, 2024 by
Nero10578
5 tasks done
MixLoRA finetuning
enhancement
New feature or request
#1880
opened Aug 28, 2024 by
winglian
5 tasks done
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.