-
Notifications
You must be signed in to change notification settings - Fork 25.5k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Tensors' device passed to a model is not correct when ACCELERATE_TORCH_DEVICE is privateuseone
#31750
opened Jul 2, 2024 by
kiszk
2 of 4 tasks
Fix 'Can't infer missing attention mask on Request for a new feature
mps
device'
Feature request
#31744
opened Jul 2, 2024 by
BlueBlazin
Encountering an error while loading a model using state_dict and quantization simultaneously
#31743
opened Jul 2, 2024 by
paprika0741
2 of 4 tasks
[i18n-<languageCode>] Translating docs to <languageName>
WIP
Label your PR/Issue with WIP for some long outstanding Issues/PRs that are work in progress
#31739
opened Jul 1, 2024 by
Foxyford
10 tasks
Pass Request for a new feature
HFQuantizer
to from_pretrained
kwargs
Feature request
#31738
opened Jul 1, 2024 by
liamd101
Inconsistent special_token addition in EncoderDecoderModel forward pass
#31729
opened Jul 1, 2024 by
emergenz
1 of 4 tasks
cannot import get_full_repo_name from huggingface_hub after updating pytorch
#31728
opened Jul 1, 2024 by
junchen14
4 tasks
how to generate router_logits in moe models using model.generate()?
Generation
#31722
opened Jul 1, 2024 by
Jimmy-Lu
1 of 4 tasks
Model loading OOM when using FSDP + QLoRA
Accelerate
PEFT
PyTorch FSDP
Quantization
#31721
opened Jul 1, 2024 by
Neo9061
2 of 4 tasks
how to remove kv cache?
Feature request
Request for a new feature
Generation
#31717
opened Jun 30, 2024 by
tsw123678
Error when using AutoTokenizer to load local files without network
#31712
opened Jun 29, 2024 by
pppppkun
2 of 4 tasks
Add Request for a new feature
bot_token
attribute to PreTrainedTokenizer
and PreTrainedTokenizerFast
Feature request
#31709
opened Jun 29, 2024 by
aw632
When I used galore, the learning rate was set to 8e-6, but the training rate was 0.001
#31707
opened Jun 29, 2024 by
Minami-su
meta-llama/Llama-2-7b-chat-hf tokenizer
model_max_length
attribute needs to be fixed.
#31705
opened Jun 28, 2024 by
rohitdwivedula
4 tasks
Unable to load models with adapter weights in offline mode
#31700
opened Jun 28, 2024 by
amyeroberts
1 of 4 tasks
Any config for DeBERTa series as decoders for TSDAE?
Feature request
Request for a new feature
#31688
opened Jun 28, 2024 by
bobox2997
NameError: free variable 'state_dict' referenced before assignment in enclosing scope
Accelerate
#31685
opened Jun 28, 2024 by
AllentDan
1 of 4 tasks
Whisper - list index out of range with word level timestamps
Audio
#31683
opened Jun 28, 2024 by
maxkvbn
2 of 4 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.