huggingface / transformers Public

Issues: huggingface/transformers

[Community contributions] Model cards

#36979 opened Mar 25, 2025 by stevhliu

Open 13

1,038 Open 15,902 Closed

Issues list

a logic error in _preprocess function of Qwen2VLImageProcessor Class bug

#37064 opened Mar 28, 2025 by InsaneGe

4 tasks

Incorrect calculation of strides leading to loss of param data upon tensor parallel use while sliced model loading bug

#37051 opened Mar 27, 2025 by kmehant

1 of 4 tasks

AutoTrain Unsloth support bug

#37050 opened Mar 27, 2025 by urroxyz

Persistent generation issues with MT5 models (base and fine-tuned) across environments

#37048 opened Mar 27, 2025 by Elpharran

Optionality of attention_mask argument in Attention classes/functions.

#37046 opened Mar 27, 2025 by Godofnothing

Latest TorchAO config breaks serialization

#37035 opened Mar 27, 2025 by airMeng

add MiniCPM-o New model

#37029 opened Mar 27, 2025 by jp1924

2 tasks done

run_mim.py script from image-pretraining example is not working bug

#37020 opened Mar 26, 2025 by jafraustro

1 of 4 tasks

SwitchTransformer: Initialization of tensor to collect expert results is incorrect for dropped tokens (from ML POV) bug

#37017 opened Mar 26, 2025 by mario-aws

2 of 4 tasks

Add NeoBERT New model

#37015 opened Mar 26, 2025 by capemox

2 tasks done

Gemma3 adding new tokens <image_soft_token> has been added accidentally bug

#37011 opened Mar 26, 2025 by Serzhanov

4 tasks

[Question] Handling of custom flex attention block masks

#37006 opened Mar 26, 2025 by ccdv-ai

GGUF model with architecture gemma3 is not supported yet. bug

#37002 opened Mar 26, 2025 by chunxingque

4 tasks

Add ArlowGPT New model

#36988 opened Mar 26, 2025 by yuchenxie4645

1 of 2 tasks

FSDP Not Working For Mamba2 bug

#36982 opened Mar 25, 2025 by zixianwang2022

2 of 4 tasks

[Community contributions] Model cards contributions-welcome Good First Documentation Issue Good First Issue

#36979 opened Mar 25, 2025 by stevhliu 100+

[Contributions Welcome] Add Fast Image Processors contributions-welcome Good First Issue Good Second Issue Processing Vision

#36978 opened Mar 25, 2025 by yonigozlan

18 of 69 tasks

QuestionAnswering for Gemma 3 Feature request

#36977 opened Mar 25, 2025 by DavidAdamczyk

Gemma3: Cuda error: misaligned address bug

#36961 opened Mar 25, 2025 by falkbene

2 of 4 tasks

Incorrect size mismatch skipping in _find_mismatched_keys causes model loading failures despite ignore_mismatched_sizes=True bug

#36960 opened Mar 25, 2025 by novotnj3

4 tasks

Symbolic trace with past_key_values input is not supported yet for qwen2. bug

#36959 opened Mar 25, 2025 by My-captain

4 tasks

Started getting new warnings for gemma3 after upgrading from 4.49.0-gemma3 to 4.50.0

#36942 opened Mar 24, 2025 by HJJ256

Add param_to_hook_all_reduce parameter in HF Trainer Feature request

#36941 opened Mar 24, 2025 by awsankur

Gemma3 not supported in main branch bug

#36940 opened Mar 24, 2025 by xihuai18

2 of 4 tasks

AttributeError: 'HybridCache' object has no attribute 'float' — PaliGemma2 Evaluation Fails with BF16 bug Cache

#36938 opened Mar 24, 2025 by iremeyiokur

2 of 4 tasks
