V4.40 release IFU #36

Open
wants to merge 821 commits into
base: main_old
821 commits
a1a7454
fix galore layerwise with frozen params (#29743)
peterjc123 Mar 20, 2024
776c9d3
[Tests] Remove unused code (#29737)
NielsRogge Mar 20, 2024
76b3b20
Update Mamba types and pass through use_cache attr to MambaModel (#29…
koayon Mar 20, 2024
870bbb4
fix jinja2 package version check (#29754)
Ricardo-L-C Mar 20, 2024
11ef35e
Support sharded safetensors in TF (#29350)
Rocketknight1 Mar 20, 2024
1248f09
v4.40.0.dev.0
ArthurZucker Mar 20, 2024
3c17c52
SuperPointModel -> SuperPointForKeypointDetection (#29757)
amyeroberts Mar 20, 2024
9d99948
Add correct batched handling for apply_chat_template (#29222)
Rocketknight1 Mar 20, 2024
d91fd7f
Add LLaVa-1.6, bis (#29586)
NielsRogge Mar 20, 2024
c78f577
Update test reqs to include sentencepiece (#29756)
muellerzr Mar 20, 2024
17e4467
Fix docker image build (#29762)
ydshieh Mar 20, 2024
8dd4ce6
[`BitsAndBytesConfig`] Warning for unused `kwargs` & safety checkers …
benjaminye Mar 20, 2024
ff84190
[`BC 4.37 -> 4.38`] for Llama family, memory and speed (#29753)
ArthurZucker Mar 20, 2024
5d1a58a
[docs] Remove redundant `-` and `the` from custom_tools.md (#29767)
windsonsea Mar 21, 2024
0639034
Fixed typo in quantization_config.py (#29766)
kurokiasahi222 Mar 21, 2024
9556054
OWL-ViT box_predictor inefficiency issue (#29712)
RVV-karma Mar 21, 2024
691c3d7
Allow `-OO` mode for `docstring_decorator` (#29689)
matthid Mar 21, 2024
fd734be
fix issue with logit processor during beam search in Flax (#29636)
giganttheo Mar 21, 2024
2ddceef
Fix docker image build for `Latest PyTorch + TensorFlow [dev]` (#29764)
ydshieh Mar 21, 2024
73a73b4
[`LlavaNext`] Fix llava next unsafe imports (#29773)
ArthurZucker Mar 21, 2024
de627f5
Cast bfloat16 to float32 for Numpy conversions (#29755)
Rocketknight1 Mar 21, 2024
f0bfb15
Silence deprecations and use the DataLoaderConfig (#29779)
muellerzr Mar 21, 2024
10d232e
Add deterministic config to `set_seed` (#29778)
muellerzr Mar 21, 2024
ef6e371
Add support for `torch_dtype` in the run_mlm example (#29776)
jla524 Mar 21, 2024
5ffef2a
Generate: remove legacy generation mixin imports (#29782)
gante Mar 21, 2024
ee38fc3
Llama: always convert the causal mask in the SDPA code path (#29663)
gante Mar 21, 2024
b469ebc
Prepend `bos token` to Blip generations (#29642)
zucchini-nlp Mar 21, 2024
fadb053
Change in-place operations to out-of-place in LogitsProcessors (#29680)
zucchini-nlp Mar 21, 2024
e68ff30
[`quality`] update quality check to make sure we check imports 😈 (#2…
ArthurZucker Mar 22, 2024
3479161
Fix type hint for train_dataset param of Trainer.__init__() to allow …
stevemadere Mar 22, 2024
aa17cf9
Enable AMD docker build CI (#29803)
IlyasMoutawwakil Mar 22, 2024
13b2370
Correct llava mask & fix missing setter for `vocab_size` (#29389)
fxmarty Mar 22, 2024
e85654f
rm input dtype change in CPU (#28631)
jiqing-feng Mar 22, 2024
34e07f4
Generate: remove unused attributes in `AssistedCandidateGenerator` (#…
gante Mar 22, 2024
884b221
replaced concatenation to f-strings to improve readability and unify …
igeni Mar 22, 2024
2e7cb46
[`cleanup`] vestiges of causal mask (#29806)
ArthurZucker Mar 22, 2024
7e1413d
Complete security policy with mentions of remote code (#29707)
LysandreJik Mar 22, 2024
c5f0288
[`SuperPoint`] Fix doc example (#29816)
amyeroberts Mar 22, 2024
dafe370
[DOCS] Fix typo for llava next docs (#29829)
aliencaocao Mar 23, 2024
76a33a1
model_summary.md - Restore link to Harvard's Annotated Transformer. (…
gamepad-coder Mar 24, 2024
39114c0
Remove static pretrained maps from the library's internals (#29112)
LysandreJik Mar 25, 2024
afe73ae
Fix the behavior of collecting 'num_input_tokens_seen' (#29099)
youliangh Mar 25, 2024
8e9a220
Populate torch_dtype from model to pipeline (#28940)
B-Step62 Mar 25, 2024
00a09ed
fix 😭
ArthurZucker Mar 25, 2024
e3e16dd
[`revert commit`] revert 00a09ed448082da3d6d35fb23a37b7d04f7b4dcd
ArthurZucker Mar 25, 2024
7eb3ba8
remove quotes in code example (#29812)
johko Mar 25, 2024
b5a6d6e
Add warnings if training args differ from checkpoint trainer state (#…
jonflynng Mar 26, 2024
b32bf85
Replace 'decord' with 'av' in VideoClassificationPipeline (#29747)
Tyx-main Mar 26, 2024
de81a67
Fix header in IFE task guide (#29859)
merveenoyan Mar 26, 2024
b9ceb03
[docs] Indent ordered list in add_new_model.md (#29796)
windsonsea Mar 26, 2024
998b5bb
Allow `bos_token_id is None` during the generation with `inputs_embed…
LZHgrla Mar 26, 2024
ef60995
Add `cosine_with_min_lr` scheduler in Trainer (#29341)
liuyanyi Mar 26, 2024
07d7952
Disable AMD memory benchmarks (#29871)
IlyasMoutawwakil Mar 26, 2024
f01e160
Set custom_container in build docs workflows (#29855)
Wauplin Mar 26, 2024
8e08aca
Support `num_attention_heads` != `num_key_value_heads` in Flax Llama …
bminixhofer Mar 27, 2024
1c39974
Add Qwen2MoE (#29377)
bozheng-hit Mar 27, 2024
cefb819
Mamba `slow_forward` gradient fix (#29563)
vasqu Mar 27, 2024
a81cf9e
Fix 29807, sinusoidal positional encodings overwritten by post_init()…
hovnatan Mar 27, 2024
4d8427f
Reimplement "Automatic safetensors conversion when lacking these file…
LysandreJik Mar 27, 2024
31c575b
fix fuyu device_map compatibility (#29880)
SunMarc Mar 27, 2024
0efcf32
Move `eos_token_id` to stopping criteria (#29459)
zucchini-nlp Mar 27, 2024
7576974
add Cambricon MLUs support (#29627)
huismiling Mar 27, 2024
a25037b
MixtralSparseMoeBlock: add gate jitter (#29865)
lorenzoverardo Mar 27, 2024
d9dc993
Fix typo in T5Block error message (#29881)
Mingosnake Mar 28, 2024
b256516
[`make fix-copies`] update and help (#29924)
ArthurZucker Mar 28, 2024
543889f
[`GptNeox`] don't gather on pkv when using the trainer (#29892)
ArthurZucker Mar 28, 2024
3a7e683
[`pipeline`]. Zero shot add doc warning (#29845)
ArthurZucker Mar 28, 2024
22d159d
Adding Flash Attention 2 Support for GPT2 (#29226)
EduardoPach Mar 28, 2024
7c19faf
[doc] fix some typos and add `xpu` to the testing documentation (#29894)
faaany Mar 28, 2024
248d5d2
Tests: replace `torch.testing.assert_allclose` by `torch.testing.asse…
gante Mar 28, 2024
c9d2e85
Add beam search visualizer to the doc (#29876)
aymeric-roucher Mar 28, 2024
855b95c
Safe import of LRScheduler (#29919)
amyeroberts Mar 28, 2024
aac7099
add functions to inspect model and optimizer status to trainer.py (#2…
CKeibel Mar 28, 2024
441de62
RoPE models: add numerical sanity-check test for RoPE scaling (#29808)
gante Mar 28, 2024
e677479
[`Mamba`] from pretrained issue with `self.embeddings` (#29851)
ArthurZucker Mar 28, 2024
a2a7f71
[ `TokenizationLlama`] fix the way we convert tokens to strings to ke…
ArthurZucker Mar 28, 2024
4df5b9b
Allow GradientAccumulationPlugin to be configured from AcceleratorCon…
fabianlim Mar 28, 2024
2bbbf1b
[`BC`] Fix BC for other libraries (#29934)
ArthurZucker Mar 28, 2024
e203646
Fix doc issue #29758 in DebertaV2Config class (#29842)
vinayakkgarg Mar 28, 2024
536ea2a
[`LlamaSlowConverter`] Slow to Fast better support (#29797)
ArthurZucker Mar 28, 2024
ba56ed0
Update installs in image classification doc (#29947)
MariaHei Mar 28, 2024
43d17c1
Mark `test_eager_matches_sdpa_generate` flaky for some models (#29479)
ydshieh Mar 29, 2024
5ad7f17
Super tiny fix 12 typos about "with with" (#29926)
fzyzcjy Mar 29, 2024
6fd93fe
Fix rope theta for OpenLlama (#29893)
jla524 Mar 30, 2024
156d30d
Add warning message for `run_qa.py` (#29867)
jla524 Mar 30, 2024
e644b60
fix: get mlflow version from mlflow-skinny (#29918)
Mar 30, 2024
f6701bc
Reset alarm signal when the function is ended (#29706)
coldnight Mar 30, 2024
46d6368
Update model card and link of blog post. (#29928)
bozheng-hit Mar 30, 2024
6e58407
[`BC`] Fix BC for AWQ quant (#29965)
TechxGenus Mar 30, 2024
3b8e293
Rework tests to compare trainer checkpoint args (#29883)
muellerzr Mar 31, 2024
569f6c7
Fix FA2 tests (#29909)
ylacombe Apr 1, 2024
fa2c49b
Fix copies main ci (#29979)
ArthurZucker Apr 1, 2024
e4f5b57
[tests] fix the wrong output in `ImageToTextPipelineTests.test_condit…
faaany Apr 1, 2024
c9f6e5e
Generate: move misplaced test (#29902)
gante Apr 1, 2024
096f304
[docs] Big model loading (#29920)
stevhliu Apr 2, 2024
83b26dd
[`generate`] fix breaking change for patch (#29976)
ArthurZucker Apr 2, 2024
416711c
Fix 29807 sinusoidal positional encodings in Flaubert, Informer and X…
hovnatan Apr 2, 2024
33288ff
[bnb] Fix bug in `_replace_with_bnb_linear` (#29958)
SunMarc Apr 2, 2024
fed27ff
Adding FlaxNoRepeatNGramLogitsProcessor (#29677)
giganttheo Apr 2, 2024
0d04b1e
Add Flash Attention 2 support to Musicgen and Musicgen Melody (#29939)
ylacombe Apr 2, 2024
cb5927c
[Docs] Make an ordered list prettier in add_tensorflow_model.md (#29949)
windsonsea Apr 2, 2024
15cd687
Fix `skip_special_tokens` for `Wav2Vec2CTCTokenizer._decode` (#29311)
msublee Apr 2, 2024
9b0a8ea
Hard error when ignoring tensors. (#27484) (#29906)
Narsil Apr 2, 2024
5080ab1
Generate: fix logits processors doctests (#29718)
gante Apr 2, 2024
fce52ce
Fix `remove_columns` in `text-classification` example (#29351)
mariosasko Apr 2, 2024
b44df05
Update `tests/utils/tiny_model_summary.json` (#29941)
ydshieh Apr 3, 2024
81642d2
Make EncodecModel.decode ONNX exportable (#29913)
fxmarty Apr 3, 2024
17b06e2
Fix Swinv2ForImageClassification NaN output (#29981)
miguelm-almeida Apr 3, 2024
851f253
Fix Qwen2Tokenizer (#29929)
jklj077 Apr 3, 2024
bcd42c4
Fix `kwargs` handling in `generate_with_fallback` (#29225)
cifkao Apr 3, 2024
240e106
Fix probability computation in `WhisperNoSpeechDetection` when recomp…
cifkao Apr 3, 2024
cc75f1a
Fix vipllava for generation (#29874)
zucchini-nlp Apr 3, 2024
34bfe95
[docs] Fix audio file (#30006)
stevhliu Apr 3, 2024
c10b5dd
Superpoint imports fix (#29898)
zucchini-nlp Apr 3, 2024
695d823
[`Main CIs`] Fix the red cis (#30022)
ArthurZucker Apr 3, 2024
863e256
Make clearer about zero_init requirements (#29879)
muellerzr Apr 3, 2024
03732de
Enable multi-device for efficientnet (#29989)
jla524 Apr 3, 2024
4e6c5eb
Add a converter from mamba_ssm -> huggingface mamba (#29705)
byi8220 Apr 4, 2024
75b76a5
[`ProcessingIdefics`] Attention mask bug with padding (#29449)
byi8220 Apr 4, 2024
517a3e6
Refactor Cohere Model (#30027)
saurabhdash2512 Apr 4, 2024
24d787c
Add `whisper` to `IMPORTANT_MODELS` (#30046)
ydshieh Apr 5, 2024
8b52fa6
skip `test_encode_decode_fast_slow_all_tokens` for now (#30044)
ydshieh Apr 5, 2024
79d62b2
if output is tuple like facebook/hf-seamless-m4t-medium, waveform is …
sywangyi Apr 5, 2024
d704c0b
Fix mixtral ONNX Exporter Issue. (#29858)
AdamLouly Apr 5, 2024
1ab7136
[Trainer] Allow passing image processor (#29896)
NielsRogge Apr 5, 2024
4207a40
[bnb] Fix offload test (#30039)
SunMarc Apr 5, 2024
de11d0b
Update quantizer_bnb_4bit.py: In the ValueError string there should b…
miRx923 Apr 5, 2024
4879531
[test fetcher] Always include the directly related test files (#30050)
ydshieh Apr 5, 2024
17cd7a9
Fix `torch.fx` symbolic tracing for LLama (#30047)
michaelbenayoun Apr 5, 2024
b17b54d
Refactor daily CI workflow (#30012)
ydshieh Apr 5, 2024
d9fa13c
Add docstrings and types for MambaCache (#30023)
koayon Apr 5, 2024
9b5a645
Fix auto tests (#30067)
ydshieh Apr 5, 2024
76fa17c
Fix whisper kwargs and generation config (#30018)
zucchini-nlp Apr 5, 2024
1fc34aa
doc: Correct spelling mistake (#30107)
caiyili Apr 8, 2024
1ed93be
[Whisper] Computing features on GPU in batch mode for whisper feature…
vaibhavagg303 Apr 8, 2024
a907a90
Change log level to warning for num_train_epochs override (#30014)
xu-song Apr 8, 2024
836e88c
Make MLFlow version detection more robust and handles mlflow-skinny (…
helloworld1 Apr 8, 2024
5e673ed
updated examples/pytorch/language-modeling scripts and requirements.t…
Patchwork53 Apr 8, 2024
d16f0ab
[tests] add `require_bitsandbytes` marker (#30116)
faaany Apr 8, 2024
f565873
fixing issue 30034 - adding data format for run_ner.py (#30088)
JINO-ROHIT Apr 8, 2024
7f9aff9
Patch fix - don't use safetensors for TF models (#30118)
amyeroberts Apr 8, 2024
0201f64
[#29174] ImportError Fix: Trainer with PyTorch requires accelerate>=0…
UtkarshaGupte Apr 8, 2024
08c8443
Accept token in trainer.push_to_hub() (#30093)
mapmeld Apr 8, 2024
1773afc
fix learning rate display in trainer when using galore optimizer (#30…
vasqu Apr 8, 2024
1897874
Fix falcon with SDPA, alibi but no passed mask (#30123)
fxmarty Apr 8, 2024
a71def0
Trainer / Core : Do not change init signature order (#30126)
younesbelkada Apr 8, 2024
ef38e2a
Make vitdet jit trace complient (#30065)
fxmarty Apr 8, 2024
7afade2
Fix typo at ImportError (#30090)
DrAnaximandre Apr 8, 2024
8c00b53
Adding `mps` as device for `Pipeline` class (#30080)
fnhirwa Apr 8, 2024
2f12e40
[`StableLm`] Add QK normalization and Parallel Residual Support (#29745)
jon-tow Apr 8, 2024
4e3490f
Fix failing DeepSpeed model zoo tests (#30112)
pacman100 Apr 9, 2024
af4c026
Add datasets.Dataset to Trainer's train_dataset and eval_dataset type…
ringohoffman Apr 9, 2024
841e87e
Fix docs Pop2Piano (#30140)
zucchini-nlp Apr 9, 2024
ec59a42
Revert workaround for TF safetensors loading (#30128)
Rocketknight1 Apr 9, 2024
ba1b24e
[Trainer] Fix default data collator (#30142)
NielsRogge Apr 9, 2024
e9c23fa
[Trainer] Undo #29896 (#30129)
NielsRogge Apr 9, 2024
08a194f
Fix slow tests for important models to be compatible with A10 runners…
ydshieh Apr 9, 2024
6487e9b
Send headers when converting safetensors (#30144)
ydshieh Apr 9, 2024
58a939c
Fix quantization tests (#29914)
SunMarc Apr 9, 2024
21e23ff
[docs] Fix image segmentation guide (#30132)
stevhliu Apr 9, 2024
6cdbd73
[CI] Fix setup (#30147)
SunMarc Apr 9, 2024
4157976
Fix length related warnings in speculative decoding (#29585)
zucchini-nlp Apr 10, 2024
56d001b
Fix and simplify semantic-segmentation example (#30145)
qubvel Apr 10, 2024
bb76f81
[CI] Quantization workflow fix (#30158)
SunMarc Apr 10, 2024
1854637
[tests] make 2 tests device-agnostic (#30008)
faaany Apr 10, 2024
b7d002b
Add str to TrainingArguments report_to type hint (#30078)
ringohoffman Apr 10, 2024
50c1c19
[UDOP] Fix tests (#29573)
NielsRogge Apr 10, 2024
505854f
[UDOP] Improve docs, add resources (#29571)
NielsRogge Apr 10, 2024
0f94e3e
Fix accelerate kwargs for versions <0.28.0 (#30086)
vasqu Apr 10, 2024
33bca54
Fix typing annotation in hf_argparser (#30156)
xu-song Apr 10, 2024
0fe4405
Add recurrent gemma (#30143)
ArthurZucker Apr 10, 2024
3280b13
Fixing a bug when MlFlow try to log a torch.tensor (#29932)
etiennebonnafoux Apr 10, 2024
4f7a9f9
Fix natten install in docker (#30161)
ydshieh Apr 10, 2024
f569172
FIX / bnb: fix torch compatiblity issue with `itemize` (#30162)
younesbelkada Apr 10, 2024
d71f5b3
Update config class check in auto factory (#29854)
Rocketknight1 Apr 10, 2024
a5e5c92
Fixed typo in comments/documentation for Pipelines documentation (#30…
DamonGuzman Apr 10, 2024
b752ad3
Adding grounding dino (#26087)
EduardoPach Apr 11, 2024
fbdb978
Fix Llava chat template examples (#30130)
lewtun Apr 11, 2024
e50be9a
Guard XLA version imports (#30167)
muellerzr Apr 11, 2024
58b170c
chore: remove repetitive words (#30174)
hugehope Apr 11, 2024
e516d1b
fix: Fixed `ruff` configuration to avoid deprecated configuration war…
Sai-Suraj-27 Apr 11, 2024
386ef34
[Processor classes] Update docs (#29698)
NielsRogge Apr 11, 2024
5569552
Update output of SuperPointForKeypointDetection (#29809)
NielsRogge Apr 11, 2024
edf0935
Falcon: make activation, ffn_hidden_size configurable (#30134)
sshleifer Apr 11, 2024
0bd58f1
Docs PR template (#30171)
stevhliu Apr 11, 2024
2c66600
ENH: [`CI`] Add new workflow to run slow tests of important models on…
younesbelkada Apr 12, 2024
c82b38a
Fix pipeline logger.warning_once bug (#30195)
amyeroberts Apr 12, 2024
caa5c65
fix: Replaced deprecated `logger.warn` with `logger.warning` (#30197)
Sai-Suraj-27 Apr 12, 2024
ac33aee
fix typo (#30220)
mdeff Apr 12, 2024
65657d5
fix fuyu doctest (#30215)
molbap Apr 12, 2024
bf9a7ab
Fix `RecurrentGemmaIntegrationTest.test_2b_sample` (#30222)
ydshieh Apr 12, 2024
4f7b434
Update modeling_bark.py (#30221)
bes-dev Apr 12, 2024
db7d155
Fix/Update for doctest (#30216)
ydshieh Apr 12, 2024
b109257
Fixed config.json download to go to user-supplied cache directory (#3…
ulatekh Apr 12, 2024
8fd2de9
Add test for parse_json_file and change typing to os.PathLike (#30183)
xu-song Apr 15, 2024
06b1192
fix: Replace deprecated `assertEquals` with `assertEqual` (#30241)
Sai-Suraj-27 Apr 15, 2024
f010786
Set pad_token in run_glue_no_trainer.py #28534 (#30234)
JINO-ROHIT Apr 15, 2024
b3595cf
fix: Replaced deprecated `typing.Text` with `str` (#30230)
Sai-Suraj-27 Apr 15, 2024
b6b6daf
Refactor doctest (#30210)
ydshieh Apr 15, 2024
fc8eda3
fix: Fixed `type annotation` for compatability with python 3.8 (#30243)
Sai-Suraj-27 Apr 15, 2024
ec344b5
Separate out kwargs in processor (#30193)
amyeroberts Apr 15, 2024
fe2d20d
Fix doctest more (for `docs/source/en`) (#30247)
ydshieh Apr 15, 2024
7668101
round epoch only in console (#30237)
xdedss Apr 15, 2024
440bd3c
update github actions packages' version to suppress warnings (#30249)
ydshieh Apr 15, 2024
667939a
[tests] add the missing `require_torch_multi_gpu` flag (#30250)
faaany Apr 15, 2024
6b78360
Add Idefics2 (#30253)
amyeroberts Apr 15, 2024
8127f39
[Docs] Update recurrent_gemma.md for some minor nits (#30238)
sayakpaul Apr 15, 2024
5be2130
Remove incorrect arg in codellama doctest (#30257)
Rocketknight1 Apr 15, 2024
51bcadc
Update `ko/_toctree.yml` (#30062)
jungnerd Apr 15, 2024
cbc2cc1
More fixes for doctest (#30265)
ydshieh Apr 16, 2024
ddf5f25
FIX: Fix corner-case issue with the important models workflow (#30212)
younesbelkada Apr 16, 2024
b86d0f4
FIX: Fix 8-bit serialization tests (#30051)
younesbelkada Apr 16, 2024
487505f
Allow for str versions of dicts based on typing (#30227)
muellerzr Apr 16, 2024
60dea59
Workflow: Update tailscale to release version (#30268)
younesbelkada Apr 16, 2024
0eaef0c
add `push_to_hub` to pipeline (#29172)
not-lain Apr 16, 2024
e27d930
Raise relevent err when wrong type is passed in as the accelerator_co…
muellerzr Apr 16, 2024
c63f158
BLIP - fix pt-tf equivalence test (#30258)
amyeroberts Apr 16, 2024
37b5946
fix: Fixed a `raise` statement (#30275)
Sai-Suraj-27 Apr 16, 2024
5fabebd
Fix test fetcher (doctest) + `Idefics2`'s doc example (#30274)
ydshieh Apr 16, 2024
40eb6d6
Fix SDPA sliding window compatibility (#30127)
fxmarty Apr 17, 2024
4114524
Fix SpeechT5 forward docstrings (#30287)
ylacombe Apr 17, 2024
080b700
FIX / AWQ: Fix failing exllama test (#30288)
younesbelkada Apr 17, 2024
98717cb
Configuring Translation Pipelines documents update #27753 (#29986)
UtkarshaGupte Apr 17, 2024
304c6a1
Enable fx tracing for Mistral (#30209)
zucchini-nlp Apr 17, 2024
05dab4e
Fix test `ExamplesTests::test_run_translation` (#30281)
ydshieh Apr 17, 2024
eb75516
Fix `Fatal Python error: Bus error` in `ZeroShotAudioClassificationPi…
ydshieh Apr 17, 2024
812a5de
FIX: Fix push important models CI (#30291)
younesbelkada Apr 17, 2024
8d6b509
Add token type ids to CodeGenTokenizer (#29265)
st81 Apr 17, 2024
c15aad0
Add strategy to store results in evaluation loop (#30267)
qubvel Apr 17, 2024
8e5f76f
Upgrading to tokenizers 0.19.0 (#30289)
Narsil Apr 17, 2024
e4ea19b
Add OLMo model family (#29890)
2015aroras Apr 17, 2024
05bdef1
Re-enable SDPA's FA2 path (#30070)
fxmarty Apr 17, 2024
ec92f98
Fix quality Olmo + SDPA (#30302)
fxmarty Apr 17, 2024
b65df51
Add Flash Attention 2 to M2M100 model (#30256)
visheratin Apr 18, 2024
7915a25
Fix donut token2json multiline (#30300)
qubvel Apr 18, 2024
28a2283
Fix all torch pipeline failures except one (#30290)
ydshieh Apr 18, 2024
3f20877
Add jamba (#29943)
tomeras91 Apr 18, 2024
9459efb
Add atol for sliding window test (#30303)
fxmarty Apr 18, 2024
7509a0a
Fix RecurrentGemma device_map (#30273)
SunMarc Apr 18, 2024
acab997
Revert "Re-enable SDPA's FA2 path (#30070)" (#30314)
ArthurZucker Apr 18, 2024
63c5e27
Do not drop mask with SDPA for more cases (#30311)
fxmarty Apr 18, 2024
005b957
Add DBRX Model (#29921)
abhi-mosaic Apr 18, 2024
5728b5a
FIX: Fixes unexpected behaviour for Llava / LLama & AWQ Fused modules…
younesbelkada Apr 18, 2024
745bbfe
Release: v4.40.0
LysandreJik Apr 18, 2024
f8fec6b
Make EosTokenCriteria compatible with mps (#30376)
pcuenca Apr 23, 2024
9fe3f58
v4.40.1
ArthurZucker Apr 23, 2024
bb98e7c
Fix for Neuron (#30259)
michaelbenayoun May 2, 2024
6530a98
Fix copies for DBRX - neuron fix (#30610)
amyeroberts May 2, 2024
4fdf58a
v4.40.2
ArthurZucker May 6, 2024
35458a2
upstream v4.40 IFU
Cemberk May 29, 2024
2 changes: 1 addition & 1 deletion .circleci/TROUBLESHOOT.md
@@ -1,6 +1,6 @@
# Troubleshooting

This is a document explaining how to deal with various issues on Circle-CI. The entries may include actually solutions or pointers to Issues that cover those.
This is a document explaining how to deal with various issues on Circle-CI. The entries may include actual solutions or pointers to Issues that cover those.

## Circle CI

1 change: 1 addition & 0 deletions .circleci/config.yml
@@ -157,6 +157,7 @@ jobs:
command: pip freeze | tee installed.txt
- store_artifacts:
path: ~/transformers/installed.txt
- run: python -c "from transformers import *" || (echo '🚨 import failed, this means you introduced unprotected imports! 🚨'; exit 1)
- run: ruff check examples tests src utils
- run: ruff format tests src utils --check
- run: python utils/custom_init_isort.py --check_only
63 changes: 37 additions & 26 deletions .circleci/create_circleci_config.py
@@ -32,7 +32,7 @@
"RUN_PT_FLAX_CROSS_TESTS": False,
}
# Disable the use of {"s": None} as the output is way too long, causing the navigation on CircleCI impractical
COMMON_PYTEST_OPTIONS = {"max-worker-restart": 0, "dist": "loadfile"}
COMMON_PYTEST_OPTIONS = {"max-worker-restart": 0, "dist": "loadfile", "v": None}
DEFAULT_DOCKER_IMAGE = [{"image": "cimg/python:3.8.12"}]
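
The options dictionary above maps pytest option names to values, and a `None` value (as in the newly added `"v": None`) appears to mark a flag that takes no argument. A minimal sketch of how such a dictionary could be rendered into a command line follows. The helper name `render_pytest_flags` and the single-dash convention for `None` values are assumptions made for illustration, not code taken from this diff.

```python
from typing import Any, Dict, List


def render_pytest_flags(options: Dict[str, Any]) -> List[str]:
    """Hypothetical helper: turn an options dict into pytest CLI flags.

    Assumption: a value of None means "bare flag" (e.g. -v),
    anything else becomes --key=value.
    """
    flags = []
    for key, value in options.items():
        if value is None:
            flags.append(f"-{key}")
        else:
            flags.append(f"--{key}={value}")
    return flags


# Same dictionary as the updated line in the diff.
COMMON_PYTEST_OPTIONS = {"max-worker-restart": 0, "dist": "loadfile", "v": None}

command = " ".join(["python", "-m", "pytest"] + render_pytest_flags(COMMON_PYTEST_OPTIONS))
print(command)
```

Under these assumptions, adding `"v": None` simply appends `-v` to the generated command, which makes pytest report each test by name.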


@@ -52,14 +52,14 @@ class CircleCIJob:
name: str
additional_env: Dict[str, Any] = None
cache_name: str = None
cache_version: str = "0.7"
cache_version: str = "0.8.2"
docker_image: List[Dict[str, str]] = None
install_steps: List[str] = None
marker: Optional[str] = None
parallelism: Optional[int] = 1
pytest_num_workers: int = 8
pytest_num_workers: int = 12
pytest_options: Dict[str, Any] = None
resource_class: Optional[str] = "xlarge"
resource_class: Optional[str] = "2xlarge"
tests_to_run: Optional[List[str]] = None
working_directory: str = "~/transformers"
# This should be only used for doctest job!
@@ -128,22 +128,6 @@ def to_dict(self):
steps.extend([{"run": l} for l in self.install_steps])
steps.extend([{"run": 'pip install "fsspec>=2023.5.0,<2023.10.0"'}])
steps.extend([{"run": "pip install pytest-subtests"}])
steps.append(
{
"save_cache": {
"key": f"v{self.cache_version}-{self.cache_name}-{cache_branch_prefix}-pip-" + '{{ checksum "setup.py" }}',
"paths": ["~/.cache/pip"],
}
}
)
steps.append(
{
"save_cache": {
"key": f"v{self.cache_version}-{self.cache_name}-{cache_branch_prefix}-site-packages-" + '{{ checksum "setup.py" }}',
"paths": ["~/.pyenv/versions/"],
}
}
)
steps.append({"run": {"name": "Show installed libraries and their versions", "command": "pip freeze | tee installed.txt"}})
steps.append({"store_artifacts": {"path": "~/transformers/installed.txt"}})

@@ -227,7 +211,7 @@ def to_dict(self):
# failure.
test_command = f"({test_command}) || true"
else:
test_command += " || true"
test_command = f"({test_command} | tee tests_output.txt) || true"
steps.append({"run": {"name": "Run tests", "command": test_command}})
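
The updated branch pipes the test output through `tee` so it is written to `tests_output.txt` while still being shown in the job log, and the trailing `|| true` keeps the step from failing on a non-zero exit. A minimal sketch of that string construction, assuming a hypothetical helper `wrap_test_command` and an illustrative pytest invocation, neither of which appears in the diff:

```python
def wrap_test_command(test_command: str, output_file: str = "tests_output.txt") -> str:
    """Hypothetical helper mirroring the updated line in the diff.

    The parentheses group the pipeline into a subshell so that `|| true`
    applies to the whole pipeline, not only to `tee`.
    """
    return f"({test_command} | tee {output_file}) || true"


# Illustrative command only; the real one is assembled elsewhere in the script.
wrapped = wrap_test_command("python -m pytest -n 12 tests")
print(wrapped)
```

Because the step always exits successfully, failures have to be detected afterwards from the saved output or reports, which is consistent with the "Deal with errors" step that follows in the file.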

# Deal with errors
@@ -264,6 +248,25 @@ def to_dict(self):

steps.append({"store_artifacts": {"path": "~/transformers/tests_output.txt"}})
steps.append({"store_artifacts": {"path": "~/transformers/reports"}})

# save cache at the end: so pytest step runs before cache saving and we can see results earlier
steps.append(
{
"save_cache": {
"key": f"v{self.cache_version}-{self.cache_name}-{cache_branch_prefix}-pip-" + '{{ checksum "setup.py" }}',
"paths": ["~/.cache/pip"],
}
}
)
steps.append(
{
"save_cache": {
"key": f"v{self.cache_version}-{self.cache_name}-{cache_branch_prefix}-site-packages-" + '{{ checksum "setup.py" }}',
"paths": ["~/.pyenv/versions/"],
}
}
)

job["steps"] = steps
return job
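
The `save_cache` keys above combine the cache version, the cache name, a branch prefix, and a checksum template that CircleCI expands later. A minimal sketch of the key construction, showing why bumping `cache_version` from `"0.7"` to `"0.8.2"` yields a key that no longer matches the old cache. The function name `cache_key` and the sample values `"torch"` and `"pull"` are assumptions for illustration only.

```python
def cache_key(cache_version: str, cache_name: str, branch_prefix: str, kind: str) -> str:
    """Hypothetical helper reproducing the key format used in the diff.

    The checksum part is a plain (non-f) string, so the double braces
    are kept literally for CircleCI to expand.
    """
    return f"v{cache_version}-{cache_name}-{branch_prefix}-{kind}-" + '{{ checksum "setup.py" }}'


# "torch" and "pull" are made-up example values, not taken from the diff.
old_key = cache_key("0.7", "torch", "pull", "pip")
new_key = cache_key("0.8.2", "torch", "pull", "pip")

print(old_key)
print(new_key)
print(old_key != new_key)
```

Since the version string is the first component of the key, changing it invalidates both the `pip` and the `site-packages` caches at once.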

@@ -283,6 +286,8 @@ def job_name(self):
"pip install -U --upgrade-strategy eager .[sklearn,tf-cpu,torch,testing,sentencepiece,torch-speech,vision]",
"pip install -U --upgrade-strategy eager tensorflow_probability",
"pip install -U --upgrade-strategy eager -e git+https://github.com/huggingface/accelerate@main#egg=accelerate",
# TODO: remove this one after fixing the dependency issue(s) above
"pip install -U --upgrade-strategy eager torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu",
],
marker="is_pt_tf_cross_test",
pytest_options={"rA": None, "durations": 0},
@@ -312,7 +317,7 @@ def job_name(self):
"pip install -U --upgrade-strategy eager -e git+https://github.com/huggingface/accelerate@main#egg=accelerate",
],
parallelism=1,
pytest_num_workers=6,
pytest_num_workers=12,
)


@@ -348,7 +353,7 @@ def job_name(self):
"pip install -U --upgrade-strategy eager .[sklearn,torch,testing,sentencepiece,torch-speech,vision,timm,video]",
],
marker="is_pipeline_test",
pytest_num_workers=6,
pytest_num_workers=12,
)


@@ -470,15 +475,18 @@ def job_name(self):
"pip install -U --upgrade-strategy eager 'git+https://github.com/facebookresearch/detectron2.git'",
"sudo apt install tesseract-ocr",
"pip install -U --upgrade-strategy eager pytesseract",
"pip install -U --upgrade-strategy eager natten",
"pip install --upgrade-strategy eager sentencepiece",
"pip install -U --upgrade-strategy eager natten==0.15.1+torch210cpu -f https://shi-labs.com/natten/wheels",
"pip install -U --upgrade-strategy eager python-Levenshtein",
"pip install -U --upgrade-strategy eager opencv-python",
"pip install -U --upgrade-strategy eager nltk",
"pip uninstall -y torch torchvision torchaudio && pip install -U --upgrade-strategy eager 'torch<2.2.0' 'torchvision<0.17' 'torchaudio<2.2.0'"
],
tests_to_run=[
"tests/models/*layoutlmv*",
"tests/models/*nat",
"tests/models/deta",
"tests/models/udop",
"tests/models/nougat",
],
pytest_num_workers=1,
@@ -513,8 +521,11 @@ def job_name(self):
"pip install --upgrade --upgrade-strategy eager pip",
"pip install -U --upgrade-strategy eager -e .[dev]",
"pip install -U --upgrade-strategy eager -e git+https://github.com/huggingface/accelerate@main#egg=accelerate",
"pip install --upgrade --upgrade-strategy eager pytest pytest-sugar",
"pip install -U --upgrade-strategy eager natten",
"pip install --upgrade --upgrade-strategy eager 'pytest<8.0.0' pytest-sugar",
"pip install -U --upgrade-strategy eager natten==0.15.1+torch210cpu -f https://shi-labs.com/natten/wheels",
"pip install -U --upgrade-strategy eager g2p-en",
# TODO: remove this one after fixing the dependency issue(s) above
"pip install -U --upgrade-strategy eager torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu",
"find -name __pycache__ -delete",
"find . -name \*.pyc -delete",
# Add an empty file to keep the test step running correctly even no file is selected to be tested.
2 changes: 1 addition & 1 deletion .github/ISSUE_TEMPLATE/bug-report.yml
@@ -46,7 +46,7 @@ body:
- Big Model Inference: @SunMarc
- quantization (bitsandbytes, autogpt): @SunMarc and @younesbelkada

Documentation: @stevhliu and @MKhalusova
Documentation: @stevhliu

Model hub:

2 changes: 1 addition & 1 deletion .github/PULL_REQUEST_TEMPLATE.md
@@ -17,7 +17,7 @@ Fixes # (issue)

## Before submitting
- [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
- [ ] Did you read the [contributor guideline](https://github.com/huggingface/transformers/blob/main/CONTRIBUTING.md#start-contributing-pull-requests),
- [ ] Did you read the [contributor guideline](https://github.com/huggingface/transformers/blob/main/CONTRIBUTING.md#create-a-pull-request),
Pull Request section?
- [ ] Was this discussed/approved via a Github issue or the [forum](https://discuss.huggingface.co/)? Please add a link
to it if that's the case.
79 changes: 79 additions & 0 deletions .github/actions/post-slack/action.yml
@@ -0,0 +1,79 @@
name: Send message to slack

description: 'Send results to slack'
author: 'Hugging Face'
inputs:
slack_channel:
required: true
type: string
title:
required: true
type: string
status:
required: true
type: string
slack_token:
required: true
type: string

runs:
using: "composite"
steps:
- name: Create content to post
id: create-message
run: |
if [ "${{ inputs.status }}" == "success" ]; then
echo STATUS_MESSAGE='🟢 Tests are passing!' >> $GITHUB_ENV
else
echo STATUS_MESSAGE='🔴 Tests failed! Please check the GitHub action link below' >> $GITHUB_ENV
fi
shell: bash

- name: Post Canceled results Slack channel
id: post-slack
uses: slackapi/slack-github-action@6c661ce58804a1a20f6dc5fbee7f0381b469e001
with:
# Slack channel id, channel name, or user id to post message.
# See also: https://api.slack.com/methods/chat.postMessage#channels
channel-id: ${{ inputs.slack_channel }}
# For posting a rich message using Block Kit
payload: |
{
"text": "${{ inputs.title }}",
"blocks": [
{
"type": "header",
"text": {
"type": "plain_text",
"text": "${{ inputs.title }}"
}
},
{
"type": "section",
"text": {
"type": "mrkdwn",
"text": "${{ env.STATUS_MESSAGE }}"
}
},
{
"type": "section",
"text": {"type": "mrkdwn", "text": "*Click the button for more details about the commit*"},
"accessory": {
"type": "button",
"text": {"type": "plain_text", "text": "Check Commit results"},
"url": "${{ github.event.pull_request.html_url || github.event.head_commit.url }}"
}
},
{
"type": "section",
"text": {"type": "mrkdwn", "text": "*Click here for more details about the action ran*"},
"accessory": {
"type": "button",
"text": {"type": "plain_text", "text": "Check Action results"},
"url": "${{ github.server_url }}/${{ github.repository }}/actions/runs/${{ github.run_id }}"
}
}
]
}
env:
SLACK_BOT_TOKEN: ${{ inputs.slack_token }}
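
The composite action above picks a status message from `inputs.status` and posts a Block Kit payload with a header, a status section, and two button sections. A minimal Python sketch of the same payload, useful for checking the JSON shape locally. The function `build_slack_payload` and the example URLs are hypothetical; the block structure and message strings are copied from the action, and nothing here calls the Slack API.

```python
import json
from typing import Any, Dict


def button_section(label: str, button_text: str, url: str) -> Dict[str, Any]:
    """Build one "section" block with a link button as its accessory."""
    return {
        "type": "section",
        "text": {"type": "mrkdwn", "text": label},
        "accessory": {
            "type": "button",
            "text": {"type": "plain_text", "text": button_text},
            "url": url,
        },
    }


def build_slack_payload(title: str, status: str, commit_url: str, run_url: str) -> Dict[str, Any]:
    """Hypothetical helper mirroring the payload in the action's YAML."""
    if status == "success":
        status_message = "🟢 Tests are passing!"
    else:
        status_message = "🔴 Tests failed! Please check the GitHub action link below"
    return {
        "text": title,
        "blocks": [
            {"type": "header", "text": {"type": "plain_text", "text": title}},
            {"type": "section", "text": {"type": "mrkdwn", "text": status_message}},
            button_section(
                "*Click the button for more details about the commit*",
                "Check Commit results",
                commit_url,
            ),
            button_section(
                "*Click here for more details about the action ran*",
                "Check Action results",
                run_url,
            ),
        ],
    }


# Example URLs are placeholders, not real runs.
payload = build_slack_payload(
    title="Example CI results",
    status="failure",
    commit_url="https://example.com/commit",
    run_url="https://example.com/actions/runs/1",
)
print(json.dumps(payload, indent=2, ensure_ascii=False))
```

Any status other than `"success"` falls through to the failure message, matching the `if`/`else` in the action's shell step.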
2 changes: 1 addition & 1 deletion .github/workflows/TROUBLESHOOT.md
@@ -1,6 +1,6 @@
# Troubleshooting

This is a document explaining how to deal with various issues on github-actions self-hosted CI. The entries may include actually solutions or pointers to Issues that cover those.
This is a document explaining how to deal with various issues on github-actions self-hosted CI. The entries may include actual solutions or pointers to Issues that cover those.

## GitHub Actions (self-hosted CI)

4 changes: 2 additions & 2 deletions .github/workflows/add-model-like.yml
@@ -16,7 +16,7 @@ jobs:
name: "Add new model like template tests"
runs-on: ubuntu-22.04
steps:
- uses: actions/checkout@v3
- uses: actions/checkout@v4

- name: Install dependencies
run: |
@@ -74,7 +74,7 @@ jobs:

- name: Test suite reports artifacts
if: ${{ always() }}
uses: actions/upload-artifact@v3
uses: actions/upload-artifact@v4
with:
name: run_all_tests_new_models_test_reports
path: reports/tests_new_models