Commits · eb0ab3ed4bf61066edb3d38e4131cd9b8a6e94bc · 某某某 / transformers-new

18 Nov, 2024 3 commits

Fix broken link (#34618) · eb0ab3ed
Ofek Lev authored 7 months ago

eb0ab3ed
VLMs: `patch_size` -> `num_image_tokens` in processing (#33424) · 1646ffb4
Raushan Turganbay authored 7 months ago
```
* use num additional tokens

* fix copies + docs

* another fix copies :)

* add docs

* move order for BC
```
1646ffb4

Add OLMo November 2024 (#34551) · 3ee24e22

Shane A authored 7 months ago

* Add model skeletion with transformers-cli add-new-model-like

* Convert config to modular, add rms_norm_eps, delete clip_qkv

* Convert model to modular, add RMSNorm

* Add flash attention with qk norm and no qkv clipping

* Add decoder layer with RMSNorm after attention/feedforward layers

* Add base and causal model

* Add converter improvements from OLMo repo

* Update weight loading in OLMo to HF converter

* Set correct default for rms_norm_eps

* Set correct pipeline_model_mapping in test

* Run make fixup

* Fix model type

* Re-run modular conversion

* Manually set config docs to fix build errors

* Convert olmo-1124 to olmo_1124 to fix flash attention docs errors

* Start updating tests

* Update tests

* Copy upstream test_eager_matches_sdpa_inference_1_bfloat16 changes to olmo_1124

* Rename input_layernorm and post_attention_layernorm to reflect their ops better

* Use correct tokenizer

* Remove test unsupported by GPT2 tokenizer

* Create GenerationConfig outside of from_pretrained call

* Use simpler init file structure

* Add explicit __all__ to support simplified init

* Make safetensor serialization the default

* Update OLMo November 2024 docs

3ee24e22

15 Nov, 2024 7 commits

🧼 remove v4.44 deprecations (#34245) · 13493215

Joao Gante authored 7 months ago


* remove v4.44 deprecations

* PR comments

* deprecations scheduled for v4.50

* hub version update

* make fiuxp

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

13493215

Remove FSDP wrapping from sub-models. (#34452) · 8d50fda6

AbdelKarim ELJANDOUBI authored 7 months ago

* Remove FSDP wrapping from sub-models.

* solve conflict trainer.py

* make fixup

* add unit test for fsdp_auto_wrap_policy when using auto_find_batch_size

* put back extract_model_from_parallel

* use transformers unwrap_model

8d50fda6

FSDP grad accum fix (#34645) · b0c0ba7b

Wing Lian authored 7 months ago

* add gradient accumulation steps tests for fsdp

* invert no_sync context to fix training for fsdp

b0c0ba7b

add xpu path for awq (#34712) · 52ea4aa5
jiqing-feng authored 7 months ago
```
* add xpu path for awq

* update readme
```
52ea4aa5
fix(wandb): pass fake dataset to avoid exception in trainer (see #34455) (#34720) · 7b3d615b
CezaPasc authored 7 months ago

7b3d615b
Update llava.md (#34749) · f5dbfab7
Lysandre Debut authored 7 months ago
```
LLava -> Llava
```
f5dbfab7

Retain newlines in chat template when `continue_final_message=True` (#34253) · 8ba3e150

lewtun authored 7 months ago


* Retain newlines in chat template when

* Add try/except

* Add regression test

* Simplify test

* Apply suggestions from code review

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

---------

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

8ba3e150

13 Nov, 2024 4 commits

[docs] add xpu device check (#34684) · a3d69a89

Fanli Lin authored 7 months ago


* add XPU path

* use accelerate API

* Update docs/source/en/tasks/semantic_segmentation.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* update more places with accelerate API

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

a3d69a89

Fix example in EsmConfig docstring (#34653) · 68f8186a
Xiao Yuan authored 7 months ago

68f8186a
[docs] Broken link in generation_strategies (#34717) · e7c36a9d
Pedro Cuenca authored 7 months ago
```
[docs] Broken link
```
e7c36a9d
🌐 [i18n-KO] Translated marian.md to Korean (#34698) · be8748a5
MaCAT authored 7 months ago
```
* initial translation

* removed english

* Fixed Trivial Typos, updated _toctree.yml
```
be8748a5

11 Nov, 2024 3 commits

Agents: Small fixes in streaming to gradio + add tests (#34549) · 33eef992
Aymeric Roucher authored 8 months ago
```
* Better support transformers.agents in gradio: small fixes and additional tests
```
33eef992

[i18n-ar] Translated file : `docs/source/ar/torchscript.md` into Arabic (#33079) · 6de2a4d1

Ahmed Almaghz authored 8 months ago


* Add docs/source/ar/torchscript.md to Add_docs_source_ar_torchscript.md

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Merge troubleshooting.md with this Branch

* Update _toctree.yml

* Update torchscript.md

* Update troubleshooting.md

---------

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

6de2a4d1

[docs] update not-working model revision (#34682) · 25f510a9
Fanli Lin authored 8 months ago
```
update revision
```
25f510a9

10 Nov, 2024 2 commits

Agents: turn any Space into a Tool with `Tool.from_space()` (#34561) · 3ea3ab62
Aymeric Roucher authored 8 months ago
```
* Agents: you can now load a Space as a tool
```
3ea3ab62

Update llm_engine.py (#33332) · 134ba90d

Louis Brulé Naudet authored 8 months ago

* Update llm_engine.py
- Added support for optional token and max_tokens parameters in the constructor.
- Provided usage examples and detailed documentation for each method.

134ba90d

09 Nov, 2024 1 commit

[i18n-ar] Translated file : `docs/source/ar/trainer.md` into Arabic (#33080) · 768f3c01

Ahmed Almaghz authored 8 months ago


* Add docs/source/ar/trainer.md to Add_docs_source_ar_trainer.md

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update trainer.md

* Update trainer.md

* Update trainer.md

* Create _toctree.yml

* Delete docs/source/ar/_toctree.yml

* Update _toctree.yml - add trainer

* Update _toctree.yml

* merge serialization.md into this branch

* merge sagemaker.md into this PR

* Update _toctree.yml

* Update docs/source/ar/trainer.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

768f3c01

08 Nov, 2024 1 commit

🌐

[i18n-KO] Translated bert.md to Korean (#34627) · a06a0d12

MaCAT authored 8 months ago


* Translated bert.md, Need additional check

* Translation 2nd ver, changed _toctree.yml

* Fixed Typo

* Update bert.md

Co-authored-by: YONGSANG <71686691+4N3MONE@users.noreply.github.com>

* Update bert.md

Co-authored-by: YONGSANG <71686691+4N3MONE@users.noreply.github.com>

* Update bert.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update bert.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------

Co-authored-by: YONGSANG <71686691+4N3MONE@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

a06a0d12

07 Nov, 2024 2 commits
- 🌐 [i18n-KO] Translated `timesformer.md` to Korean (#33972) · 1cf17077
  Jiwook Han authored 8 months ago
```
* docs: ko: model_doc/timesformer.md

* feat: nmt draft

* fix: manual edits

* fix_toctree

* fix toctree on Video Models
```
  1cf17077
- fix(dvclive): pass fake dataset to avoid exception in trainer init (#34455) · 6938524a
  Ivan Shcheklein authored 8 months ago
```
fix(dvclive): pass fake dataset to avoid exception in trainer
```
  6938524a
05 Nov, 2024 12 commits

🌐 [i18n-KO] Translated `convbert.md` to Korean (#34599) · 7bbc6247
Ahnjj_DEV authored 8 months ago
```
* docs: ko: convbert.md

* Update _toctree.yml

* feat: nmt draft
```
7bbc6247
Fix `use_parallel_residual` and `qkv_bias` for StableLM GGUF config extraction (#34450) · e83aaaa8
Isotr0py authored 8 months ago
```
* fix stablelm qkv_bias

* fix stablelm qkv_bias and use_parallel_residual

* remove original_model.config for stablelm gguf test
```
e83aaaa8
Fix torchvision interpolation CI (#34539) · 9f28d0c5
Yoni Gozlan authored 8 months ago
```
fix-torch-interpolation-ci
```
9f28d0c5

Changing __repr__ in torchao to show quantized Linear (#34202) · d2bae7ee

Mohamed Mekkouri authored 8 months ago

* Changing __repr__ in torchao

* small update

* make style

* small update

* add LinearActivationQuantizedTensor

* remove some cases

* update imports & handle return None

* update

d2bae7ee

Remove `@slow` for `test_eager_matches_sdpa_inference` (#34558) · f2d5dfba

Yih-Dar authored 8 months ago


* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f2d5dfba

Fix #34494 assistant tokens when truncated (#34531) · 082e57e0
Yoni Gottesman authored 8 months ago
```
* Fix assistant tokens when truncated

* fix test

* fix test

* step
```
082e57e0
Revert "Fix Whisper CI" (#34605) · 74d3824c
Yih-Dar authored 8 months ago
```
Revert "Fix Whisper CI (#34541)"

This reverts commit eb811449.
```
74d3824c
Remove unused test_dataset (#34516) · 45b0c768
Eon Kim authored 8 months ago

45b0c768

DistilBERT is ExecuTorch compatible (#34475) · 663c8512

Guang Yang authored 8 months ago


* DistillBERT is ExecuTorch compatible

* [run_slow] distilbert

* [run_slow] distilbert

---------

Co-authored-by: Guang Yang <guangyang@fb.com>

663c8512

Load sub-configs from composite configs (#34410) · 893ad04f

Raushan Turganbay authored 8 months ago

* save/load sub-configs

* nit forgot these

* fix copies

* move test to common

* use dict for sub-configs

* add load-save-laod test

* clean up modeling check

* oops this are correct keys

* fix some tests, missed some composite configs

* this model was missed

893ad04f

FIX: Broken repr of TorchAoConfig (#34560) · 5e1fd4e2

Benjamin Bossan authored 8 months ago

FIX Broken repr of TorchAoConfig

The __repr__ method references a non-existent self.kwargs. This is now
fixed.

There does not appear to be a uniform way of defining __repr__ for
quantization configs. I copied the method as implemented for HQQ:

https://github.com/huggingface/transformers/blob/e2ac16b28a0b8b900e136750309ca40c49d975c5/src/transformers/utils/quantization_config.py#L285-L287

5e1fd4e2

Skip DeepSpeed ZeRO Stage 3 model initialization when bnb (#34395) · d0b1d8d8

AbdelKarim ELJANDOUBI authored 8 months ago

* Skip DeepSpeed ZeRO Stage 3 model initialization when it is intended to be quantized.

* Propagate the quantization state using a context manager

* make fixup

d0b1d8d8

04 Nov, 2024 5 commits

Fix Whisper CI (#34541) · eb811449

Yih-Dar authored 8 months ago


update

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

eb811449

fix TrainerState doc because num_input_tokens_seen is unused by defau… (#34593) · bfa021be

kang sheng authored 8 months ago


fix TrainerState doc because num_input_tokens_seen is unused by default config

Co-authored-by: kangsheng <kangsheng@meituan.com>

bfa021be

🌐

[i18n-KO] Update README_ko.md (#33098) · 0a6795af

Ju Hoon Park authored 8 months ago

* Update README_ko.md

Delete the blank paragraph in the language selection button and Edit to synchronize with the English version of README.md

* [i18n-KO] Update README_ko.md

* Additional edit for keep consistency with main [documentation](https://huggingface.co/docs/transformers/v4.44.2/ko/index). (메인 문서와 일관성 유지를 위한 수정)

* Update README_ko.md

Additional update.
* Change docs link to Korean translated page if it exists.

* Change doc link to korean translated if it exists.

Change the link of doc and delete a row 'migration' of the table Learn more[더 알아보기], since it does not exist in the main version of doc.

* modify a link of the main README.md

from
`https://huggingface.co/docs/transformers/index#supported-frameworks`

to
`https://huggingface.co/docs/transformers/index#supported-models-and-frameworks`

since the title of 'supported table' changed.

* [i18n-ko] edit links and sync with main `README.md`

* docs/change comment to Korean1

Change English comment to Korean

Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* docs/change comment to Korean2

Change English comment to Korean

Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* revise to original

to seperate `edit_README_ko_md` and `README.md`

* Synchronization with English documentation.

Synchronization with English documentation, and translated a line of comment from English to Korean.

---------

Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

0a6795af

🌐 [i18n-KO] Translated perf_train_special.md to Korean (#34590) · 1112c546
MaCAT authored 8 months ago
```
* Translated to Ko, 1st version

* updated _toctree.yml
```
1112c546

[i18n-HI] Translated TFLite page to Hindi (#34572) · a86bd6f2

Karthik Vallamsetla authored 8 months ago


* [i18n-HI] Translated TFLite page to Hindi

* [i18n-HI] Translated TFLite page to Hindi

* Update docs/source/hi/tflite.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

---------

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

a86bd6f2