Commits · test_safetensors_abi3 · zhusg / transformers-new

07 Nov, 2024 1 commit
- [DO NOT MERGE] Testing dev release of safetensors · 60de6938
  Nicolas Patry authored 7 months ago
  
  60de6938
05 Nov, 2024 12 commits

🌐 [i18n-KO] Translated `convbert.md` to Korean (#34599) · 7bbc6247
Ahnjj_DEV authored 7 months ago
```
* docs: ko: convbert.md

* Update _toctree.yml

* feat: nmt draft
```
7bbc6247
Fix `use_parallel_residual` and `qkv_bias` for StableLM GGUF config extraction (#34450) · e83aaaa8
Isotr0py authored 7 months ago
```
* fix stablelm qkv_bias

* fix stablelm qkv_bias and use_parallel_residual

* remove original_model.config for stablelm gguf test
```
e83aaaa8
Fix torchvision interpolation CI (#34539) · 9f28d0c5
Yoni Gozlan authored 7 months ago
```
fix-torch-interpolation-ci
```
9f28d0c5

Changing __repr__ in torchao to show quantized Linear (#34202) · d2bae7ee

Mohamed Mekkouri authored 7 months ago

* Changing __repr__ in torchao

* small update

* make style

* small update

* add LinearActivationQuantizedTensor

* remove some cases

* update imports & handle return None

* update

d2bae7ee

Remove `@slow` for `test_eager_matches_sdpa_inference` (#34558) · f2d5dfba

Yih-Dar authored 7 months ago


* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f2d5dfba

Fix #34494 assistant tokens when truncated (#34531) · 082e57e0
Yoni Gottesman authored 7 months ago
```
* Fix assistant tokens when truncated

* fix test

* fix test

* step
```
082e57e0
Revert "Fix Whisper CI" (#34605) · 74d3824c
Yih-Dar authored 7 months ago
```
Revert "Fix Whisper CI (#34541)"

This reverts commit eb811449.
```
74d3824c
Remove unused test_dataset (#34516) · 45b0c768
Eon Kim authored 7 months ago

45b0c768

DistilBERT is ExecuTorch compatible (#34475) · 663c8512

Guang Yang authored 7 months ago


* DistillBERT is ExecuTorch compatible

* [run_slow] distilbert

* [run_slow] distilbert

---------

Co-authored-by: Guang Yang <guangyang@fb.com>

663c8512

Load sub-configs from composite configs (#34410) · 893ad04f

Raushan Turganbay authored 7 months ago

* save/load sub-configs

* nit forgot these

* fix copies

* move test to common

* use dict for sub-configs

* add load-save-laod test

* clean up modeling check

* oops this are correct keys

* fix some tests, missed some composite configs

* this model was missed

893ad04f

FIX: Broken repr of TorchAoConfig (#34560) · 5e1fd4e2

Benjamin Bossan authored 7 months ago

FIX Broken repr of TorchAoConfig

The __repr__ method references a non-existent self.kwargs. This is now
fixed.

There does not appear to be a uniform way of defining __repr__ for
quantization configs. I copied the method as implemented for HQQ:

https://github.com/huggingface/transformers/blob/e2ac16b28a0b8b900e136750309ca40c49d975c5/src/transformers/utils/quantization_config.py#L285-L287

5e1fd4e2

Skip DeepSpeed ZeRO Stage 3 model initialization when bnb (#34395) · d0b1d8d8

AbdelKarim ELJANDOUBI authored 7 months ago

* Skip DeepSpeed ZeRO Stage 3 model initialization when it is intended to be quantized.

* Propagate the quantization state using a context manager

* make fixup

d0b1d8d8

04 Nov, 2024 9 commits

Fix Whisper CI (#34541) · eb811449

Yih-Dar authored 7 months ago


update

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

eb811449

fix TrainerState doc because num_input_tokens_seen is unused by defau… (#34593) · bfa021be

kang sheng authored 7 months ago


fix TrainerState doc because num_input_tokens_seen is unused by default config

Co-authored-by: kangsheng <kangsheng@meituan.com>

bfa021be

🌐

[i18n-KO] Update README_ko.md (#33098) · 0a6795af

Ju Hoon Park authored 7 months ago

* Update README_ko.md

Delete the blank paragraph in the language selection button and Edit to synchronize with the English version of README.md

* [i18n-KO] Update README_ko.md

* Additional edit for keep consistency with main [documentation](https://huggingface.co/docs/transformers/v4.44.2/ko/index). (메인 문서와 일관성 유지를 위한 수정)

* Update README_ko.md

Additional update.
* Change docs link to Korean translated page if it exists.

* Change doc link to korean translated if it exists.

Change the link of doc and delete a row 'migration' of the table Learn more[더 알아보기], since it does not exist in the main version of doc.

* modify a link of the main README.md

from
`https://huggingface.co/docs/transformers/index#supported-frameworks`

to
`https://huggingface.co/docs/transformers/index#supported-models-and-frameworks`

since the title of 'supported table' changed.

* [i18n-ko] edit links and sync with main `README.md`

* docs/change comment to Korean1

Change English comment to Korean

Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* docs/change comment to Korean2

Change English comment to Korean

Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* revise to original

to seperate `edit_README_ko_md` and `README.md`

* Synchronization with English documentation.

Synchronization with English documentation, and translated a line of comment from English to Korean.

---------

Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

0a6795af

🌐 [i18n-KO] Translated perf_train_special.md to Korean (#34590) · 1112c546
MaCAT authored 7 months ago
```
* Translated to Ko, 1st version

* updated _toctree.yml
```
1112c546

[i18n-HI] Translated TFLite page to Hindi (#34572) · a86bd6f2

Karthik Vallamsetla authored 7 months ago


* [i18n-HI] Translated TFLite page to Hindi

* [i18n-HI] Translated TFLite page to Hindi

* Update docs/source/hi/tflite.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

---------

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

a86bd6f2

Add text support to the Trainer's TensorBoard integration (#34418) · 48831b7d

JacobLinCool authored 7 months ago


* feat: add text support to TensorBoardCallback

* feat: ignore long strings in trainer progress

* docs: add docstring for max_str_len

* style: remove trailing whitespace

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

48831b7d

MPS: `isin_mps_friendly` can support 0D tensors (#34538) · 34927b0f
Joao Gante authored 7 months ago
```
* apply fix

* tested

* make fixup
```
34927b0f

VLM: special multimodal Tokenizer (#34461) · 187439c3

Raushan Turganbay authored 7 months ago


* kinda works

* update

* add tests

* update

* use special tokens in processors

* typo

* fix copies

* fix

* fix moshi after rebase

* update

* fix tests

* update

* Update docs/source/en/main_classes/tokenizer.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* update docs

* test for load time adding tokens

* fix some more tests which are now fetched better

* one more fix

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

187439c3

Update trainer for easier handling of accumulate, compile fixes, and proper reporting (#34511) · ef976a7e

Zach Mueller authored 7 months ago


* Update trainer for easier handling of accumulate + proper reporting

* test

* Fixup tests

* Full fix

* Fix style

* rm comment

* Fix tests

* Minimize test + remove py 311 check

* Unused import

* Forward contrib credits from discussions

* Fix reported metrics

* Refactor, good as it's going to get

* rm pad tok id check

* object detection and audio are being annoying

* Fin

* Fin x2

---------

Co-authored-by: Gyanateet Dutta <Ryukijano@users.noreply.github.com>

ef976a7e

01 Nov, 2024 5 commits

[i18n-HI] Translated accelerate page to Hindi (#34443) · 33868a05

Karthik Vallamsetla authored 7 months ago


* [i18n-HI] Translated accelerate page to Hindi

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

---------

Co-authored-by: Kay <kay@Kays-MacBook-Pro.local>
Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

33868a05

Large modular logic refactoring (#34487) · e2ac16b2

Cyril Vallez authored 7 months ago

* rework converter

* Update modular_model_converter.py

* Update modular_model_converter.py

* Update modular_model_converter.py

* Update modular_model_converter.py

* cleaning

* cleaning

* finalize imports

* imports

* Update modular_model_converter.py

* Better renaming to avoid visiting same file multiple times

* start converting files

* style

* address most comments

* style

* remove unused stuff in get_needed_imports

* style

* move class dependency functions outside class

* Move main functions outside class

* style

* Update modular_model_converter.py

* rename func

* add augmented dependencies

* Update modular_model_converter.py

* Add types_to_file_type + tweak annotation handling

* Allow assignment dependency mapping + fix regex

* style + update modular examples

* fix modular_roberta example (wrong redefinition of __init__)

* slightly correct order in which dependencies will appear

* style

* review comments

* Performance + better handling of dependencies when they are imported

* style

* Add advanced new classes capabilities

* style

* add forgotten check

* Update modeling_llava_next_video.py

* Add prority list ordering in check_conversion as well

* Update check_modular_conversion.py

* Update configuration_gemma.py

e2ac16b2

🔴

fix `query_pre_attn_scalar` different of... · 86701f2b

Pablo Montalvo authored 7 months ago

🔴 🔴  fix `query_pre_attn_scalar` different of `num_heads` in default gemma2 config (#34540)

* fix query_pre_attn_scalar different of num_heads in default config

* propagate modular changes

* fix copies

* fix modular copies

* fix copies?

* correct copies fix

86701f2b

BLIP: enable generation tests (#34174) · 4cc0813e

Raushan Turganbay authored 7 months ago

* blip2 tests

* instructblips

* copies

* fix slow tests

* fix

* uncomment this

* clean up after rebase

* should be model main input

* fix overwritten tests

* oops len should be multiple of frame number

* style

* fix some tests

4cc0813e

Blip: get/set input embeddings correctly (#34152) · 6beb3f16

Raushan Turganbay authored 7 months ago

* set-get embeds

* add tests

* fix tests

* remove

* return dict True

* fix tests

* why did i remove this

* enabel torchscript tests

6beb3f16

31 Oct, 2024 13 commits

[i18n-ar] Translated file : `docs/source/ar/multilingual.md` into Arabic (#33048) · b53e44e8

Ahmed Almaghz authored 7 months ago


* Add docs/source/ar/multilingual.md to Add_docs_source_ar_multilingual.md

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update _toctree.yml

* Update _toctree.yml

* Add Translated files to branch for merg

* Update _toctree.yml

* Update _toctree.yml

* Update custom_models.md

* Update chat_templating.md

* Update docs/source/ar/create_a_model.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update create_a_model.md

* Update gguf.md

* Update gguf.md

* Update gguf.md

* Update gguf.md

---------

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

b53e44e8

update doc (#34478) · 2801d7bc

jiqing-feng authored 7 months ago


* update doc

* Update docs/source/en/perf_train_cpu.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* delete closing tip

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

2801d7bc

[CLIPSeg] Make interpolate_pos_encoding default to True (#34419) · df8640ce

NielsRogge authored 7 months ago

* Remove interpolate_pos_encoding

* Make fixup

* Make interpolate_pos_encoding default to True

* Reuse existing interpolation

* Add integration test

df8640ce

Add image text to text pipeline (#34170) · 203e2705

Yoni Gozlan authored 7 months ago

* Standardize image-text-to-text-models-output

add post_process_image_text_to_text to chameleon and cleanup

Fix legacy kwarg behavior and deprecation warning

add post_process_image_text_to_text to qwen2_vl and llava_onevision

Add post_process_image_text_to_text to idefics3, mllama, pixtral processor

* nit var name post_process_image_text_to_text udop

* nit fix deprecation warnings

* Add image-text-to-text pipeline

* add support for image url in chat template for pipeline

* Reformat to be fully compatible with chat templates

* Add tests chat template

* Fix imports and tests

* Add pipeline tag

* change logic handling of single prompt ans multiple images

* add pipeline mapping to models

* fix batched inference

* fix tests

* Add manual batching for preprocessing

* Fix outputs with nested images

* Add support for all common processing kwargs

* Add default padding when multiple text inputs (batch size>1)

* nit change version deprecation warning

* Add support for text only inference

* add chat_template warnings

* Add pipeline tests and add copied from post process function

* Fix batched pipeline tests

* nit

* Fix pipeline tests blip2

* remove unnecessary max_new_tokens

* revert processing kosmos2 and remove unnecessary max_new_tokens

* fix pipeline tests idefics

* Force try loading processor if pipeline supports it

* revert load_processor change

* hardcode loading only processor

* remove unnecessary try except

* skip imagetexttotext tests for kosmos2 as tiny model causes problems

* Make code clearer

* Address review comments

* remove preprocessing logic from pipeline

* fix fuyu

* add BC resize fuyu

* Move post_process_image_text_to_text to ProcessorMixin

* add guard in post_process

* fix zero shot object detection pipeline

* add support for generator input in pipeline

* nit

* change default image-text-to-text model to llava onevision

* fix owlv2 size dict

* Change legacy deprecation warning to only show when True

203e2705

Bug Fix for issue #34294 (#34295) · c443d8d5

fpgaminer authored 7 months ago

Update SiglipVisionEmbeddings.forward to cast input to correct dtype before embedding it.

c443d8d5

make `test_eager_matches_sdpa_inference `less flaky (#34512) · 114dd812

Yih-Dar authored 7 months ago


* try

* try

* try

* try

* try

* try

* update

* update

* update

* update

* update

* update

* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

114dd812

feat: add benchmarks pg indexes (#34536) · 294c170f
Luc Georges authored 7 months ago
```
* feat: add benchmarks pg indexes

* refactor: remove debug `df -h`
```
294c170f

fix(DPT,Depth-Anything) Address expected_slice errors inside inference tests (#34518) · b5919e12

Phillip Kuznetsov authored 7 months ago


* fix(DPT,Depth-Anything) Address expected_slice errors inside inference tests

Signed-off-by: Phillip Kuznetsov <philkuz@gimletlabs.ai>

* [run_slow] dpt, depth_anything

---------

Signed-off-by: Phillip Kuznetsov <philkuz@gimletlabs.ai>

b5919e12

Qwen2VL: skip base `input_ids`-`inputs_embeds` equivalence check (#34535) · 4ca004ea
Joao Gante authored 7 months ago
```
it has complex inputs_embeds computation
```
4ca004ea

avoid calling `gc.collect` and `cuda.empty_cache` (#34514) · ab98f0b0

Yih-Dar authored 7 months ago


* update

* update

* update

* update

* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ab98f0b0

Fix step shifting when accumulate gradient (#33673) · dca93ca0

kibitzing authored 7 months ago


* replace total_batched_samples with step while counting grad accum step

* remove unused variable

* simplify condition for update step

* fix format by ruff

* simplify update step condition using accelerator.sync_gradients

* simplify update condition using do_sync_step

* remove print for test

---------

Co-authored-by: Zach Mueller <muellerzr@gmail.com>

dca93ca0

Fix: img size mismatch caused by incorrect unpadding in LLaVA-Next (#34522) · 1b86772d
jp authored 7 months ago
```
Fix: unpadding img mismatch
```
1b86772d
enable QA bf16 pipeline (#34483) · f3853161
jiqing-feng authored 7 months ago
```
* enable QA bf16 pipeline

* add tests
```
f3853161