Commits · 187439c3fa139b2102a874483e9f8f0cfa8e5557 · zhusg / transformers-new

04 Nov, 2024 2 commits

VLM: special multimodal Tokenizer (#34461) · 187439c3


* kinda works

* update

* add tests

* update

* use special tokens in processors

* typo

* fix copies

* fix

* fix moshi after rebase

* update

* fix tests

* update

* Update docs/source/en/main_classes/tokenizer.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* update docs

* test for load time adding tokens

* fix some more tests which are now fetched better

* one more fix

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

187439c3

Update trainer for easier handling of accumulate, compile fixes, and proper reporting (#34511) · ef976a7e

Zach Mueller authored 8 months ago


* Update trainer for easier handling of accumulate + proper reporting

* test

* Fixup tests

* Full fix

* Fix style

* rm comment

* Fix tests

* Minimize test + remove py 311 check

* Unused import

* Forward contrib credits from discussions

* Fix reported metrics

* Refactor, good as it's going to get

* rm pad tok id check

* object detection and audio are being annoying

* Fin

* Fin x2

---------

Co-authored-by: Gyanateet Dutta <Ryukijano@users.noreply.github.com>

ef976a7e

01 Nov, 2024 5 commits

[i18n-HI] Translated accelerate page to Hindi (#34443) · 33868a05

Karthik Vallamsetla authored 8 months ago


* [i18n-HI] Translated accelerate page to Hindi

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

* Update docs/source/hi/accelerate.md

Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

---------

Co-authored-by: Kay <kay@Kays-MacBook-Pro.local>
Co-authored-by: K.B.Dharun Krishna <kbdharunkrishna@gmail.com>

33868a05

Large modular logic refactoring (#34487) · e2ac16b2

Cyril Vallez authored 8 months ago

* rework converter

* Update modular_model_converter.py

* Update modular_model_converter.py

* Update modular_model_converter.py

* Update modular_model_converter.py

* cleaning

* cleaning

* finalize imports

* imports

* Update modular_model_converter.py

* Better renaming to avoid visiting same file multiple times

* start converting files

* style

* address most comments

* style

* remove unused stuff in get_needed_imports

* style

* move class dependency functions outside class

* Move main functions outside class

* style

* Update modular_model_converter.py

* rename func

* add augmented dependencies

* Update modular_model_converter.py

* Add types_to_file_type + tweak annotation handling

* Allow assignment dependency mapping + fix regex

* style + update modular examples

* fix modular_roberta example (wrong redefinition of __init__)

* slightly correct order in which dependencies will appear

* style

* review comments

* Performance + better handling of dependencies when they are imported

* style

* Add advanced new classes capabilities

* style

* add forgotten check

* Update modeling_llava_next_video.py

* Add priority list ordering in check_conversion as well

* Update check_modular_conversion.py

* Update configuration_gemma.py

e2ac16b2

fix `query_pre_attn_scalar` different of `num_heads` in default gemma2 config (#34540) · 86701f2b

Pablo Montalvo authored 8 months ago

* fix query_pre_attn_scalar different of num_heads in default config

* propagate modular changes

* fix copies

* fix modular copies

* fix copies?

* correct copies fix

86701f2b

BLIP: enable generation tests (#34174) · 4cc0813e

Raushan Turganbay authored 8 months ago

* blip2 tests

* instructblips

* copies

* fix slow tests

* fix

* uncomment this

* clean up after rebase

* should be model main input

* fix overwritten tests

* oops len should be multiple of frame number

* style

* fix some tests

4cc0813e

Blip: get/set input embeddings correctly (#34152) · 6beb3f16

Raushan Turganbay authored 8 months ago

* set-get embeds

* add tests

* fix tests

* remove

* return dict True

* fix tests

* why did i remove this

* enable torchscript tests

6beb3f16

31 Oct, 2024 13 commits

[i18n-ar] Translated file : `docs/source/ar/multilingual.md` into Arabic (#33048) · b53e44e8

Ahmed Almaghz authored 8 months ago


* Add docs/source/ar/multilingual.md to Add_docs_source_ar_multilingual.md

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/multilingual.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update _toctree.yml

* Update _toctree.yml

* Add Translated files to branch for merge

* Update _toctree.yml

* Update _toctree.yml

* Update custom_models.md

* Update chat_templating.md

* Update docs/source/ar/create_a_model.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update create_a_model.md

* Update gguf.md

* Update gguf.md

* Update gguf.md

* Update gguf.md

---------

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

b53e44e8

update doc (#34478) · 2801d7bc

jiqing-feng authored 8 months ago


* update doc

* Update docs/source/en/perf_train_cpu.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* delete closing tip

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

2801d7bc

[CLIPSeg] Make interpolate_pos_encoding default to True (#34419) · df8640ce

NielsRogge authored 8 months ago

* Remove interpolate_pos_encoding

* Make fixup

* Make interpolate_pos_encoding default to True

* Reuse existing interpolation

* Add integration test

df8640ce

Add image text to text pipeline (#34170) · 203e2705

Yoni Gozlan authored 8 months ago

* Standardize image-text-to-text-models-output

add post_process_image_text_to_text to chameleon and cleanup

Fix legacy kwarg behavior and deprecation warning

add post_process_image_text_to_text to qwen2_vl and llava_onevision

Add post_process_image_text_to_text to idefics3, mllama, pixtral processor

* nit var name post_process_image_text_to_text udop

* nit fix deprecation warnings

* Add image-text-to-text pipeline

* add support for image url in chat template for pipeline

* Reformat to be fully compatible with chat templates

* Add tests chat template

* Fix imports and tests

* Add pipeline tag

* change logic handling of single prompt and multiple images

* add pipeline mapping to models

* fix batched inference

* fix tests

* Add manual batching for preprocessing

* Fix outputs with nested images

* Add support for all common processing kwargs

* Add default padding when multiple text inputs (batch size>1)

* nit change version deprecation warning

* Add support for text only inference

* add chat_template warnings

* Add pipeline tests and add copied from post process function

* Fix batched pipeline tests

* nit

* Fix pipeline tests blip2

* remove unnecessary max_new_tokens

* revert processing kosmos2 and remove unnecessary max_new_tokens

* fix pipeline tests idefics

* Force try loading processor if pipeline supports it

* revert load_processor change

* hardcode loading only processor

* remove unnecessary try except

* skip imagetexttotext tests for kosmos2 as tiny model causes problems

* Make code clearer

* Address review comments

* remove preprocessing logic from pipeline

* fix fuyu

* add BC resize fuyu

* Move post_process_image_text_to_text to ProcessorMixin

* add guard in post_process

* fix zero shot object detection pipeline

* add support for generator input in pipeline

* nit

* change default image-text-to-text model to llava onevision

* fix owlv2 size dict

* Change legacy deprecation warning to only show when True

203e2705

Bug Fix for issue #34294 (#34295) · c443d8d5

fpgaminer authored 8 months ago

Update SiglipVisionEmbeddings.forward to cast input to correct dtype before embedding it.

c443d8d5

make `test_eager_matches_sdpa_inference` less flaky (#34512) · 114dd812

Yih-Dar authored 8 months ago


* try

* try

* try

* try

* try

* try

* update

* update

* update

* update

* update

* update

* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

114dd812

feat: add benchmarks pg indexes (#34536) · 294c170f

Luc Georges authored 8 months ago

* feat: add benchmarks pg indexes

* refactor: remove debug `df -h`

294c170f

fix(DPT,Depth-Anything) Address expected_slice errors inside inference tests (#34518) · b5919e12

Phillip Kuznetsov authored 8 months ago


* fix(DPT,Depth-Anything) Address expected_slice errors inside inference tests

Signed-off-by: Phillip Kuznetsov <philkuz@gimletlabs.ai>

* [run_slow] dpt, depth_anything

---------

Signed-off-by: Phillip Kuznetsov <philkuz@gimletlabs.ai>

b5919e12

Qwen2VL: skip base `input_ids`-`inputs_embeds` equivalence check (#34535) · 4ca004ea

Joao Gante authored 8 months ago

it has complex inputs_embeds computation

4ca004ea

avoid calling `gc.collect` and `cuda.empty_cache` (#34514) · ab98f0b0

Yih-Dar authored 8 months ago


* update

* update

* update

* update

* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ab98f0b0

Fix step shifting when accumulate gradient (#33673) · dca93ca0

kibitzing authored 8 months ago


* replace total_batched_samples with step while counting grad accum step

* remove unused variable

* simplify condition for update step

* fix format by ruff

* simplify update step condition using accelerator.sync_gradients

* simplify update condition using do_sync_step

* remove print for test

---------

Co-authored-by: Zach Mueller <muellerzr@gmail.com>

dca93ca0
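
The step-shifting problem named in this commit can be illustrated with a small, self-contained sketch. It is not the Trainer code: the function names, the "always sync on the last step of an epoch" rule, and the toy numbers are assumptions made for illustration only. It compares deciding the optimizer-update step from a counter that keeps growing across epochs (in the spirit of `total_batched_samples`) with deciding it from the per-epoch `step`.

```python
def sync_steps_global_counter(steps_in_epoch, accum_steps, epochs):
    """Update steps when a counter that spans epochs drives the decision (hypothetical)."""
    result = []
    total = 0  # keeps growing across epochs
    for _ in range(epochs):
        epoch_steps = []
        for step in range(steps_in_epoch):
            total += 1
            # Assumed rule: sync every accum_steps batches, and on the last step of the epoch.
            if total % accum_steps == 0 or (step + 1) == steps_in_epoch:
                epoch_steps.append(step)
        result.append(epoch_steps)
    return result


def sync_steps_per_epoch_counter(steps_in_epoch, accum_steps, epochs):
    """Update steps when the per-epoch step index drives the decision (hypothetical)."""
    result = []
    for _ in range(epochs):
        epoch_steps = [
            step
            for step in range(steps_in_epoch)
            if (step + 1) % accum_steps == 0 or (step + 1) == steps_in_epoch
        ]
        result.append(epoch_steps)
    return result


# With 5 steps per epoch and accumulation over 2 steps, the global counter
# updates at different step indices in the second epoch.
print(sync_steps_global_counter(5, 2, 2))     # [[1, 3, 4], [0, 2, 4]]
print(sync_steps_per_epoch_counter(5, 2, 2))  # [[1, 3, 4], [1, 3, 4]]
```

When the number of steps per epoch is not a multiple of the accumulation steps, the global counter drifts out of phase with the epoch boundary, which is one plausible reading of the "step shifting" in the commit title.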

Fix: img size mismatch caused by incorrect unpadding in LLaVA-Next (#34522) · 1b86772d

jp authored 8 months ago

Fix: unpadding img mismatch

1b86772d

enable QA bf16 pipeline (#34483) · f3853161

jiqing-feng authored 8 months ago

* enable QA bf16 pipeline

* add tests

f3853161

30 Oct, 2024 11 commits

UPDATE Documentation for #TRANSLATING.md Documentation into Multiple Languages.(Changes made) (#34226) · 405b5626

anshumangahlot authored 8 months ago

* Update TRANSLATING.md

* Apply suggestions from code review

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update TRANSLATING.md

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

405b5626

Add Image Processor Fast RT-DETR (#34354) · 48872fd6

Yoni Gozlan authored 8 months ago

* add fast image processor rtdetr

* add gpu/cpu test and fix docstring

* remove prints

* add to doc

* nit docstring

* avoid iterating over images/annotations several times

* change torch typing

* Add image processor fast documentation

48872fd6

Fix super tiny extra space typo (#34440) · 9f06fb05

fzyzcjy authored 8 months ago

Update training_args.py

9f06fb05

Add GGUF for Mamba (#34200) · 5251fe62

Vladislav Bronzov authored 8 months ago

* add mamba architecture for gguf

* add logic for weights conversion, some fixes and refactoring

* add lm_head layers, unit test refactoring

* more fixes for tests

* remove lm_head creation

* remove unused comments

5251fe62

Use torch 2.5 in scheduled CI (#34465) · eab6c491

Yih-Dar authored 8 months ago


* torch 2.5

* try

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

eab6c491

fix pixtral processor (#34486) · 241d7902

Pablo Montalvo authored 8 months ago

* fix pixtral processor

* test out full length batches + remove undue ValueError

* fix up processing

* fix tests

* fix

* last fixup

* style

* [run-slow] pixtral

* [run-slow] pixtral

* fix config key

* skip torchscript tests

* [run-slow] pixtral

* add missing key

* [run-slow] pixtral

* fix docs

* [run-slow] pixtral

* fix wrong url for integration test

* [run-slow] pixtral

* pixtralVisionModel does not have a lm head

* [run-slow] pixtral

241d7902

Tests: move `generate` tests to the right mixin and delete redundant tests (#34464) · 8a734ea2

Joao Gante authored 8 months ago

* tmp commit

* tmp commit

* cull overwrites of deleted tests

* typo

* more specific docstring

* make fixup

* parameterize at the top?

* correction

* more deletions :D

* tmp commit

* for VLMs too

* fix _check_outputs

* test nit

* make fixup

* fix another flaky

* test_generate_from_inputs_embeds -- handle missing attention mask

8a734ea2

VLMs: fix number of image tokens (#34332) · 913330ca

Raushan Turganbay authored 8 months ago

* fix

* fix tests

* add tests

* style

* style

* fix qwen after rebase

* fix video llava

913330ca

Mllama: update docs (#34334) · 0f764a5a

Raushan Turganbay authored 8 months ago

* update docs

* be more explicit

* use available methods

0f764a5a

Fix format mistake in string repr of tokenizer objects (#34493) · 25a9fc58

Pethő Gergely authored 8 months ago

* fix repr string format for tokenizer objects

The repr of tokenizer tokens looks confusing and just stupid, like this: `Tokenizer(...), added_tokens_decoder={1: ..., 2: ...}`. The dict that is the value of the added_tokens_decoder attribute is outside of the parentheses of the tokenizer object, whereas all other attributes are inside the parentheses like they should be.

This commit fixes this bug.

* cos: add newline before closing parenthesis of repr string

25a9fc58
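
The formatting bug described in this commit message can be reproduced with a toy sketch. The helper names and the attribute values below are hypothetical and are not taken from the library; only the shape of the two strings follows the commit description (dict outside the parentheses before, inside them with a newline before the closing parenthesis after).

```python
def repr_before(name, attrs, added_tokens_decoder):
    """Builds the confusing form: the dict lands after the closing parenthesis."""
    inner = ", ".join(f"{key}={value!r}" for key, value in attrs.items())
    return f"{name}({inner}), added_tokens_decoder={added_tokens_decoder!r}"


def repr_after(name, attrs, added_tokens_decoder):
    """Builds the corrected form: every attribute sits inside the parentheses,
    with a newline before the closing parenthesis."""
    inner = ", ".join(f"{key}={value!r}" for key, value in attrs.items())
    return f"{name}({inner}, added_tokens_decoder={added_tokens_decoder!r}\n)"


attrs = {"name_or_path": "toy"}  # hypothetical attribute
decoder = {1: "<a>"}             # hypothetical added token

print(repr_before("Tokenizer", attrs, decoder))
# Tokenizer(name_or_path='toy'), added_tokens_decoder={1: '<a>'}
print(repr_after("Tokenizer", attrs, decoder))
# Tokenizer(name_or_path='toy', added_tokens_decoder={1: '<a>'}
# )
```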

Roberta is ExecuTorch compatible (#34425) · cd277618

Guang Yang authored 8 months ago


* Roberta is ExecuTorch compatible

* [run_slow] roberta

---------

Co-authored-by: Guang Yang <guangyang@fb.com>

cd277618

29 Oct, 2024 9 commits

Un-deprecate timeout arg in pipelines (#34382) · 9bee9ff5

Matt authored 8 months ago

* Un-deprecate timeout

* Put "timeout" on the allowed list

* make fixup

9bee9ff5

fix incorrect warning (#34416) · e4449bb7

Yoni Gozlan authored 8 months ago

e4449bb7

Fix performance in get_imports regexp (#34298) · f55595b1

Aleksey Lobanov authored 8 months ago

* fix: Fix performance in get_imports regexp

* Minimize get_imports content regexp

f55595b1

Bump werkzeug from 3.0.3 to 3.0.6 in /examples/research_projects/decision_transformer (#34420) · 4e2e8809

dependabot[bot] authored 8 months ago

Bump werkzeug in /examples/research_projects/decision_transformer

Bumps [werkzeug](https://github.com/pallets/werkzeug) from 3.0.3 to 3.0.6.
- [Release notes](https://github.com/pallets/werkzeug/releases)
- [Changelog](https://github.com/pallets/werkzeug/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/werkzeug/compare/3.0.3...3.0.6)

---
updated-dependencies:
- dependency-name: werkzeug
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

4e2e8809

Adding `optimizer_cls_and_kwargs` to `Trainer.__init__` (#34358) · e9ad4604

Apoorv Khandelwal authored 8 months ago

* Adding `optimizer_cls_and_kwargs` to `Trainer.__init__`

* formatting

* make fix-copies docstring

* added more docs for optimizer_cls_and_kwargs

* add docs for Trainer(optimizer_cls_and_kwargs)

* reverting anchor names

e9ad4604

Albert is ExecuTorch compatible (#34476) · f339042b

Guang Yang authored 8 months ago

Co-authored-by: Guang Yang <guangyang@fb.com>

f339042b

MobileBERT is ExecuTorch compatible (#34473) · 34620e8f

Guang Yang authored 8 months ago

Co-authored-by: Guang Yang <guangyang@fb.com>

34620e8f

Bug fix for drop path decay rate in swin transformer (#34291) · 56c45d57

Abhijit Deo authored 8 months ago


* potential bug fix for drop path

* variable name change

* forgot to rename the variables

* back to original

* modify dpr properly

* check_copies auto fix

* corresponding swin2 changes

* auto fix

* linting

* default value for drop_path_rate as 0.0

* Update src/transformers/models/glm/modeling_glm.py

* maskformer fix

* ruff format

* changes made to tf code as well

* lint

---------

Co-authored-by: abhijit deo <167164474+deo-abhijit@users.noreply.github.com>

56c45d57

fix-qwen2vl-no-position_ids (#33487) · 0ab0a426

Shijie authored 8 months ago

0ab0a426