- 22 Oct, 2024 9 commits
-
-
Cyril Vallez authored
Fix FA2
-
HALLOUARD authored
* feat: [RT-DETR] Add onnx runtime config and fix onnx inference bug Optype (Where) * fix lint * use dtype instead of torch.float32 * add doc * remove onnx config * use dtype info * use tensor to fix lint
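A minimal sketch of the dtype pattern the last two bullets describe, with illustrative names (not the actual RT-DETR code): deriving the fill value from the input's own dtype, as a tensor, keeps both branches of the exported ONNX Where op at the same type.

```python
import torch

def mask_scores(scores: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    # Derive the fill value from the input dtype instead of hard-coding
    # torch.float32, and wrap it in a tensor so both branches of the
    # exported ONNX Where op carry the same dtype.
    min_value = torch.tensor(torch.finfo(scores.dtype).min, dtype=scores.dtype, device=scores.device)
    return torch.where(mask, min_value, scores)
```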
-
Marc Sun authored
update PR template
-
Matt authored
* Sync video classification pipeline * Add disclaimer
-
regisss authored
Fix Korean doc _toctree.yml
-
Steven Liu authored
fix generation configs
-
Raushan Turganbay authored
* this worked in normal generation, needs more tests * fix almost all tests in t5 * nit * longt5, umt5, mt5 * style * udop, pix2struct * more models * fix some tests * fix onnx tests * tracing tests fixed * compile enabled and tested for t5 models * fix small bug in slow tests * [run-slow] t5 * uncomment * style * update with new generation refactoring * nit * fix copies * this is the fix, had to change t5 to fix copies * update * [run-slow] t5 * [run-slow] t5 * update * add test for encoder only T5 * clean up after rebase * fix pop2piano * add comment * style * fix copies after rebase * fix copies missed this one
-
Raushan Turganbay authored
* update * fix tests + fix copies * fix tests once more
-
Raushan Turganbay authored
* first try * codestyle * idefics2 is happy * [run-slow] llava, llava_next, video_llava, vipllava, llava_next_video, idefics, idefics2, kosmos2, fuyu, blip, blip_2, instructblip, instructblipvideo, paligemma * fix-copies * [run-slow] llava, llava_next, video_llava, vipllava, llava_next_video, idefics, idefics2, kosmos2, fuyu, blip, blip_2, instructblip, instructblipvideo * blip-2 needs to init vision from config * when was this removed O_o * minor fix * tests * this way? * tests * model-agnostic code * codestyle * add tests for idefics * modify general test for VLMs * no generation test for vlm yet! * no generation test here also * warn in ViT-SDPA if output attn * add more tests * user can pass dict as attn impl * repo consistency * update * musicgen * no prints * forgot speech enc-dec and clip * how many composite models do we have? * musicgen melody is same as musicgen * +siglip * fix tests + add some more * remove idefics custom overridden code * make idefics2 automappable * nits * skip tests * doctests * Update src/transformers/models/idefics2/configuration_idefics2.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/models/clip/test_modeling_clip.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/models/idefics2/test_modeling_idefics2.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/models/idefics2/test_modeling_idefics2.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/configuration_utils.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * major update, no need for automap * clean up * add FA2 test * more tests * style * skip tests * why did these start failing now? * no attributes for FA2 needed * one tiny test * address comment about FA2 false warning * style * add new models and resolve conflicts * fix copies * let it be this way for now, come back tomorrow to review * some more fixes * update * more updates * update * fix copies * style and tests * another big update * fix tests * fix tests * update * another update * fix tests * fix copies * fix tests --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
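One user-facing change buried in the log above ("user can pass dict as attn impl") is per-sub-model attention backends for composite models. A hedged usage sketch, assuming the dict is keyed by sub-config name (checkpoint and keys are illustrative):

```python
from transformers import AutoModelForVision2Seq

# Per-sub-model attention backends for a composite (vision + text) model;
# sub-models left out of the dict fall back to their default implementation.
model = AutoModelForVision2Seq.from_pretrained(
    "HuggingFaceM4/idefics2-8b",
    attn_implementation={"vision_config": "sdpa", "text_config": "flash_attention_2"},
)
```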
-
- 21 Oct, 2024 5 commits
-
-
Andrés Marafioti authored
The method `model_download_tool` was called `model_download_counter` earlier in the tutorial, which raises an error when following the code.
-
Matt authored
Add a section on writing generation prompts
-
Yoni Gozlan authored
* add fully functioning image_processing_detr_fast * Create tensors on the correct device * fix copies * fix doc * add tests equivalence cpu gpu * fix doc en * add relative imports and copied from * Fix copies and nit
-
Yoni Gozlan authored
* change import logging * fix CI
-
Raushan Turganbay authored
* don't rely on main input name * update
-
- 18 Oct, 2024 6 commits
-
-
Matthew Hoffman authored
* Only cast logits to float when computing loss (some misses from #31292 and #33902) * Move logits.float() into the existing `if labels is not None` branch
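The pattern being applied, sketched with a generic LM head (names are illustrative, not any specific model's code): the float32 upcast moves inside the labels branch so inference never pays for it.

```python
import torch.nn.functional as F

def lm_head_forward(hidden_states, lm_head, labels=None):
    logits = lm_head(hidden_states)
    loss = None
    if labels is not None:
        # Upcast only when a loss is computed; inference never pays
        # for a full-vocabulary float32 copy of the logits.
        logits = logits.float()
        loss = F.cross_entropy(logits.view(-1, logits.size(-1)), labels.view(-1))
    return loss, logits
```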
-
Matt authored
* Trigger UDOP tests * Try forcing dtype in LayoutLMV3 * Do checks to see where uint8 is getting in * Do checks to see where uint8 is getting in * Found it! * Add .astype(np.float32) * Remove forced check, make fixup * Checking where exactly the uint8 creeps in * More checking on the uint8 issues * Manually upcast in rescale() * Remove UDOP trigger
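A hedged sketch of the final fix ("Manually upcast in rescale()"), with illustrative names:

```python
import numpy as np

def rescale(image: np.ndarray, scale: float, dtype=np.float32) -> np.ndarray:
    # Upcast explicitly before scaling so a uint8 input cannot
    # propagate an integer dtype into downstream processing.
    return (image.astype(np.float64) * scale).astype(dtype)
```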
-
Cyril Vallez authored
* Create modular_glm.py * Update modular_glm.py * Finalize architecture without all attentions * Add all attentions modules * Finalize modular * Update given last version * Last update * Finalize model * Finalize converter * Update convert_glm_weights_to_hf.py * style * style * Create __init__.py * Add all inits * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Correct the rotary embeddings * Remove apply_residual_connection_post_layernorm (always false) * remove use_rms_norm (always true) * remove past_layer_norm (always true) * Update __init__.py * Update config and license * start adding tests and doc * Add doc + style * Update test_modeling_glm.py * Add dummies * Apply correct modeling * Refactor attention to follow llama * Update __init__.py * Update convert_glm_weights_to_hf.py * Correct bias * remove linear_bias and pdrop (never used) * apply modular * Simplify converter * remove dummies + style * add model_input_names * Add pretraining_tp to config for when eager attention is used * Update modular to remove all pretraining_tp * Update test_modeling_glm.py * Update the __all__ * Update __all__ * Update __init__.py * Update test_modeling_glm.py * add revisions * Add the correct repos and revisions * style * Update __init__.py * update exports * remove import of modular files * style * Apply Llama changes + refine converter * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * Update convert_glm_weights_to_hf.py * style * Use new modular converter * add pretrainedmodel to init * style * Update test_modeling_glm.py * Move config outside modular to please CI about docstrings * Add dummies to please CI * Update glm.md * Update glm.md
-
Lysandre Debut authored
* Informative * style * Informative 2 * Apply suggestions from code review Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> --------- Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com>
-
byi8220 authored
* fix broken require_torch_up_to_2_accelerators * make style
-
Raushan Turganbay authored
fix
-
- 17 Oct, 2024 13 commits
-
-
Arthur authored
* fix copies, skip fx for llama * style * re-fix copies * last? * style
-
Zach Mueller authored
* bookmark * Bookmark * Bookmark * Actually implement * Pass in kwarg explicitly * Adjust for if we do or don't have labels * Bookmark fix for od * bookmark * Fin * closer * Negate accelerate grad accum div * Fixup not training long enough * Add in compute_loss to take full model output * Document * compute_loss -> compute_loss_fn * Add a test * Refactor * Refactor * Uncomment tests * Update tests/trainer/test_trainer.py Co-authored-by:
Daniel Han <danielhanchen@gmail.com> --------- Co-authored-by:
Daniel Han <danielhanchen@gmail.com>
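A hedged sketch of what the new hook enables, assuming the keyword landed as `compute_loss_func` and the callback receives the full model output, the labels, and an accumulation-aware item count:

```python
import torch.nn.functional as F

def my_loss(outputs, labels, num_items_in_batch=None):
    # Receives the full model output object, not just the logits.
    logits = outputs.logits
    loss = F.cross_entropy(
        logits.view(-1, logits.size(-1)), labels.view(-1), reduction="sum"
    )
    # Dividing by the true item count keeps gradient accumulation from
    # changing the effective loss scale (the "grad accum div" above).
    if num_items_in_batch is not None:
        loss = loss / num_items_in_batch
    return loss

# trainer = Trainer(model=model, args=args, compute_loss_func=my_loss, ...)
```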
-
Pedro Cuenca authored
* Support Llama 3.2 conversion (text models) Co-authored-by:
Omar Sanseviero <osanseviero@gmail.com> * Fix rope factor * Update chat template Initialize from a well-known template. The guidance is that the changes should be applied to 3.1 models as well. * Remove import * Support Llama Guard 3 conversion * Tokenizer details * Fix eos added token in base models * Fix generation config for base models * Specify revision for known tokenizers * Style * Reuse chat templates for older models * Improve error when converting tokenizer < Llama 3 --------- Co-authored-by:
Omar Sanseviero <osanseviero@gmail.com>
-
Arthur authored
* quick fix * 3 losses * oops * fix * nits * check how it scales for special models * propagate for conditional detr * propagate * propagate * propagate * fixes * propagate changes * update * fixup * nits * f string * fixes * more fixes * ? * nit * arg annoying f string * nits * grumble * update * nit * refactor * fix fetch tests * nit * nit * Update src/transformers/loss/loss_utils.py Co-authored-by:
Kashif Rasul <kashif.rasul@gmail.com> * update * nit * fixup * make pass * nits * port code to more models * fixup * nits * arf * update * update * nits * update * fix * update * nits * fine * agjkfslga.jsdlkgjklas * nits * fix fx? * update * update * style * fix imports * update * update * fixup to fix the torch fx? --------- Co-authored-by:
Kashif Rasul <kashif.rasul@gmail.com>
-
Joao Gante authored
* tmp * all visited * test all * Update src/transformers/models/moshi/modeling_moshi.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * delete another one :D --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
David Chanin authored
There's a bug on M1 Macs with transformers >= 4.43.0 and torch >= 2.1.0 where, if a model has tied embeddings, the fast loading from #31771 causes a bus error when the model is actually run. This can be solved by disabling `_supports_param_buffer_assignment` for these models. More info in the comments in #33357.
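The workaround pattern, sketched on a hypothetical model class (the flag itself is the real `PreTrainedModel` attribute named above):

```python
from transformers import PreTrainedModel

class MyTiedEmbeddingModel(PreTrainedModel):  # hypothetical model class
    # Opt out of the fast buffer-assignment loading path from #31771;
    # with tied embeddings on M1 + torch >= 2.1.0, that path can
    # trigger a bus error once the model is run.
    _supports_param_buffer_assignment = False
```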
-
Guang Yang authored
Llama3_1b and Llama2_7b are ExecuTorch compatible Co-authored-by:
Guang Yang <guangyang@fb.com>
-
Name authored
* removes decord dependency; optimize np; Revert "optimize" (this reverts commit faa136b51ec4ec5858e5b0ae40eb7ef89a88b475); helpers as documentation; pydoc; missing keys * make fixup * require_av
ad <hi@arnaudiaz.com>
-
Sebastian Schoennenbeck authored
* Strip final message * Do full strip instead of rstrip * Retrigger CI --------- Co-authored-by:
Matt <rocketknight1@gmail.com>
-
Christopher McGirr authored
* fix(Wav2Vec2ForCTC): torch export. Resolves the issue described in #34022 by implementing the masking of the hidden states using an elementwise multiplication rather than indexing with assignment. The torch.export functionality seems to mark the tensor as frozen even though the update is legal. This change is a workaround for now to allow the export of the model as an FxGraph. Further investigation is required to find the real solution in PyTorch. * [run-slow] hubert, unispeech, unispeech_sat, wav2vec2
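A sketch of the workaround with illustrative names: the in-place index assignment is replaced by an elementwise blend that torch.export can trace.

```python
import torch

def mask_hidden_states(hidden_states, mask_time_indices, masked_embed):
    # Equivalent to `hidden_states[mask_time_indices] = masked_embed`,
    # expressed as an elementwise blend so torch.export does not reject
    # the update on a tensor it considers frozen.
    mask = mask_time_indices.unsqueeze(-1).to(hidden_states.dtype)
    return hidden_states * (1.0 - mask) + masked_embed * mask
```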
-
Yih-Dar authored
* ping * fix * fix * fix * remove runner * update members --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Amos You authored
* change cpu offload warning for fp8 quantization * change cpu offload warning for fp4 quantization * change cpu offload variable name for fp8 and fp4 quantization
-
larin92 authored
Update 'trainer._get_eval_sampler()' to support the 'group_by_length' argument. Trainer didn't support grouping by length for evaluation, which made evaluation slow with 'eval_batch_size' > 1. The updated 'trainer._get_eval_sampler()' method is based on 'trainer._get_train_sampler()'.
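A method-body sketch of the change, assuming it mirrors `_get_train_sampler()` and reuses `LengthGroupedSampler` (details may differ from the merged code):

```python
from torch.utils.data import SequentialSampler
from transformers.trainer_pt_utils import LengthGroupedSampler

def _get_eval_sampler(self, eval_dataset):
    # Group samples of similar length so evaluation with
    # eval_batch_size > 1 wastes less compute on padding.
    if self.args.group_by_length:
        return LengthGroupedSampler(self.args.eval_batch_size, dataset=eval_dataset)
    return SequentialSampler(eval_dataset)
```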
-
- 16 Oct, 2024 7 commits
-
-
Reza Rahemtola authored
* fix(utils): Avoid using torch Tensor or PIL Image if not available * Trigger CI --------- Co-authored-by:
Matt <rocketknight1@gmail.com>
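The guard pattern, sketched with a hypothetical helper (the availability checks are real `transformers.utils` functions):

```python
from transformers.utils import is_torch_available, is_vision_available

def short_repr(obj) -> str:  # hypothetical helper
    # Only reference torch.Tensor / PIL.Image when the optional
    # backend is installed, so the utility works in minimal installs.
    if is_torch_available():
        import torch
        if isinstance(obj, torch.Tensor):
            return f"Tensor(shape={tuple(obj.shape)})"
    if is_vision_available():
        from PIL import Image
        if isinstance(obj, Image.Image):
            return f"Image(size={obj.size})"
    return repr(obj)
```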
-
Yoni Gozlan authored
* nit fix wrong llava onevision name in tokenization auto * add qwen2_vl and fix style
-
steveepreston authored
Revert `accelerate` bug
-
alpertunga-bile authored
* auto-gptq requirement is removed & model is changed & tokenizer pad token is assigned * values func is changed with extensions & sequence key value bug is fixed * map key value check is added in ExtensionsTree * empty trimmed_ids bug is fixed * tail_id IndexError is fixed * empty trimmed_ids bug fix is updated for failed test * too much specific case for specific tokenizer is removed * input_ids check is updated * require auto-gptq import is removed * key error check is changed with empty list check * empty input_ids check is added * empty trimmed_ids fix is checked with numel function * usage change comments are added * test changes are commented * comment style and quality bugs are fixed * test comment style and quality bug is fixed
-
Yoach Lacombe authored
* clean mimi commit * some nits suggestions from Arthur * make fixup * first moshi WIP * converting weights working + configuration + generation configuration * finalize converting script - still missing tokenizer and FE and processor * fix saving model w/o default config * working generation * use GenerationMixin instead of inheriting * add delay pattern mask * fix right order: moshi codes then user codes * unconditional inputs + generation config * get rid of MoshiGenerationConfig * blank user inputs * update convert script: fix conversion, add tokenizer, feature extractor and bf16 * add and correct Auto classes * update modeling code, configuration and tests * make fixup * fix some copies * WIP: add integration tests * add dummy objects * propose better readability and code organisation * update tokenization tests * update docstrings, eval and modeling * add .md * make fixup * add MoshiForConditionalGeneration to ignore Auto * revert mimi changes * re * further fix * Update moshi.md * correct md formatting * move prepare causal mask to class * fix copies * fix depth decoder causal * fix and correct some tests * make style and update .md * correct config checkpoint * Update tests/models/moshi/test_tokenization_moshi.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update tests/models/moshi/test_tokenization_moshi.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * make style * Update src/transformers/models/moshi/__init__.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup * change firm in copyrights * update config with nested dict * replace einsum * make style * change split to True * add back split=False * remove tests in convert * Update tests/models/moshi/test_modeling_moshi.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * add default config repo + add model to FA2 docstrings * remove logits float * fix some tokenization tests and ignore some others * make style tokenization tests * update modeling with sliding window + update modeling tests * [run-slow] moshi * remove prepare for generation from CausalLM * isort * remove copied from * ignore offload tests * update causal mask and prepare 4D mask aligned with recent changes * further test refine + add back prepare_inputs_for_generation for depth decoder * correct conditional use of prepare mask * update slow integration tests * fix multi-device forward * remove previous solution to device_map * save_load is flaky * fix generate multi-devices * fix device * move tensor to int --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by:
Marc Sun <marc@huggingface.co>
-
Raushan Turganbay authored
* support embeds * use cache from config * style... * fix tests after rebase