Commits · v4.32-release · 某某某 / transformers-new

28 Aug, 2023 4 commits
- Release: v4.32.1 · ccb92be2
  Lysandre authored 1 year ago
  
  v4.32.1
  
  ccb92be2
- Skip broken tests · 657eb26c
  Sylvain Gugger authored 1 year ago
  
  657eb26c
- [idefics] small fixes (#25764) · 13aef138
  Stas Bekman authored 1 year ago
  
  13aef138
- Generate: add missing logits processors docs (#25653) · 6836e9dd
  Joao Gante authored 1 year ago
  
  6836e9dd
23 Aug, 2023 3 commits

Fix bloom add prefix space (#25652) · e82040e1

Arthur authored 1 year ago

* properly support Sequence of pretokenizers

* actual fix

* make sure the fix works. Tests are not working for sure!

* hacky way

* add TODO

* update

* add a todo

* nits

* rename test

* nits

* rename test

e82040e1

[`SPM`] Patch `spm` Llama and T5 (#25656) · 9d42e402

Arthur authored 1 year ago

* hot fix

* only encode with string prefix if starts with prefix

* styling

* add a new test

* fixup

9d42e402

removing unnecessary extra parameter (#25643) · 27f91578
Rafael Padilla authored 1 year ago

27f91578

22 Aug, 2023 2 commits
- Put IDEFICS in the right section of the doc (#25650) · 6a029a8b
  Sylvain Gugger authored 1 year ago
  
  6a029a8b
- v4.32.0: Release · 41aef337
  Sylvain Gugger authored 1 year ago
  
  v4.32.0
  
  41aef337
21 Aug, 2023 8 commits

Fix test_modeling_mpt typo in model id (#25606) · 26e4c332
Francisco Kurucz authored 1 year ago
```
Fix model id in get_large_model_config on file test_modeling_mpt
```
26e4c332
Run doctest for new files (#25588) · 5ddafeca
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
5ddafeca
Ignore all exceptions from signal in dynamic code (#25623) · 976bd738
Sylvain Gugger authored 1 year ago

976bd738
Hotfix · be94bc5e
ydshieh authored 1 year ago

be94bc5e
reattach hooks when using `resize_token_embeddings` (#25596) · 26a38a90
Marc Sun authored 1 year ago
```
* reattach hooks

* fix style
```
26a38a90

new model: IDEFICS via HuggingFaceM4 (#24796) · 3665af0c

Stas Bekman authored 1 year ago


* rename

* restore

* mappings

* unedited tests+docs

* docs

* fixes

* fix auto-sync breakage

* cleanup

* wip

* wip

* add fetch_images

* remove einops dependency

* update

* fix

* fix

* fix

* fix

* fix

* re-add

* add batching

* rework

* fix

* improve

* add Leo as I am extending his work

* cleanup

* fix

* cleanup

* slow-test

* fix

* fix

* fixes

* deal with warning

* rename modified llama classes

* rework fetch_images

* alternative implementation

* cleanup

* strict version

* cleanup

* [`IDEFICS`] Fix idefics ci (#25056)

* Fix IDEFICS CI

* fix test file

* fixup

* some changes to make tests pass

* fix

* fixup

* Update src/transformers/models/idefics/configuration_idefics.py

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

---------

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* remove compat checks

* style

* explain that Idefics is not for training from scratch

* require pt>=2.0

* fix idefics vision config (#25092)

* fix idefics vision config

* fixup

* clean

* Update src/transformers/models/idefics/configuration_idefics.py

---------

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* cleanup

* style

* cleanup

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* upcase

* sequence of images

* handle the case with no images

* Update src/transformers/image_processing_utils.py

Co-authored-by: Victor SANH <victorsanh@gmail.com>

* support pure lm take 2

* support tokenizer options

* parameterize num_channels

* fix upcase

* s|IdeficsForCausalLM|IdeficsForVisionText2Text|g

* manual to one line

* addressing review

* unbreak

* remove clip dependency

* fix test

* consistency

* PIL import

* Idefics prefix

* Idefics prefix

* hack to make tests work

* style

* fix

* fix

* revert

* try/finally

* cleanup

* clean up

* move

* [`IDEFICS`] Fix idefics config refactor (#25149)

* refactor config

* nuke init weights

* more refactor

* oops

* remove visual question answering pipeline support

* Update src/transformers/models/idefics/clip.py

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/models/idefics/modeling_idefics.py

* cleanup

* mv clip.py vision.py

* tidyup

---------

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

* fix

* license

* condition on pt

* fix

* style

* fix

* rm torchvision dependency, allow custom transforms

* address review

* rework device arg

* add_eos_token

* s/transforms/transform/

* fix top level imports

* fix return value

* cleanup

* cleanup

* fix

* style

* license

* license

* Update src/transformers/models/idefics/image_processing_idefics.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add a wrapper to freeze vision layers

* tidyup

* use the correct std/mean settings

* parameterize values from config

* add tests/models/idefics/test_image_processing_idefics.py

* add test_processor_idefics.py

* cleanup

* cleanups

* fix

* fix

* move to the right group

* style

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add perceiver config

* reset

* missing arg docs

* Apply suggestions from code review

Co-authored-by: Leo Tronchon <leo.tronchon@gmail.com>

* address review comments

* inject automatic end of utterance tokens (#25218)

* inject automatic end of utterance tokens

* fix

* fix

* fix

* rework to not use the config

* not end_of_utterance_token at the end

* Update src/transformers/models/idefics/processing_idefics.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address review

* Apply suggestions from code review

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/image_processing_utils.py

Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* [`Idefics`] add image_embeddings option in generate-related methods (#25442)

* add image_embeddings option in generate-related methods

* style

* rename image_embeddings and allow perceiver embeddings precomputation

* compute embeddings within generate

* make is_encoder_decoder= True the default in config

* nested if else fix

* better triple check

* switch if elif order for pixel values / img embeds

* update model_kwargs perceiver only at the end

* use _prepare_model_inputs instead of encoder_decoder logic

* fix comment typo

* fix config default for is_encoder_decoder

* style

* add typehints

* precompute in forward

* doc builder

* style

* pop instead of get image hidden states

* Trigger CI

* Update src/transformers/models/idefics/modeling_idefics.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/idefics/modeling_idefics.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix * + indentation + style

* simplify a bit the use_resampler logic using comments

* update docstrings

* Trigger CI

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix rebase changes

* unbreak #25237 - to be fixed in follow up PRs

* is_composition = False

* no longer needed

---------

Co-authored-by: leot13 <leo.tronchon@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Victor SANH <victorsanh@gmail.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

3665af0c

🌐 [i18n-KO] Translated `perf_train_tpu_tf.md` to Korean (#25433) · 3c99f24a

Hyeonseo Yun authored 1 year ago


* docs: ko: perf_train_tpu_tf.md

* feat: nmt and manual edit perf_train_tpu_tf.md

* fix: resolve suggestions

Co-authored-by: Sangam Lee <74291999+augustinLib@users.noreply.github.com>
Co-authored-by: Kim haewon <ehdvkf02@naver.com>
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>

---------

Co-authored-by: Sangam Lee <74291999+augustinLib@users.noreply.github.com>
Co-authored-by: Kim haewon <ehdvkf02@naver.com>
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>

3c99f24a

Make TTS automodels importable (#25595) · 2efa0d42

Omar Sanseviero authored 1 year ago

* Add auto model for spectrogram/waveform

* Add doc and install

* Add dummy objects

* Did I miss anything?

2efa0d42

18 Aug, 2023 10 commits

[`TokenizerFast`] Fix setting prefix space in __init__ (#25563) · ef153425

Arthur authored 1 year ago

* properly support Sequence of pretokenizers

* actual fix

* make sure the fix works. Tests are not working for sure!

* hacky way

* add TODO

* update

* add a todo

ef153425

fix z3 init when using accelerate launcher (#25589) · 636acc75
Sourab Mangrulkar authored 1 year ago

636acc75
[Time series Informer] fix dtype of cumsum (#25431) · 8d2f953f
Kashif Rasul authored 1 year ago
```
* fix dtype of cumsum

* add comment
```
8d2f953f

[`Llama`] remove prompt and fix prefix finetuning (#25565) · bc3e20dc

Arthur authored 1 year ago

* nit

* update

* make sure use_default_system_prompt is saved

* update checkpointing

* consistency

* use_default_system_prompt for test

bc3e20dc

[`split_special_tokens`] Add support for `split_special_tokens` argument to encode (#25081) · 30b3c46f

Arthur authored 1 year ago

* draft changes

* update and add tests

* styling for no

* move test

* path to usable model

* update test

* small update

* update bertbased tokenizers

* don't use kwargs for _tokenize

* don't use kwargs for _tokenize

* fix copies

* update

* update test for special tokenizers

* fixup

* skip two tests

* remove pdb breakpoint()

* wowo

* rewrite custom tests

* nits

* revert change in target keys

* fix markup lm

* update documentation of the argument

30b3c46f

Replaces calls to `.cuda` with `.to(torch_device)` in tests (#25571) · 9d7afd25

Alex McKinney authored 1 year ago


* Replaces calls to `.cuda` with `.to(torch_device)` in tests
`torch.Tensor.cuda()` is a pre-0.4 solution to changing a tensor's device. It is recommended to prefer `.to(...)` for greater flexibility and error handling. Furthermore, this makes it more consistent with other tests (that tend to use `.to(torch_device)`) and ensures the correct device backend is used (if `torch_device` is neither `cpu` nor `cuda`).

* addressing review comments

* more formatting changes in Bloom test

* `make style`

* Update tests/models/bloom/test_modeling_bloom.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixes style failures

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

9d7afd25

Added missing parenthesis in call to is_fsdp_enabled (#25585) · c45aab75
Martin Malmsten authored 1 year ago
```
Calling function is_fsdp_enabled instead of checking if it is not None
```
c45aab75
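
The bug class behind this fix can be sketched in plain Python. This is a minimal illustration, not the code touched by the commit: the `is_fsdp_enabled` below is a hypothetical stand-in that always returns `False`, and `uses_fsdp_buggy` / `uses_fsdp_fixed` are made-up helper names. The point is that a function object is never `None`, so testing the name without calling it always succeeds.

```python
def is_fsdp_enabled() -> bool:
    """Hypothetical stand-in: pretend FSDP is not enabled in this process."""
    return False


def uses_fsdp_buggy() -> bool:
    # Bug pattern: the function object itself is tested, never its result.
    # A function object is not None, so this is True in every environment.
    return is_fsdp_enabled is not None


def uses_fsdp_fixed() -> bool:
    # Fix: add the parentheses so the function is called and its result used.
    return is_fsdp_enabled()


print(uses_fsdp_buggy())  # True, regardless of the actual FSDP state
print(uses_fsdp_fixed())  # False, the value the stand-in returns
```

Linters can catch this pattern, but only a call with parentheses consults the real state.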

[`Docs` / `BetterTransformer` ] Added more details about flash attention + SDPA (#25265) · 940d1a76

Younes Belkada authored 1 year ago


* added more details about flash attention

* correct and add more details

* Apply suggestions from code review

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* few modifs

* more details

* up

* Apply suggestions from code review

Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

* adapt from suggestion

* Apply suggestions from code review

Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

* trigger CI

* Apply suggestions from code review

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix nits and copies

* add new section

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

940d1a76

Suggestions on Pipeline_webserver (#25570) · 08e32519

Kihoon Son authored 1 year ago


* Suggestions on Pipeline_webserver

docs: reorder the warning tip for pseudo-code

Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>

* Apply suggestions from code review

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ko/pipeline_webserver.md

Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

---------

Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

08e32519

Fix typo in example code (#25583) · 659ab042
Amélie T. Reymond authored 1 year ago
```
`lang_code_to_id("en_XX")` => `lang_code_to_id["en_XX"]`

lang_code_to_id is a dict
```
659ab042
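
A short sketch of why the parentheses were wrong, assuming only what the commit message states (that `lang_code_to_id` is a dict). The mapping below is a hypothetical stand-in with placeholder ids, not the tokenizer's real vocabulary.

```python
# Hypothetical mapping; the ids are placeholders, not real vocabulary ids.
lang_code_to_id = {"en_XX": 1, "ro_RO": 2}

# Correct: a dict is indexed with square brackets.
en_id = lang_code_to_id["en_XX"]

# Incorrect: a dict is not callable, so parentheses raise TypeError.
try:
    lang_code_to_id("en_XX")
    call_error = None
except TypeError as exc:
    call_error = exc

print(en_id)
print(type(call_error).__name__)
```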

17 Aug, 2023 13 commits

add warning for 8bit optimizers (#25575) · 4a27c13f
Marc Sun authored 1 year ago
```
* add warning for 8bit optimizers

* protect import
```
4a27c13f
Skip `test_contrastive_generate` for `TFXLNet` (#25574) · 427adc89
Yih-Dar authored 1 year ago
```
* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
427adc89

Add Text-To-Speech pipeline (#24952) · b8f69d0d

Yoach Lacombe authored 1 year ago


* add AutoModelForTextToSpeech class

* add TTS pipeline and testing

* add docstrings to text_to_speech pipeline

* fix torch dependency

* correct 'processor is None' case in Pipeline

* correct repo id

* modify text-to-speech -> text-to-audio

* remove processor

* rename text_to_speech pipelines files to text_audio

* add textToWaveform and textToSpectrogram instead of textToAudio classes

* update TTS pipeline to the bare minimum

* update tests TTS pipeline

* make style and erase useless import torch in TTS pipeline tests

* modify how to check if generate or forward in TTS pipeline

* remove unnecessary extra new lines

* Apply suggestions from code review

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* refactor input_texts -> text_inputs

* correct docstrings of TTS.__call__

* correct the shape of generated waveform

* take care of Bark tokenizer special case

* correct run_pipeline_test TTS

* make style

* update TTS docstrings

* address Sylvain nit refactors

* make style

* refactor into one liners

* correct squeeze

* correct way to test if forward or generate

* Update output audio waveform shape

* make style

* correct import

* modify how the TTS pipeline test if a model can generate

* align shape output of TTS pipeline with consistent shape

---------

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

b8f69d0d

add util for ram efficient loading of model when using fsdp (#25107) · c4c0ceff

Sourab Mangrulkar authored 1 year ago

* add util for ram efficient loading of model when using fsdp

* make fix-copies

* fixes 😅

* docs

* making it further easier to use

* rename the function

* refactor to handle fsdp ram efficiency in `from_pretrained`

* fixes

* fixes

* fixes

* update

* fixes

* revert `load_pretrained_model_only_on_rank0`

* resolve `load_from_checkpoint`

c4c0ceff

Revert "change version (#25387)" (#25573) · 4e1dee0e
Marc Sun authored 1 year ago
```
This reverts commit 3a05e010.
```
4e1dee0e
[`Tests`] Fix failing 8bit test (#25564) · d4c0aa14
Younes Belkada authored 1 year ago
```
* fix failing 8bit test

* trigger CI
```
d4c0aa14
[`NllbMoe`] Update code to properly support loss computation (#25429) · 181d778f
Arthur authored 1 year ago
```
* update nllb_moe

* fix

* doc nits

* nits

* add a small test

* fixup

* remove adapted from
```
181d778f

Inconsistency in PreTrainedModel.resize_token_embeddings When ZeRO3 Is Enabled (#25394) · 9264fc91

Sina authored 1 year ago

* Inconsistency in PreTrainedModel.resize_token_embeddings

This PR addresses https://github.com/huggingface/transformers/issues/25241.

In the previous implementation, when ZeRO stage 3 was enabled, resize_token_embeddings would create independent PyTorch weights on each device. Here we ensure that new embeddings are created with DeepSpeed init and are properly partitioned across devices.

* formatting with black

* adding the removed comments back in

---------

Co-authored-by: Sina Moeini <smoeini@amazon.com>

9264fc91

🚨 [`SPM`] Finish fix spm models 🚨 (#25224) · b4d55488

Arthur authored 1 year ago

* fix EVERYTHING

* more fixes

* ⚗️⚗️ Tokenizer magic ⚗️⚗️

* wrong value but test passes for the TODO

* update

* update

* safe protobuf import?

* style

* non gated repo

* update

* fixup

* Update src/transformers/models/llama/tokenization_llama.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/llama/tokenization_llama.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/t5/test_tokenization_t5.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* nits

* fix t5 too

* use assert equal

* fix llama decoding

* nits on t5

* fixup

* only remove the prefix space, not other spaces

* more decoding tests and more todos

* fix CI as well

* fixup

* skip failing test on CI (it's TF, it's OK)

* skip test_subword_regularization_tokenizer that is also crashing on the CI for TF

* update llama

* revert good fixes

* fixup

* empty

* explain why we need to encode with an additional token

* better warning?

* nits

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

b4d55488

[`SwitchTransformers`] Remove unused module (#25427) · 5347d000
Arthur authored 1 year ago
```
* remove unused module

* remove old feed_forward_proj

* fixup
```
5347d000

[`resize_embedding`] Introduce `pad_to_multiple_of` and guidance (#25088) · d6bf08f7

Arthur authored 1 year ago

* fix

* revert changes and update resizing of embedding layer

* use warning

* fixup

* more styling nits

* fix all tests that overload the embedding tests

* 👀👀 remove breakpoint

* remove useless overload + overload correctly where needed

* resize lm head with new vocab size

* reverse not necessary changes

* style

* fix CIs!

* fix last CI tests, adapt bark and Marian

* fixup

d6bf08f7
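
The arithmetic behind a `pad_to_multiple_of` option can be sketched as follows, assuming the option rounds the requested embedding size up to the next multiple of the given value. The helper name `padded_size` is hypothetical and is not the library's function; the sizes in the examples are illustrative.

```python
def padded_size(num_tokens: int, pad_to_multiple_of: int) -> int:
    """Round num_tokens up to the nearest multiple of pad_to_multiple_of."""
    if num_tokens < 0 or pad_to_multiple_of <= 0:
        raise ValueError("num_tokens must be >= 0 and pad_to_multiple_of must be > 0")
    # Integer ceiling division, then scale back up to a multiple.
    return ((num_tokens + pad_to_multiple_of - 1) // pad_to_multiple_of) * pad_to_multiple_of


print(padded_size(32001, 64))  # 32064: one added token spills into the next block of 64
print(padded_size(32000, 64))  # 32000: already a multiple, so unchanged
print(padded_size(50257, 8))   # 50264
```

Rows added by the padding do not correspond to any token in the tokenizer, so the embedding matrix can end up larger than the tokenizer's vocabulary.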

Skip `test_beam_search_xla_generate_simple` for `T5` (#25566) · d2871b29
Yih-Dar authored 1 year ago
```
* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d2871b29

Adds `TRANSFORMERS_TEST_DEVICE` (#25506) · 1791ef8d

Alex McKinney authored 1 year ago

* Adds `TRANSFORMERS_TEST_DEVICE`
Mirrors the same API in the diffusers library. Useful in transformers
too.

* replace backend checking with trying `torch.device`

* Adds better error message for unknown test devices

* `make style`

* adds documentation showing `TRANSFORMERS_TEST_DEVICE` usage.

1791ef8d
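
A rough sketch of the idea, for illustration only: read a device override from `TRANSFORMERS_TEST_DEVICE`, fall back to a default, and fail with a clear message for unknown values. The commit notes say the actual change validates the value by trying `torch.device`; this sketch swaps in a small allow-list so that it runs without PyTorch. The names `resolve_test_device` and `KNOWN_DEVICE_TYPES`, and the contents of the allow-list, are assumptions.

```python
import os

# Illustrative subset of device types (an assumption, not an exhaustive list).
KNOWN_DEVICE_TYPES = {"cpu", "cuda", "mps", "xpu"}


def resolve_test_device(environ=None, default="cpu"):
    """Return the device string tests should use (hypothetical helper)."""
    environ = os.environ if environ is None else environ
    requested = environ.get("TRANSFORMERS_TEST_DEVICE")
    if requested is None:
        # No override set: keep the default device.
        return default
    # Accept an optional index suffix such as "cuda:1".
    device_type = requested.split(":", 1)[0]
    if device_type not in KNOWN_DEVICE_TYPES:
        raise RuntimeError(
            f"Unknown testing device specified by TRANSFORMERS_TEST_DEVICE: {requested!r}"
        )
    return requested


print(resolve_test_device({}))                                      # cpu
print(resolve_test_device({"TRANSFORMERS_TEST_DEVICE": "cuda:1"}))  # cuda:1
```

With the variable exported in the shell, for example `TRANSFORMERS_TEST_DEVICE="cuda:1"`, every test that moves tensors with `.to(torch_device)` would target that device.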