- 11 Mar, 2024 8 commits
-
-
Matt authored
-
Matt authored
-
Klaus Hipp authored
[Docs] Fix FastSpeech2Conformer links
-
Yitong Huang authored
* add USE_TORCH_XLA env
* rename torch_tpu to torch_xla
* better is_torch_xla_available; fix some fsdp and performance issues
* fix format
* fix bug when pjrt_device is cpu
* fix bug
* fix the deprecation handling
---------
Co-authored-by: anw90 <ang868@gmail.com>
Co-authored-by: wangang.wa <wangang.wa@alibaba-inc.com>
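For context, a minimal sketch of how an env-var gate like `USE_TORCH_XLA` typically works; the helper name matches the commit, the rest is illustrative:

```python
import importlib.util
import os


def is_torch_xla_available() -> bool:
    """Illustrative sketch: report torch_xla availability, honoring USE_TORCH_XLA.

    Setting USE_TORCH_XLA=0 lets users opt out even when torch_xla is installed.
    """
    if os.environ.get("USE_TORCH_XLA", "1").upper() in ("0", "FALSE", "OFF"):
        return False
    return importlib.util.find_spec("torch_xla") is not None
```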
-
Damith Senanayake authored
* Fixing error #29332. The _check_and_enable_flash_attn_2() method receives a check_device_map parameter and fails.
* style fixup
-
Tanay Mehta authored
* add: initial script to train clm fim
* fix: if training model from scratch, new tokens will be added and embeddings resized
* fix: fixed attention_mask errors when generating FIM data
* fix: file formatted using black
* add: run_fim_no_trainer.py and fixed some comments in run_fim.py
* add: added fim examples to the README.md and ran code fixup
* fix: little bug in both fim training scripts
* fix: remove comment from notebook and added a note on fim related params
* fix: minor typo in README
* add: suggested minor changes to README and run_fim.py
* add: gradient_accumulation_steps and gradient_checkpointing args
* add: improved model embedding resizing
* add: pad_to_multiple_of and attn_implementation params
* add: requested minor changes
* add: deepspeed zero compatibility
* add: resize embeddings layer with zero3 support for fim model initialization
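For readers unfamiliar with FIM: a minimal sketch of the fill-in-the-middle transform such scripts apply, assuming the common `<fim_prefix>`/`<fim_suffix>`/`<fim_middle>` sentinel tokens (the split strategy here is illustrative, not the script's exact logic):

```python
import random


def apply_fim_transform(text: str, fim_rate: float = 0.5) -> str:
    """Illustrative FIM transform: split a sample into prefix/middle/suffix
    and rearrange it in PSM (prefix-suffix-middle) order with sentinel tokens."""
    if random.random() > fim_rate or len(text) < 3:
        return text  # leave a fraction of samples in plain left-to-right order
    lo, hi = sorted(random.sample(range(len(text)), 2))
    prefix, middle, suffix = text[:lo], text[lo:hi], text[hi:]
    return f"<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>{middle}"
```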
-
j-gc authored
-
Arthur authored
* post merge update
* nit
* oops
-
- 08 Mar, 2024 13 commits
-
-
Winston H authored
feat: use `warning_advice` instead of tensorflow warning
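`warning_advice` is the transformers logging helper for advisory warnings that users can silence; a brief usage sketch (the message text is illustrative):

```python
from transformers.utils import logging

logger = logging.get_logger(__name__)

# Unlike logger.warning, warning_advice is suppressed when the user sets
# TRANSFORMERS_NO_ADVISORY_WARNINGS=1, so purely advisory messages stay optional.
logger.warning_advice("TensorFlow-specific behavior detected; consider ...")
```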
-
Zach Mueller authored
* Fix eval thread fork bomb
* Keep eval dl persistent and prepare after so free_memory doesn't destroy it
* Add note
* Quality
-
Fanli Lin authored
[tests] use the correct `n_gpu` in `TrainerIntegrationTest::test_train_and_eval_dataloaders` for XPU (#29307)
* fix n_gpu
* fix style
-
Yoach Lacombe authored
fix total silence input with no_speech_threshold
-
Yun Dai authored
fix FSDP config
-
Jonatan Kłosko authored
* Make sliding window size inclusive in eager attention
* Fix tests
-
liangjs authored
* fix stablelm dropout argument type error
* fix docs of _flash_attention_forward
* fix all docs of _flash_attention_forward
* fix docs of _flash_attention_forward in starcoder2
---------
Co-authored-by: oliang <oliang@tencent.com>
-
Fanli Lin authored
* use torch_device
* skip for XPU
* Update tests/generation/test_utils.py
  Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Clémentine Fourrier authored
-
Wang, Yi authored
* fix image-to-text batch incorrect output issue
  Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
* add ci test
  Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
* update ci test
  Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
-
Fanli Lin authored
* add sacremoses check
* fix style
* for FlaubertTokenizer
* HerbertTokenizer fix
* add typeHint
* Update src/transformers/testing_utils.py
  Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* make less skipped
* make quality
* remove import
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Joao Gante authored
* left-padding test revisited
* Apply suggestions from code review
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Pedro Cuenca authored
Potential typo in mlx support
-
- 07 Mar, 2024 9 commits
-
-
Nick DeGroot authored
* 🐛 Fix vision encoder decoder positional arg
* ✅ Add test for VisionEncoderDecoder with LayoutLMv3 encoder
---------
Co-authored-by: Nick DeGroot <1966472+nickthegroot@users.noreply.github.com>
-
Alvaro Bartolome authored
* Set `inputs` as kwarg in `TextClassificationPipeline`
  This aligns the `TextClassificationPipeline` with the rest of the pipelines and makes calls like `pipeline(**{"inputs": "text"})` possible, which previously failed because `*args` was used instead.
* Add `noqa: C409` on `tuple([inputs],)`
  Even though it is discouraged by the linter, the cast `tuple(list(...),)` is required here: otherwise the original list in `inputs` would be transformed into a `tuple` and the elements 1...N would be ignored by the `Pipeline`
* Run `ruff format`
* Simplify `tuple` conversion with `(inputs,)`
  Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
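The change in a nutshell, shown with the pipeline's default model and an illustrative input:

```python
from transformers import pipeline

classifier = pipeline("text-classification")

# Positional call, as before:
classifier("This movie was great!")

# Keyword call, enabled by this change:
classifier(**{"inputs": "This movie was great!"})
```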
-
amyeroberts authored
* Fall back to pytorch model for now
* Fix up
-
Alex Ishida authored
Add support for loading safetensors files saved with the `mlx` metadata format.
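For reference, the saving framework can be read from the safetensors header metadata; a small sketch (the file path is illustrative):

```python
from safetensors import safe_open

# The "format" entry in the header metadata identifies the saving framework,
# e.g. "pt", "tf", "flax", or "mlx" for checkpoints written by MLX.
with safe_open("model.safetensors", framework="pt") as f:
    print(f.metadata())  # e.g. {"format": "mlx"}
```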
-
Raushan Turganbay authored
* flava multimodal add attn mask
* make style
* check mask is not None
-
Ashok Pon Kumar authored
Signed-off-by: Ashok Pon Kumar Sree Prakash <ashokponkumar@gmail.com>
-
Lysandre Debut authored
Revert "Automatic safetensors conversion when lacking these files (#29390)" This reverts commit a69cbf4e.
-
Joao Gante authored
-
regisss authored
* Enable BLIP for auto VQA
* Make style
* Add VQA to BLIP pipeline tests
-
- 06 Mar, 2024 10 commits
-
-
Park Jun authored
* Fix: Disable torch.autocast in RotaryEmbedding of Gemma and LLaMa for MPS devices
* Update src/transformers/models/gemma/modeling_gemma.py
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update llama and gemma rope to use cpu on mps devices
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
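A minimal sketch of the pattern behind the fix, assuming a simplified RoPE frequency computation (shapes and names are illustrative, not the models' exact code):

```python
import torch


def rotary_freqs(inv_freq: torch.Tensor, position_ids: torch.Tensor) -> torch.Tensor:
    """Compute RoPE frequencies in full float32, with autocast disabled.

    torch.autocast does not accept "mps" as a device_type on all torch
    versions, so the context falls back to "cpu" there; the tensors
    themselves stay on the MPS device.
    """
    device_type = position_ids.device.type
    device_type = device_type if device_type != "mps" else "cpu"
    with torch.autocast(device_type=device_type, enabled=False):
        # (1, dim, 1) * (batch, 1, seq) -> (batch, dim, seq)
        freqs = inv_freq.float()[None, :, None] * position_ids.float()[:, None, :]
    return freqs
```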
-
Glen Taggart authored
Substantially reduce memory usage in _update_causal_mask for large batches by using .expand instead of .repeat [needs tests+sanity check] (#29413)
* try to fix gemma mem use
* fix: handle attention mask dim==2 case
* remove logits=logits.float()
* clean up + add llama
* apply formatting
* readability edit: swap order of items being multiplied
* revert change unrelated to PR
* revert black autoformat
* switch to one .to
* Accept style edits
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
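The heart of the memory saving: `.repeat` materializes copies while `.expand` returns a broadcasted view over the same storage. A quick sketch (sizes illustrative; expanded views must not be written to in place):

```python
import torch

mask = torch.ones(1, 1, 1024, 1024)

# .repeat allocates a full copy per batch element: ~batch_size x the memory.
repeated = mask.repeat(8, 1, 1, 1)

# .expand returns a broadcasted view over the same storage: no new allocation.
expanded = mask.expand(8, -1, -1, -1)

assert expanded.data_ptr() == mask.data_ptr()  # same underlying storage
```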
-
Alvaro Bartolome authored
-
Moshe Berchansky authored
* added the max_matching_ngram_size parameter into the GenerationConfig, for the PromptLookupCandidateGenerator
* switched back to keyword arguments
* added PromptLookupCandidateGenerator docstring for its parameters
* ruff reformat
* Update src/transformers/generation/configuration_utils.py
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
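A hedged usage sketch of the new parameter, assuming the standard prompt lookup decoding entry point (`prompt_lookup_num_tokens`); the model choice is illustrative:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tok("The quick brown fox jumps over the", return_tensors="pt")

# prompt_lookup_num_tokens enables the PromptLookupCandidateGenerator;
# max_matching_ngram_size (added here) caps the n-gram length matched
# against the prompt when proposing candidate continuations.
out = model.generate(
    **inputs,
    prompt_lookup_num_tokens=10,
    max_matching_ngram_size=2,
    max_new_tokens=20,
)
print(tok.decode(out[0], skip_special_tokens=True))
```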
-
Joao Gante authored
-
Zach Mueller authored
* Fix test failure
* use item
-
Ofir Zafrir authored
-
Joao Gante authored
-
Joao Gante authored
-
Matthew Hoffman authored
* Fix TrainingArguments regression with torch <2.0.0 for dataloader_prefetch_factor
  dataloader_prefetch_factor was added to TrainingArguments in #28498 with the default value None, but versions of torch <2.0.0 do not accept None and will raise an error if num_workers == 0 and prefetch_factor != 2
* Add is_torch_available() check
* Use is_torch_greater_or_equal_than_2_0
  add back check for dataloader_prefetch_factor
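A minimal sketch of the compatibility guard the fix implies (a standalone DataLoader example, not the Trainer's actual code):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

dataset = TensorDataset(torch.arange(10))
prefetch_factor = None  # TrainingArguments default after #28498

# torch < 2.0 rejects prefetch_factor=None (and complains when
# num_workers == 0 and prefetch_factor != 2), so only forward the
# argument when it was explicitly set.
kwargs = {}
if prefetch_factor is not None:
    kwargs["prefetch_factor"] = prefetch_factor
    kwargs["num_workers"] = 2  # prefetch_factor requires worker processes
loader = DataLoader(dataset, batch_size=2, **kwargs)
```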
-