Commits · da7ea9a4e337eb2eed204090fe38198418c01134 · zhusg / transformers-new

07 Nov, 2023 1 commit

[Whisper] Block language/task args for English-only (#27322) · da7ea9a4

Sanchit Gandhi authored 1 year ago


* [Whisper] Block language/task args for English-only

* Update src/transformers/models/whisper/modeling_whisper.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

da7ea9a4

06 Nov, 2023 10 commits

[docs] fixed links with 404 (#27327) · 9beb2737
Maria Khalusova authored 1 year ago
```
* fixed links with 404

* make style
```
9beb2737

Fix `Kosmos2Processor` batch mode (#27323) · 1b20e2bb

Yih-Dar authored 1 year ago


* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

1b20e2bb

Fix VideoMAEforPretrained dtype error (#27296) · a6e0d5a2
Iker García-Ferrero authored 1 year ago
```
* Fix dtype error

* Fix mean and std dtype

* make style
```
a6e0d5a2

Update sequence_classification.md (#27281) · e9dbd392

Akshay Chintalapati authored 1 year ago

I'm adding accelerate as one of the libraries to install because otherwise when running the Trainer, the model errorr out with the error.

ImportError: Using the `Trainer` with `PyTorch` requires `accelerate>=0.20.1`: Please run `pip install transformers[torch]` or `pip install accelerate -U`

Further context:
1. I've tried this across different environments so I believe that the environment is not the issue.
2. I had the latest transformers library version running.
3. Typically even after install accelerate and import it, it wouldn't resolve the issue until I restart the notebook and try again.

e9dbd392

[`PretrainedTokenizer`] add some of the most important functions to the doc (#27313) · 147f7746
Arthur authored 1 year ago

147f7746
enable memory tracker metrics for npu (#27280) · 1ffc4dee
Hz, Ji authored 1 year ago

1ffc4dee
Remove an unexpected argument for FlaxResNetBasicLayerCollection (#27272) · d7dcfa89
Pingzhi Li authored 1 year ago
```
Remove unexpected argument for FlaxResNetBasicLayerCollection
```
d7dcfa89
Update doctest workflow file (#27306) · eef7ea98
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
eef7ea98
Fix daily CI image build (#27307) · d788d37d
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d788d37d
Fix tokenizer export for LLamaTokenizerFast (#27222) · b026b5ca
Mayank Mishra authored 1 year ago
```
* fix tokenizer

* fix tokenizer
```
b026b5ca

03 Nov, 2023 12 commits

translate run_scripts.md to chinese (#27246) · cc3e4781

jiaqiw09 authored 1 year ago

* translate run_scripts.md to chinese

* translate run_scripts.md to chinese

* translate run_scripts.md to chinese

cc3e4781

translate autoclass_tutorial to chinese (#27269) · bf7cfac2
jiaqiw09 authored 1 year ago
```
* translate autoclass_tutorial.md  to chinese

* translate update
```
bf7cfac2

[`FA2`] Add flash attention for for `DistilBert` (#26489) · 1ac2463d

Susnato Dhar authored 1 year ago

* flash attention added for DistilBert

* fixes

* removed padding_masks

* Update modeling_distilbert.py

* Update test_modeling_distilbert.py

* style fix

1ac2463d

[Docs] Model_doc structure/clarity improvements (#26876) · 5964f820

Maria Khalusova authored 1 year ago

* first batch of structure improvements for model_docs

* second batch of structure improvements for model_docs

* more structure improvements for model_docs

* more structure improvements for model_docs

* structure improvements for cv model_docs

* more structural refactoring

* addressed feedback about image processors

5964f820

[`Docs` / `SAM` ] Reflect correct changes to run inference without OOM (#27268) · ad8ff962
Younes Belkada authored 1 year ago
```
Update sam.md
```
ad8ff962
Fix switch transformer mixed precision issue (#27220) · f13f544a
Shiyu Li authored 1 year ago
```
* Fix mixed precision error for switch transformer

* Fixup
```
f13f544a

Update the ConversationalPipeline docstring for chat templates (#27250) · db69bd88

Matt authored 1 year ago

* Update the ConversationalPipeline docstring now that we're using chat templates

* Direct access to conversation.messages

* Explain the string init

db69bd88

[docs] Custom model doc update (#27213) · 011b15c1
Maria Khalusova authored 1 year ago
```
doc update
```
011b15c1

Avoid many failing tests in doctesting (#27262) · af8d1dc3

Yih-Dar authored 1 year ago


* fix

* update

* update

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

af8d1dc3

[`PEFT` / `Tests` ] Fix peft integration failing tests (#27258) · 8f1a43cd
Younes Belkada authored 1 year ago
```
fix peft integration issues
```
8f1a43cd

Refactor: Use Llama RoPE implementation for Falcon (#26933) · 05ea7b79

Tom Aarsen authored 1 year ago

* Use Llama RoPE implementation for Falcon

+ Add copy functionalities

* Use standard cache format for Falcon

* Simplify apply_rotary_pos_emb, copy from Llama

* Remove unnecessary cache conversion test

We don't need to convert any caches anymore!

* Resolve copy complaint

05ea7b79

Fuyu protection (#27248) · e9a6c72b
Lysandre Debut authored 1 year ago

e9a6c72b

02 Nov, 2023 15 commits

Fixed base model class name extraction from PeftModels (#27162) · 552ff244

Komal Kumar authored 1 year ago

* Fixed base model class name extraction from PeftModels

* Changes to first unwrap the model then extract the base model name

* Changed base_model to base_model.model to stay consistent with peft model abstractions

552ff244

Removed the redundant SiLUActivation class. (#27136) · 49912168

Chi authored 1 year ago

* Removed the redundant SiLUActivation class and now use nn.functional.silu directly.

* I apologize for adding torch.functional.silu. I have replaced it with nn.SiLU.

49912168

translate peft.md to chinese (#27215) · 00d8502b

jiaqiw09 authored 1 year ago

* tranlsate peft.md to chinese

* translate peft.md to chinese

* fix missing link

00d8502b

Dev version · bc78fd12
Lysandre authored 1 year ago

bc78fd12

Enrich TTS pipeline parameters naming (#26473) · 0ed6729b

Yoach Lacombe authored 1 year ago


* enrich TTS pipeline docstring for clearer forward_params use

* change token leghts

* update Pipeline parameters

* correct docstring and make style

* fix tests

* make style

* change music prompt

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* raise errors if generate_kwargs with forward-only models

* make style

---------

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

0ed6729b

Remove redundant code from T5 encoder mask creation (#27216) · 147e8ce4
Pietro Lesci authored 1 year ago
```
* remove redundant code

* update

* add typecasting

* make `attention_mask` float again
```
147e8ce4
Generate: return `past_key_values` (#25086) · a6c82d45
Joao Gante authored 1 year ago

a6c82d45
fix-deprecated-exllama-arg (#27243) · 441c3e0d
Marc Sun authored 1 year ago
```
fix-exllama
```
441c3e0d

Fixing m4t. (#27240) · 8801861d

Nicolas Patry authored 1 year ago

* Fixing m4t.

* Trying to remove comparison ? Odd test failure.

* Adding shared. But why on earth does it hang ????

* Putting back the model weights checks the test is silently failing on
cuda.

* Fix style + unremoved comment.

8801861d

Fix safetensors failing tests (#27231) · 443bf5e9

Lysandre Debut authored 1 year ago


* Fix Kosmos2

* Fix ProphetNet

* Fix MarianMT

* Fix M4T

* XLM ProphetNet

* ProphetNet fix

* XLM ProphetNet

* Final M4T fixes

* Tied weights keys

* Revert M4T changes

* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

443bf5e9

Wrap `_prepare_4d_causal_attention_mask` as a leaf function (#27236) · 4557a0de
Michael Benayoun authored 1 year ago
```
Wrap _prepare_4d_causal_attention_mask as a leaf function
```
4557a0de

Fuyu: improve image processing (#27007) · 8a312956

Pablo Montalvo authored 1 year ago


* Fix Fuyu image scaling bug

It could produce negative padding and hence inference errors for certain
image sizes.

* initial rework commit

* add batching capabilities, refactor image processing

* add functional batching for a list of images and texts

* make args explicit

* Fuyu processing update (#27133)

* Add file headers

* Add file headers

* First pass - preprocess method with standard args

* First pass image processor rework

* Small tweaks

* More args and docstrings

* Tidying iterating over batch

* Tidying up

* Modify to have quick tests (for now)

* Fix up

* BatchFeature

* Passing tests

* Add tests for processor

* Sense check when patchifying

* Add some tests

* FuyuBatchFeature

* Post-process box coordinates

* Update to `size` in processor

* Remove unused and duplicate constants

* Store unpadded dims after resize

* Fix up

* Return FuyuBatchFeature

* Get unpadded sizes after resize

* Update exception

* Fix return

* Convert input `<box>` coordinates to model format.

* Post-process point coords, support multiple boxes/points in a single
sequence

* Replace constants

* Update src/transformers/models/fuyu/image_processing_fuyu.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Preprocess List[List[image]]

* Update src/transformers/models/fuyu/image_processing_fuyu.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update to Amy's latest state.

* post-processing returns a list of tensors

* Fix error when target_sizes is None

Co-authored-by: Pablo Montalvo <pablo.montalvo.leroux@gmail.com>

* Update src/transformers/models/fuyu/image_processing_fuyu.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update src/transformers/models/fuyu/image_processing_fuyu.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update src/transformers/models/fuyu/image_processing_fuyu.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update src/transformers/models/fuyu/image_processing_fuyu.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Review comments

* Update src/transformers/models/fuyu/image_processing_fuyu.py

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Fix up

* Fix up

---------

Co-authored-by: Ubuntu <ubuntu@ip-172-31-72-126.ec2.internal>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pablo Montalvo <pablo.montalvo.leroux@gmail.com>

* Fix conflicts in fuyu_follow_up_image_processing (#27228)

fixing conflicts and updating on main

* Revert "Fix conflicts in fuyu_follow_up_image_processing" (#27232)

Revert "Fix conflicts in fuyu_follow_up_image_processing (#27228)"

This reverts commit acce10b6c653dc7041fb9d18cfed55775afd6207.

---------

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-72-126.ec2.internal>

8a312956

[`core` / `Quantization`] Fix for 8bit serialization tests (#27234) · 9b25c164
Younes Belkada authored 1 year ago
```
* fix for 8bit serialization

* added regression tests.

* fixup
```
9b25c164

Reproducible checkpoint for npu (#27208) · c52e429b

Hz, Ji authored 1 year ago

* save NPU's RNG states when saving a checkpoint and set after all the
data skip phase when resuming training.

* re-trigger ci

* re-trigger ci

c52e429b

support bf16 (#25879) · 7adaefe2

Roohollah Etemadi authored 1 year ago

* added bf16 support

* added cuda availability check

* applied make style, quality

7adaefe2

01 Nov, 2023 2 commits

[Whisper, Bart, MBart] Add Flash Attention 2 (#27203) · af3de8d8

Patrick von Platen authored 1 year ago


* add whisper fa2

* correct

* change all

* correct

* correct

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix more

* fix more

* fix more

* fix more

* fix more

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

af3de8d8

Enable split_batches through TrainingArguments (#26798) · 3520e37e

Zach Mueller authored 1 year ago

* Enable split_batches through TrainingArguments

* Extra dispatch_batches

* Keep as default false

* Add to docstring

* Add to docstring

* Remove the capturewarnings change

* Comma

3520e37e