Commits · 9459d821d15575943bcacedcc32835c9459bf39b · zhusg / transformers-new

07 Nov, 2023 7 commits

Remove a redundant variable. (#27288) · 9459d821

Chi authored 1 year ago

* Removed the redundant SiLUActivation class and now use nn.functional.silu directly.

* I apologize for adding torch.functional.silu. I have replaced it with nn.SiLU.

* Remove redundant variable in feature_extraction file

9459d821

[`Whisper`] Add conversion script for the tokenizer (#27338) · 88832c01

Arthur authored 1 year ago

* draft

* updates

* full conversion taken from `https://gist.github.com/xenova/a452a6474428de0182b17605a98631ee`



* psuh

* nits

* updates

* more nits

* Add co author

Co-authored-by: Joshua Lochner <admin@xenova.com>

* fixup

* cleanup

* styling

* add proper path

* update

* nits

* don't  push the exit

* clean

* update whisper doc

* don't error out if tiktoken is not here

* make sure we are BC with conversion

* nit

* Update docs/source/en/model_doc/whisper.md

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* merge and update

* update markdwon

* Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

---------

Co-authored-by: Joshua Lochner <admin@xenova.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

88832c01

[`FA2`] Add flash attention for `GPT-Neo` (#26486) · 0ded2815

Susnato Dhar authored 1 year ago


* added flash attention for gpt-neo

* small change

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* readme updated

* .

* changes

* removed padding_mask

* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

0ded2815

Fix Whisper Conversion Script: Correct decoder_attention_heads and _download function (#26834) · 606d9084

Xabier de Zuazo authored 1 year ago

* Fix error in convert_openai_to_hf.py: "_download() missing 1 required positional argument: root"

* Fix error in convert_openai_to_hf.py: "TypeError: byte indices must be integers or slices, not str"

* Fix decoder_attention_heads value in convert_openai_to_hf.py.

Correct the assignment for `decoder_attention_heads` in the conversion script for the Whisper model.

* Black reformat convert_openai_to_hf.py file.

* Fix Whisper model configuration defaults (for Tiny).

- Correct encoder/decoder layers and attention heads count.
- Update model width (`d_model`) to 384.

* Add docstring to the convert_openai_to_hf.py script with a doctest

* Add shebang and +x permission to the convert_openai_to_hf.py

* convert_openai_to_hf.py: reuse the read model_bytes in the _download() function

* Move convert_openai_to_hf.py doctest example to whisper.md

* whisper.md: Add an inference example to the Conversion section.

* whisper.md: remove `model.config.forced_decoder_ids` from examples (deprecated)

* whisper.md: Remove "## Format Conversion" section; not used by users

* whisper.md: Use librispeech_asr_dummy dataset and load_dataset()

606d9084

Generate: skip tests on unsupported models instead of passing (#27265) · 90b4adc1
Joao Gante authored 1 year ago

90b4adc1
Fix autoawq docker image (#27339) · 26d8d5f2
Younes Belkada authored 1 year ago
```
* Update Dockerfile

* Update docker/transformers-all-latest-gpu/Dockerfile
```
26d8d5f2

[Whisper] Block language/task args for English-only (#27322) · da7ea9a4

Sanchit Gandhi authored 1 year ago


* [Whisper] Block language/task args for English-only

* Update src/transformers/models/whisper/modeling_whisper.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

da7ea9a4

06 Nov, 2023 10 commits

[docs] fixed links with 404 (#27327) · 9beb2737
Maria Khalusova authored 1 year ago
```
* fixed links with 404

* make style
```
9beb2737

Fix `Kosmos2Processor` batch mode (#27323) · 1b20e2bb

Yih-Dar authored 1 year ago


* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

1b20e2bb

Fix VideoMAEforPretrained dtype error (#27296) · a6e0d5a2
Iker García-Ferrero authored 1 year ago
```
* Fix dtype error

* Fix mean and std dtype

* make style
```
a6e0d5a2

Update sequence_classification.md (#27281) · e9dbd392

Akshay Chintalapati authored 1 year ago

I'm adding accelerate as one of the libraries to install because otherwise when running the Trainer, the model errorr out with the error.

ImportError: Using the `Trainer` with `PyTorch` requires `accelerate>=0.20.1`: Please run `pip install transformers[torch]` or `pip install accelerate -U`

Further context:
1. I've tried this across different environments so I believe that the environment is not the issue.
2. I had the latest transformers library version running.
3. Typically even after install accelerate and import it, it wouldn't resolve the issue until I restart the notebook and try again.

e9dbd392

[`PretrainedTokenizer`] add some of the most important functions to the doc (#27313) · 147f7746
Arthur authored 1 year ago

147f7746
enable memory tracker metrics for npu (#27280) · 1ffc4dee
Hz, Ji authored 1 year ago

1ffc4dee
Remove an unexpected argument for FlaxResNetBasicLayerCollection (#27272) · d7dcfa89
Pingzhi Li authored 1 year ago
```
Remove unexpected argument for FlaxResNetBasicLayerCollection
```
d7dcfa89
Update doctest workflow file (#27306) · eef7ea98
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
eef7ea98
Fix daily CI image build (#27307) · d788d37d
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d788d37d
Fix tokenizer export for LLamaTokenizerFast (#27222) · b026b5ca
Mayank Mishra authored 1 year ago
```
* fix tokenizer

* fix tokenizer
```
b026b5ca

03 Nov, 2023 12 commits

translate run_scripts.md to chinese (#27246) · cc3e4781

jiaqiw09 authored 1 year ago

* translate run_scripts.md to chinese

* translate run_scripts.md to chinese

* translate run_scripts.md to chinese

cc3e4781

translate autoclass_tutorial to chinese (#27269) · bf7cfac2
jiaqiw09 authored 1 year ago
```
* translate autoclass_tutorial.md  to chinese

* translate update
```
bf7cfac2

[`FA2`] Add flash attention for for `DistilBert` (#26489) · 1ac2463d

Susnato Dhar authored 1 year ago

* flash attention added for DistilBert

* fixes

* removed padding_masks

* Update modeling_distilbert.py

* Update test_modeling_distilbert.py

* style fix

1ac2463d

[Docs] Model_doc structure/clarity improvements (#26876) · 5964f820

Maria Khalusova authored 1 year ago

* first batch of structure improvements for model_docs

* second batch of structure improvements for model_docs

* more structure improvements for model_docs

* more structure improvements for model_docs

* structure improvements for cv model_docs

* more structural refactoring

* addressed feedback about image processors

5964f820

[`Docs` / `SAM` ] Reflect correct changes to run inference without OOM (#27268) · ad8ff962
Younes Belkada authored 1 year ago
```
Update sam.md
```
ad8ff962
Fix switch transformer mixed precision issue (#27220) · f13f544a
Shiyu Li authored 1 year ago
```
* Fix mixed precision error for switch transformer

* Fixup
```
f13f544a

Update the ConversationalPipeline docstring for chat templates (#27250) · db69bd88

Matt authored 1 year ago

* Update the ConversationalPipeline docstring now that we're using chat templates

* Direct access to conversation.messages

* Explain the string init

db69bd88

[docs] Custom model doc update (#27213) · 011b15c1
Maria Khalusova authored 1 year ago
```
doc update
```
011b15c1

Avoid many failing tests in doctesting (#27262) · af8d1dc3

Yih-Dar authored 1 year ago


* fix

* update

* update

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

af8d1dc3

[`PEFT` / `Tests` ] Fix peft integration failing tests (#27258) · 8f1a43cd
Younes Belkada authored 1 year ago
```
fix peft integration issues
```
8f1a43cd

Refactor: Use Llama RoPE implementation for Falcon (#26933) · 05ea7b79

Tom Aarsen authored 1 year ago

* Use Llama RoPE implementation for Falcon

+ Add copy functionalities

* Use standard cache format for Falcon

* Simplify apply_rotary_pos_emb, copy from Llama

* Remove unnecessary cache conversion test

We don't need to convert any caches anymore!

* Resolve copy complaint

05ea7b79

Fuyu protection (#27248) · e9a6c72b
Lysandre Debut authored 1 year ago

e9a6c72b

02 Nov, 2023 11 commits

Fixed base model class name extraction from PeftModels (#27162) · 552ff244

Komal Kumar authored 1 year ago

* Fixed base model class name extraction from PeftModels

* Changes to first unwrap the model then extract the base model name

* Changed base_model to base_model.model to stay consistent with peft model abstractions

552ff244

Removed the redundant SiLUActivation class. (#27136) · 49912168

Chi authored 1 year ago

* Removed the redundant SiLUActivation class and now use nn.functional.silu directly.

* I apologize for adding torch.functional.silu. I have replaced it with nn.SiLU.

49912168

translate peft.md to chinese (#27215) · 00d8502b

jiaqiw09 authored 1 year ago

* tranlsate peft.md to chinese

* translate peft.md to chinese

* fix missing link

00d8502b

Dev version · bc78fd12
Lysandre authored 1 year ago

bc78fd12

Enrich TTS pipeline parameters naming (#26473) · 0ed6729b

Yoach Lacombe authored 1 year ago


* enrich TTS pipeline docstring for clearer forward_params use

* change token leghts

* update Pipeline parameters

* correct docstring and make style

* fix tests

* make style

* change music prompt

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* raise errors if generate_kwargs with forward-only models

* make style

---------

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

0ed6729b

Remove redundant code from T5 encoder mask creation (#27216) · 147e8ce4
Pietro Lesci authored 1 year ago
```
* remove redundant code

* update

* add typecasting

* make `attention_mask` float again
```
147e8ce4
Generate: return `past_key_values` (#25086) · a6c82d45
Joao Gante authored 1 year ago

a6c82d45
fix-deprecated-exllama-arg (#27243) · 441c3e0d
Marc Sun authored 1 year ago
```
fix-exllama
```
441c3e0d

Fixing m4t. (#27240) · 8801861d

Nicolas Patry authored 1 year ago

* Fixing m4t.

* Trying to remove comparison ? Odd test failure.

* Adding shared. But why on earth does it hang ????

* Putting back the model weights checks the test is silently failing on
cuda.

* Fix style + unremoved comment.

8801861d

Fix safetensors failing tests (#27231) · 443bf5e9

Lysandre Debut authored 1 year ago


* Fix Kosmos2

* Fix ProphetNet

* Fix MarianMT

* Fix M4T

* XLM ProphetNet

* ProphetNet fix

* XLM ProphetNet

* Final M4T fixes

* Tied weights keys

* Revert M4T changes

* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

443bf5e9

Wrap `_prepare_4d_causal_attention_mask` as a leaf function (#27236) · 4557a0de
Michael Benayoun authored 1 year ago
```
Wrap _prepare_4d_causal_attention_mask as a leaf function
```
4557a0de