Commits · run_tiny_with_fix_tiny_model_creation · zhusg / transformers-new

09 Nov, 2023 4 commits

fix · e68eefcc
ydshieh authored 1 year ago

e68eefcc
fix · f84c122c
ydshieh authored 1 year ago

f84c122c

[`CodeLlamaTokenizer`] Nit, update __init__ to make sure the AddedTokens are... · 085ea7e5

Arthur authored 1 year ago

[`CodeLlamaTokenizer`] Nit, update __init__ to make sure the AddedTokens are not normalized because they are special (#27359)

* make sure tokens are properly initialized for codellama slow

* add m ore pretrained models

* style

* test more tokenizers checkpoints

085ea7e5

Smangrul/fix failing ds ci tests (#27358) · 7ecd229b

Sourab Mangrulkar authored 1 year ago

* fix failing DeepSpeed CI tests due to `safetensors` being default

* debug

* remove debug statements

* resolve comments

* Update test_deepspeed.py

7ecd229b

08 Nov, 2023 13 commits

translate debugging.md to chinese (#27374) · ced9fd86
jiaqiw09 authored 1 year ago
```
* update

* update
```
ced9fd86
Update deprecated `torch.range` in `test_modeling_ibert.py` (#27355) · 0e402e14
Sergii Dymchenko authored 1 year ago
```
* Update deprecated torch.range

* Remove comment
```
0e402e14

Add Flash Attention 2 support to Bark (#27364) · a5bee89c

Yoach Lacombe authored 1 year ago


* change handmade attention mask to _prepare_4d_attention_mask

* add flashattention2 support in Bark

* add flashattention2 tests on BarkSemanticModel

* make style

* fix flashattention and tests + make style

* fix memory leak and allow Bark to pass flash attention to sub-models

* make style

* Apply suggestions from code review

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove unecessary code from tests + justify overriding

* Update tests/models/bark/test_modeling_bark.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* make style

---------

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

a5bee89c

translate big_models.md and performance.md to chinese (#27334) · ef716736

jiaqiw09 authored 1 year ago

* translate performance.md

* tranlsate performance.md and big_models.md

* update translation

* update review

ef716736

Fix tiny model script: not using `from_pt=True` (#27372) · bd8f45b1
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
bd8f45b1
[Flax Whisper] large-v3 compatibility (#27360) · 7b175cfa
Sanchit Gandhi authored 1 year ago

7b175cfa
Remove unused param from example script tests (#27354) · 845aa832
Zach Mueller authored 1 year ago
```
Unused param
```
845aa832

Translate index.md to Turkish (#27093) · eb30a49b

Mert Yanık authored 1 year ago


* Add index.md for tukish language

* Fix index.md (huggingface/transformers#27088)

* Add 'tr' to additional files

* Update docs/source/tr/_toctree.yml

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update index.md

---------

Co-authored-by: Mert Yanık <mert.yanik@lcwaikiki.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

eb30a49b

MusicGen Update (#27084) · f16ff0f0

Sanchit Gandhi authored 1 year ago

* [MusicGen] Add stereo model

* safe serialization

* Update src/transformers/models/musicgen/modeling_musicgen.py

* split over 2 lines

* fix slow tests on cuda

f16ff0f0

Fix `Kosmos-2` device issue (#27346) · 5ef650b0

Yih-Dar authored 1 year ago


* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5ef650b0

Fix example tests from failing (#27353) · efa57cb2
Zach Mueller authored 1 year ago
```
* Fix example tests from failing

* CHange thresh
```
efa57cb2
moving example of benchmarking to legacy dir (#27337) · b6dbfee0
Hz, Ji authored 1 year ago
```
move example of benchmarking to legacy
```
b6dbfee0

Add numpy alternative to FE using torchaudio (#26339) · be74b2ea

Yoach Lacombe authored 1 year ago

* add audio_utils usage in the FE of SpeechToText

* clean unecessary parameters of AudioSpectrogramTransformer FE

* add audio_utils usage in AST

* add serialization tests and function to FEs

* make style

* remove use_torchaudio and move to_dict to FE

* test audio_utils usage

* make style and fix import (remove torchaudio dependency import)

* fix torch dependency for jax and tensor tests

* fix typo

* clean tests with suggestions

* add lines to test if is_speech_availble is False

be74b2ea

07 Nov, 2023 14 commits

translate model_sharing.md and llm_tutorial.md to chinese (#27283) · e2647450

jiaqiw09 authored 1 year ago

* translate model_sharing.md

* translate llm_tutorial.md to chiense

* update wrong translation

* update _torctree.yml

* update typos

* update

e2647450

translate the en tokenizer_summary.md to Chinese (#27291) · f213d5dd
九是否随意的称呼 authored 1 year ago
```
* translate the en tokenizer_summary.md to Chinese

* revise WordPiece

* add to source/zh/_toctree.yml
```
f213d5dd

Allow scheduler parameters (#26480) · 7e1eff76

Plemeur authored 1 year ago


* Allow for scheduler kwargs

* Formatting

* Arguments checks, passing the tests

* Black failed somehow

---------

Co-authored-by: Pierre <pierre@avatarin.com>

7e1eff76

FIx Bark batching feature (#27271) · ac5d4cf6
Yoach Lacombe authored 1 year ago
```
* fix bark batching

* make style

* add tests and make style
```
ac5d4cf6
[`Whisper`] Nit converting the tokenizer (#27349) · 8f840edd
Arthur authored 1 year ago
```
* `nospeech` instead of `nocaption` for the no speech token

* oups
```
8f840edd
Remove padding_masks from `gpt_bigcode`. (#27348) · cc9f27bb
Susnato Dhar authored 1 year ago
```
Update modeling_gpt_bigcode.py
```
cc9f27bb

Resolve AttributeError by utilizing device calculation at the start of the... · 8c91f15a

Folco Bertini Baldassini authored 1 year ago

Resolve AttributeError by utilizing device calculation at the start of the forward function (#27347)

This commit addresses the 'NoneType' object AttributeError within the IdeficsModel forward function. Previously, the 'device' attribute was accessed directly from input_ids, resulting in a potential 'NoneType' error. Now, the device is properly calculated at the beginning of the forward function and utilized consistently throughout, ensuring the 'image_hidden_states' are derived from the correct device. This modification enables smoother processing and compatibility, ensuring the correct device attribution for 'image_encoder_embeddings' in the IdeficsModel forward pass.

8c91f15a

Remove a redundant variable. (#27288) · 9459d821

Chi authored 1 year ago

* Removed the redundant SiLUActivation class and now use nn.functional.silu directly.

* I apologize for adding torch.functional.silu. I have replaced it with nn.SiLU.

* Remove redundant variable in feature_extraction file

9459d821

[`Whisper`] Add conversion script for the tokenizer (#27338) · 88832c01

Arthur authored 1 year ago

* draft

* updates

* full conversion taken from `https://gist.github.com/xenova/a452a6474428de0182b17605a98631ee`



* psuh

* nits

* updates

* more nits

* Add co author

Co-authored-by: Joshua Lochner <admin@xenova.com>

* fixup

* cleanup

* styling

* add proper path

* update

* nits

* don't  push the exit

* clean

* update whisper doc

* don't error out if tiktoken is not here

* make sure we are BC with conversion

* nit

* Update docs/source/en/model_doc/whisper.md

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* merge and update

* update markdwon

* Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

---------

Co-authored-by: Joshua Lochner <admin@xenova.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

88832c01

[`FA2`] Add flash attention for `GPT-Neo` (#26486) · 0ded2815

Susnato Dhar authored 1 year ago


* added flash attention for gpt-neo

* small change

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* readme updated

* .

* changes

* removed padding_mask

* Update src/transformers/models/gpt_neo/modeling_gpt_neo.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

0ded2815

Fix Whisper Conversion Script: Correct decoder_attention_heads and _download function (#26834) · 606d9084

Xabier de Zuazo authored 1 year ago

* Fix error in convert_openai_to_hf.py: "_download() missing 1 required positional argument: root"

* Fix error in convert_openai_to_hf.py: "TypeError: byte indices must be integers or slices, not str"

* Fix decoder_attention_heads value in convert_openai_to_hf.py.

Correct the assignment for `decoder_attention_heads` in the conversion script for the Whisper model.

* Black reformat convert_openai_to_hf.py file.

* Fix Whisper model configuration defaults (for Tiny).

- Correct encoder/decoder layers and attention heads count.
- Update model width (`d_model`) to 384.

* Add docstring to the convert_openai_to_hf.py script with a doctest

* Add shebang and +x permission to the convert_openai_to_hf.py

* convert_openai_to_hf.py: reuse the read model_bytes in the _download() function

* Move convert_openai_to_hf.py doctest example to whisper.md

* whisper.md: Add an inference example to the Conversion section.

* whisper.md: remove `model.config.forced_decoder_ids` from examples (deprecated)

* whisper.md: Remove "## Format Conversion" section; not used by users

* whisper.md: Use librispeech_asr_dummy dataset and load_dataset()

606d9084

Generate: skip tests on unsupported models instead of passing (#27265) · 90b4adc1
Joao Gante authored 1 year ago

90b4adc1
Fix autoawq docker image (#27339) · 26d8d5f2
Younes Belkada authored 1 year ago
```
* Update Dockerfile

* Update docker/transformers-all-latest-gpu/Dockerfile
```
26d8d5f2

[Whisper] Block language/task args for English-only (#27322) · da7ea9a4

Sanchit Gandhi authored 1 year ago


* [Whisper] Block language/task args for English-only

* Update src/transformers/models/whisper/modeling_whisper.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

da7ea9a4

06 Nov, 2023 9 commits

[docs] fixed links with 404 (#27327) · 9beb2737
Maria Khalusova authored 1 year ago
```
* fixed links with 404

* make style
```
9beb2737

Fix `Kosmos2Processor` batch mode (#27323) · 1b20e2bb

Yih-Dar authored 1 year ago


* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

1b20e2bb

Fix VideoMAEforPretrained dtype error (#27296) · a6e0d5a2
Iker García-Ferrero authored 1 year ago
```
* Fix dtype error

* Fix mean and std dtype

* make style
```
a6e0d5a2

Update sequence_classification.md (#27281) · e9dbd392

Akshay Chintalapati authored 1 year ago

I'm adding accelerate as one of the libraries to install because otherwise when running the Trainer, the model errorr out with the error.

ImportError: Using the `Trainer` with `PyTorch` requires `accelerate>=0.20.1`: Please run `pip install transformers[torch]` or `pip install accelerate -U`

Further context:
1. I've tried this across different environments so I believe that the environment is not the issue.
2. I had the latest transformers library version running.
3. Typically even after install accelerate and import it, it wouldn't resolve the issue until I restart the notebook and try again.

e9dbd392

[`PretrainedTokenizer`] add some of the most important functions to the doc (#27313) · 147f7746
Arthur authored 1 year ago

147f7746
enable memory tracker metrics for npu (#27280) · 1ffc4dee
Hz, Ji authored 1 year ago

1ffc4dee
Remove an unexpected argument for FlaxResNetBasicLayerCollection (#27272) · d7dcfa89
Pingzhi Li authored 1 year ago
```
Remove unexpected argument for FlaxResNetBasicLayerCollection
```
d7dcfa89
Update doctest workflow file (#27306) · eef7ea98
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
eef7ea98
Fix daily CI image build (#27307) · d788d37d
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
d788d37d