Commits · v4.23-release · zhusg / transformers-new

11 Oct, 2022 2 commits

Release: v4.23.1 · bd469c40
Sylvain Gugger authored 2 years ago

v4.23.1

bd469c40

Fix whisper for `pipeline` (#19482) · c8bc0a0b

Arthur authored 2 years ago

* update feature extractor params

* update attention mask handling

* fix doc and pipeline test

* add warning when skipping test

* add whisper translation and transcription test

* fix build doc test

c8bc0a0b

10 Oct, 2022 28 commits

Release: v4.23.0 · 9ae22fe3
Lysandre authored 2 years ago

v4.23.0

9ae22fe3
wrap forward passes with torch.no_grad() (#19412) · df2f2812
Partho authored 2 years ago

df2f2812
wrap forward passes with torch.no_grad() (#19413) · 5f5e264a
Partho authored 2 years ago

5f5e264a
wrap forward passes with torch.no_grad() (#19414) · c6a928ca
Partho authored 2 years ago

c6a928ca
wrap forward passes with torch.no_grad() (#19416) · d739a707
Partho authored 2 years ago

d739a707
wrap forward passes with torch.no_grad() (#19438) · 870a9542
Partho authored 2 years ago

870a9542
wrap forward passes with torch.no_grad() (#19439) · 692c5be7
Partho authored 2 years ago

692c5be7

fix (#19469) · a7bc4221

Yih-Dar authored 2 years ago


Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

a7bc4221

Fixed a non-working hyperlink in the README.md file (#19434) · 25cfd911

Mikail Duzenli authored 2 years ago

* Fixed a non-working hyperlink in the README.md file

The hyperlink to the community notebooks was outdated.

* Fixing missing double slash in hyperlink

25cfd911

Fix misspelled word in docstring (#19415) · 9df953a8
Bartosz Szmelczynski authored 2 years ago

9df953a8
Generate: corrected exponential_decay_length_penalty type hint (#19376) · d866b485
Shivang Mishra authored 2 years ago

d866b485

Fix momentum and epsilon values (#19454) · 4dd784c3

amyeroberts authored 2 years ago

The momentum value for PyTorch and TensorFlow batch normalization layers is not equivalent. The TensorFlow value should be (1 - pytorch_momentum) in order to ensure the correct updates are applied to the running mean and running variance calculations. We wouldn't observe a difference loading a pretrained model and performing inference, but evaluation outputs would change after some training steps.

4dd784c3

Add Italian translation for `add_new_model.mdx` (#18713) · b0b962cc
Stefano Bosisio authored 2 years ago
```
* fix conflicts

* start translating

* proof check

* add toc

* fix errors and typos
```
b0b962cc
Fix the error message in run_t5_mlm_flax.py (#19282) · e150c4e2
Kaiyu Yang authored 2 years ago

e150c4e2

Add TF whisper (#19378) · e3f028f3

amyeroberts authored 2 years ago


* simplify loop

* add featur extractor

* add model

* start conversion

* add dropout

* initial commit of test files

* copnversion for all models

* update processor for correct padding

* update feature extraction

* update integration test logits match

* fmnt: off for the logits

* on the fly mel bank

* small nit

* update test

* update tokenizer

* nit feature extraction

* update

* update tokenizer test

* adds logit processor and update tokenizer to get supress tokens

* style

* clean convert

* revert to original modeling tf utils

* Update

* update

* nit

* clean convert file

* update tests and nits

* quality

* slow generation test

* ffn_dim to allow customization

* update readme

* add to toctreee

* start fixing integration tests

* update tests and code

* fix feature extractor

* fix config tests common

* update code to fix tests

* fix feature exctractor

* nit feature extraction

* update test for new feature extractor

* style

* add absrtact

* large logits wioth custom decoder input ids

* wraap around is otrch available

* fix feature extractor

* correct logits for whisper small.en

* nit

* fix encoder_attentino_mask

* some fixes

* remove unnecessary inputs

* nits

* add normalizer file

* update etst tokenization

* fix attention mask not defined

* fix generate

* remove uncoder attention mask useless

* update test modeling whisper

* update condfig to add second non supress tokens

* nits on feature exrtactor

* nit for test tokenizers

* update etsts

* update tests

* update tokenization test

* fixup

* invalidated hf token. Clean convert openai to whisper

* fix logit tests

* fixup

* Add model to README

* Fix doc tests

* clean merge

* revert toc_tree changes

* remove useless LogitProcessor

* Update whisper .mdx

* update config file doc

* update configuration docstring

* update test tokenization

* update test tokenization

* update tokenization whisper
Added copied from where needed

* update feature extraction

* nit test name

* style

* quality

* remove get suppress tokens and update non_speech tokens global variables

* Update src/transformers/models/whisper/feature_extraction_whisper.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* clean modeling whisper and test
Removed the attention mask arguments that are deprecated

* fix large test

* Add multilingual audio test, and translate test

* style

* fix larg multilingual test

* nits

* add copied from for attention layer

* remove attention masks in doc

* add english normalizer

* Update docs/source/en/model_doc/whisper.mdx

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* update tokenization test

* remove copied from in whisper attention : no bias in k_proj only

* wrap around dependencies in english normalizer

* style

* correct import generation logits

* for now, wrap feature extractor with torch

* remove torch depencies for feature extraction and style

* Update src/transformers/models/whisper/convert_openai_whisper_to_tfms.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/whisper/configuration_whisper.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/whisper.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fixup

* nit

* update logitds

* style

* nit

* nits and fix final tests

* add `is_more_itertools_available` to utils

* quality

* add begin supress tokens, supress tokens to generate args and config

* clean supressTokensLogitProcessor in generation logits

* Nit naming

* add supressTokensAtBegin

* udpate tests, supress tokens to None or correct values

* nit and style

* update RAG to fit test and generate_logit

* add copy pasted statment on english normalizer

* add arguments to config_common_kwargs

* Update src/transformers/generation_utils.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/generation_logits_process.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* revert changes based on reviews

* update doc and nits

* Update src/transformers/models/whisper/configuration_whisper.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* more nits

* last nits

* update test configuration common

* add BART name in decoder attention mask documentation

* Update src/transformers/models/whisper/modeling_whisper.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* style

* nit

* nit

* add english.json file to git

* nits on documentation

* nit

* nits

* last styling

* add main toctree file

* remove sentence piece dependency

* clean init file

* fix tokenizer that has no dependencies on sentencepiece

* update whisper init file, nit

* remove english.json file

* add get decoder prompt id

* All weights loading

* Remove hanging pdb

* Fixup and tidy up

* Use same copied from as PT model

* Remove whitespace changes

* Remove torch references

* Tie embeddings

* Remove logits processor input to generate

* Update logit values

* revert changes and add forced logit processor

* nit

* clean normalizer

* remove protected

* Add logit processors and update generation code & tests

* Some tidy up

* Update docstring

* update

* update based on review

* Update src/transformers/models/whisper/configuration_whisper.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/whisper/configuration_whisper.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update to reflect changes on the PT model branch

* Tidy up

* Remove extra whitespace

* Fix test - make input ids small enough we can append

* Include upstream changes on main

* PR comments - add batch tests, remove comments & defaults

* Fix model output imports

* Update src/transformers/models/whisper/modeling_tf_whisper.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation_tf_logits_process.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update tests/models/whisper/test_modeling_tf_whisper.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update docstring example

* Update src/transformers/models/whisper/modeling_tf_whisper.py

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Remove changes to adjust_logits_during_generation function

* Update src/transformers/models/whisper/modeling_tf_whisper.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Tidy up imports that don't require TF

* Update tests - skip and no more skip

* Update tests/generation/test_generation_tf_logits_process.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/models/whisper/modeling_tf_whisper.py

* Update src/transformers/models/whisper/modeling_tf_whisper.py

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

* Add training flags

* Add (skipped) XLA generation tests

* Add embedding correctness test

* Add constant ids for generation tests

* Make logits finding a bit tidier

* Remove unused args

* xla generation enabled

* Don't skip XLA tests anymore

* Fix tests - add position ids to expected signature and update rag generation

* Undo method reorder

* Remove added whitespace

* Remove copy-paste gradient checkopint ref

* Remove

* Trigger CI - (issue with refs when pulling)

Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: NielsRogge <niels.rogge1@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Joao Gante <joao@huggingface.co>

e3f028f3

Add `OPTForQuestionAnswering` (#19402) · af69360b

APAVOU Clément authored 2 years ago

* Add `OPTForQuestionAnswering`

- added `OPTForQuestionAnswering` class based on `BloomForQuestionAnswering`
- added `OPTForQuestionAnswering` in common tests
- all common tests pass
- make fixup done

* added docstrings for OPTForQuestionAnswering

* Fix docstrings for OPTForQuestionAnswering

af69360b

fix: renamed variable name (#18850) · ba71bf4c

Aritra Roy Gosthipaty authored 2 years ago

The sequence_masked variable is actually the part of the sequence that is kept unmasked for the encoder. This commit renames the variable.

ba71bf4c

Remove dependency of Roberta in Blenderbot (#19411) · 4824741c

Ryan Chan authored 2 years ago

* Remove dependency of Roberta in Blenderbot

* Move Copied from statements to each method of the Roberta classes

* Remove copied from line for mask_token.setter

* update output from example in docs

4824741c

Add onnx support for VisionEncoderDecoder (#19254) · 3080bb47

Mohit Sharma authored 2 years ago


* Add onnx support for VisionEncoderDecoder

* Add onnx support for VisionEncoderDecoder

* Removed unused import

* Rename encoder hidden state

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Update docstrings and removed redundant code

* Added test function for enc-dec models

* Update doc string text

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* fixed code style

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

3080bb47

Stop relying on huggingface_hub's private methods (#19392) · 298f6a98
Lysandre Debut authored 2 years ago
```
* Leverage hfh for move cache

* Style
```
298f6a98
Fix typo in image-classification/README.md (#19424) · 7d5ce680
wei zhao authored 2 years ago
```
Fix link typo of the following content.
PyTorch version, Trainer
PyTorch version, no Trainer
```
7d5ce680

fix marianMT convertion to onnx (#19287) · c523a869

Rak Alexey authored 2 years ago


* fix marianMT convertion to onnx

* Update src/transformers/onnx/convert.py

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Update src/transformers/onnx/convert.py

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

c523a869

Fixed duplicated line (paragraph #83) Documentation: @sgugger (#19436) · 34107057
Darío Hereñú authored 2 years ago
```
* Fixed duplicated line (paragraph #83) @omarespejel @sgugger

* Datasets map denomination fixed (paragraph 42)
```
34107057
Backtick fixed (paragraph 68) (#19440) · 83dc49b6
Darío Hereñú authored 2 years ago

83dc49b6

remove RobertaConfig inheritance from MarkupLMConfig (#19404) · 1241a499

Druhin Abrol authored 2 years ago


* remove RobertaConfig inheritance from MarkupLMConfig

* Update src/transformers/models/markuplm/configuration_markuplm.py

fixed typo in docstring

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

1241a499

Fix repo names for ESM tests (#19451) · 4107445a
Matt authored 2 years ago

4107445a
Skip `BloomEmbeddingTest.test_embeddings` for PyTorch < 1.10 (#19261) · cbb8a379
Yih-Dar authored 2 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
cbb8a379
Fix `ViTMSNForImageClassification` doctest (#19275) · 8b6bba54
Yih-Dar authored 2 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8b6bba54

08 Oct, 2022 1 commit
- Remove ref to is_pipeline_test · d92e22d1
  Sylvain Gugger authored 2 years ago
  
  d92e22d1
07 Oct, 2022 9 commits

Rework pipeline tests (#19366) · 9ac586b3

Sylvain Gugger authored 2 years ago

* Rework pipeline tests

* Try to fix Flax tests

* Try to put it before

* Use a new decorator instead

* Remove ignore marker since it doesn't work

* Filter pipeline tests

* Woopsie

* Use the fitlered list

* Clean up and fake modif

* Remove init

* Revert fake modif

9ac586b3

Improve and fix ImageSegmentationPipeline (#19367) · 983451a1

Alara Dirik authored 2 years ago

- Fixes the image segmentation pipeline test failures caused by changes to the postprocessing methods of supported models
- Updates the ImageSegmentationPipeline tests
- Improves docs, adds 'task' argument to optionally perform semantic, instance or panoptic segmentation

983451a1

Removed Bert dependency from BertGeneration code base. (#19370) · de4d71ea

Vishwas authored 2 years ago


* Copied all the code required from transformers.models.bert.modeling_bert to here

* Fixed styling issues

* Reformatted copied names with Model specific name.

* Reverted BertEncoder part as there is already a class called BertGenerationEncoder

* Added prefixes in missing places.

Co-authored-by: vishwaspai <vishwas.pai@emplay.net>

de4d71ea

Make `Camembert` TF version independent from `Roberta` (#19364) · 34e0cc6d

mustapha ajeghrir authored 2 years ago


* camembert tf version independent

* fixup

* fixup, all working

* remove comments

* Adding copied from roberta

Co-authored-by: Mustapha AJEGHRIR <mustapha.ajeghrir@kleegroup.com>

34e0cc6d

Removed `Bert` interdependency in `tokenization_electra.py` (#19356) · 7418a48e

Blip blop authored 2 years ago


* Copied from BertTokenizer() in tokenization_bert

* Added BasicTokenizer and WordPieceTokenizer Class

* Update src/transformers/models/electra/tokenization_electra.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Added copied from comments for basicTokenizer and WordPieceTokenizer

* Updated the comments for the tokenizerClasses

* Update src/transformers/models/electra/tokenization_electra.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/electra/tokenization_electra.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Formatted tokenization_electra with `make style`

* Fix repo inconsistencies

* Update src/transformers/models/electra/tokenization_electra.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Set the logger

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7418a48e

Remove Dependency between Bart and LED (slow/fast) (#19408) · 6ef16f2b

Infrared1029 authored 2 years ago

* removed dependency from bart(slow)

* removed dependency from bart(slow)

* adding copying comments (copied from bart to led)

* updated led docstring

* updated led docstring

* removed dependency from Bart (fast)

* replaced bart with LED in docstrings

* complying flake8

* added more copy comments

* fixing copying comments

* added comments back

* fix copy comments

* fixing copied from comments

* fixing copied from comments

6ef16f2b

Clip device map (#19409) · 06514b3e
Patrick von Platen authored 2 years ago
```
* add first generation tutorial

* uP

* [Clip] Add text model to device map
```
06514b3e
Removed Bert and XML Dependency from Herbert (#19410) · c2b83d54
harry7337 authored 2 years ago
```
Co-authored-by: harry7337 <hari.8jan@gmail.com>
```
c2b83d54

Remove dependency of Bert from Squeezebert tokenizer (#19403) · e6fc2016

Ryan Chan authored 2 years ago

* Remove dependency of Bert from Squeezebert tokenizer

* run style corrections

* update copies from BertTokenizers

* Update changes and style to Squeezebert files

* update copies for bert-fast

e6fc2016