Commits · 04ca6fe8c434b9dad436d7be18f35c0633c1cb74 · 某某某 / transformers-new

01 Jul, 2022 9 commits

Exclude Databricks from notebook env only if the runtime is below 11.0 · 04ca6fe8
David Heryanto authored 3 years ago

04ca6fe8
higher atol to avoid flaky trainer test failure (#17979) · 664688b9
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
664688b9
Fix FlaxBigBirdEmbeddings (#17842) · 8bb2c387
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8bb2c387

add ONNX support for BLOOM (#17961) · b68d408f

Nouamane Tazi authored 3 years ago


* add onnx support for BLOOM

* use TYPE_CHECKING for type annotations

* fix past_shape for bloom (different from gpt2)

* use logical_or instead of `+` for onnx support

* bigger `atol_for_validation` for larger bloom models

* copied -> taken because it's no longer an exact copy

* remove "copied from" comment

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b68d408f

fixing fsdp autowrap functionality (#17922) · 462b7f3a

Sourab Mangrulkar authored 3 years ago

* fixing fsdp autowrap functionality

* update version and quality

* update torch version to latest stable version

462b7f3a

fix `bias` keyword argument in TFDebertaEmbeddings (#17940) · 3a064bd4
Wissam Antoun authored 3 years ago

3a064bd4
Update expected values in CodeGen tests (#17888) · 569b679a
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
569b679a
Fix typo in perf_train_gpu_one.mdx (#17983) · cb425024
Billy Cao authored 3 years ago

cb425024

skip some gpt_neox tests that require 80G RAM (#17923) · 14fb8a63

Yih-Dar authored 3 years ago


* skip some gpt_neox tests that require 80G RAM

* remove tests

* fix quality

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

14fb8a63

30 Jun, 2022 8 commits

feat: add pipeline registry abstraction (#17905) · 49cd736a

Aaron Pham authored 3 years ago


* feat: add pipeline registry abstraction

- added `PipelineRegistry` abstraction
- updates `add_new_pipeline.mdx` (english docs) to reflect the api addition
- migrate `check_task` and `get_supported_tasks` from
  transformers/pipelines/__init__.py to
  transformers/pipelines/base.py#PipelineRegistry.{check_task,get_supported_tasks}

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* fix: update with upstream/main

chore: Apply suggestions from sgugger's code review

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* chore: PR updates

- revert src/transformers/dependency_versions_table.py from upstream/main
- updates pipeline registry to use global variables

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* tests: add tests for pipeline registry

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* tests: add test for output warning.

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* chore: fmt and cleanup unused imports

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* fix: change imports to top of the file and address comments

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

49cd736a

Add ONNX support for LayoutLMv3 (#17953) · 9cb7cef2

regisss authored 3 years ago

* Add ONNX support for LayoutLMv3

* Update docstrings

* Update empty description in docstring

* Fix imports and type hints

9cb7cef2

skip some ipex tests until it works with torch 1.12 (#17964) · fe140464
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
fe140464

CLI: convert sharded PT models (#17959) · 91e1f24e

Joao Gante authored 3 years ago

* sharded conversion; add flag to control max hidden error

* better hidden name matching

* Add test: load TF from PT shards

* fix test (PT data must be local)

91e1f24e

Fix number of examples for iterable dataset in distributed training (#17951) · f25457b2
Sylvain Gugger authored 3 years ago

f25457b2

[Pipelines] Add revision tag to all default pipelines (#17667) · e4d25885

Patrick von Platen authored 3 years ago


* trigger test failure

* upload revision poc

* Update src/transformers/pipelines/base.py

Co-authored-by: Julien Chaumond <julien@huggingface.co>

* up

* add test

* correct some stuff

* Update src/transformers/pipelines/__init__.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* correct require flag

Co-authored-by: Julien Chaumond <julien@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e4d25885

Unifying training argument type annotations (#17934) · 4f8361af
Jannis Born authored 3 years ago
```
* doc: Unify training arg type annotations

* wip: extracting enum type from Union

* blackening
```
4f8361af
Fix GPT-NeoX-20B past handling, attention computation (#17811) · 205bc415
Jason Phang authored 3 years ago
```
* Fix GPT-NeoX-20B past handling, swap attention computation to hopefully avoid NaN, update docs

* 20B tests
```
205bc415

29 Jun, 2022 22 commits

Flax t5 Encoder (#17784) · 692e61e9

Crystina authored 3 years ago


* first draft adding Flax-t5-encoder and Flax-mt5-encoder

* imports

* after make fixup

* flax t5 encoder test

* black on test

* make fix-copies

* clean

* all_model_classes -> tuple

* clean test

* is_encoder_decoder=False in t5-enc tester

* remove file docstring before FlaxT5Encoder

* black

* isort

* commit suggestions on src/transformers/models/t5/modeling_flax_t5.py

Co-authored-by: Suraj Patil <surajp815@gmail.com>

* commit suggestions on src/transformers/models/t5/modeling_flax_t5.py

Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestions from code review

Co-authored-by: Suraj Patil <surajp815@gmail.com>

* remove _get_encoder_module

* self.decoder_seq_length -> self.encoder_seq_length as t5-enc does not have decoder

* bugfix - self.module_class is class itself, not instance;

* docs for mt5 and t5

* call -> __call__ in t5 doc

* FlaxMT5EncoderModel to TYPE_HINT

* run doc-builder to allow change the files

Co-authored-by: Suraj Patil <surajp815@gmail.com>

692e61e9

Fix #17893, removed dead code (#17917) · eb1493b1

Clémentine Fourrier authored 3 years ago

* Removed dead position_id code, fix #17893

* Removed unused var

* Now ignores removed (dead) dict key for backward comp

eb1493b1

add MobileViT model (#17354) · fbc7598b

Matthijs Hollemans authored 3 years ago


* add MobileViT

* fixup

* Update README.md

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* remove empty line

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* use clearer variable names

* rename to MobileViTTransformerLayer

* no longer inherit from nn.Sequential

* fixup

* fixup

* not sure why this got added twice

* rename organization for checkpoints

* fix it up

* Update src/transformers/models/mobilevit/__init__.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/models/mobilevit/test_modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* code style improvements

* fixup

* Update docs/source/en/model_doc/mobilevit.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/mobilevit.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* download labels from hub

* rename layers

* rename more layers

* don't compute loss in separate function

* remove some nn.Sequential

* replace nn.Sequential with new MobileViTTransformer class

* replace nn.Sequential with MobileViTMobileNetLayer

* fix pruning since model structure changed

* fixup

* fix doc comment

* remove custom resize from feature extractor

* fix ONNX import

* add to doc tests

* use center_crop from image_utils

* move RGB->BGR flipping into image_utils

* fix broken tests

* wrong type hint

* small tweaks

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fbc7598b

Fix prepare_tf_dataset when drop_remainder is not supplied (#17950) · 5feac3d0
Matt authored 3 years ago

5feac3d0
ExplicitEnum subclass str (JSON dump compatible) (#17933) · bc019b0e
Bram Vanroy authored 3 years ago
```
* ExplicitEnum subclass str (JSON dump compatible)

* allow union if one of the types is str
```
bc019b0e
PyTorch 1.12.0 for scheduled CI (#17949) · b089cca3
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b089cca3
OPT - Fix Softmax NaN in half precision mode (#17437) · d444edb3
Younes Belkada authored 3 years ago

d444edb3
Use explicit torch version in deepspeed CI (#17942) · 9fe2403b
Yih-Dar authored 3 years ago
```
* use explicit torch version

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
9fe2403b
fix regexes with escape sequence (#17943) · 4c722e9e
Stas Bekman authored 3 years ago

4c722e9e
Fix all is_torch_tpu_available issues (#17936) · 7c4c6f60
Zachary Mueller authored 3 years ago
```
* Fix all is_torch_tpu_available 
```
7c4c6f60

Fix img seg tests (load checkpoints from `hf-internal-testing`) (#17939) · 77b76672

Mishig Davaadorj authored 3 years ago

* Revert "Skip failing test until they are fixed."

This reverts commit 8f400775.

* Use `tiny-detr` checkpts from `hf-internal-testing`

77b76672

Add MVP model (#17787) · 3cff4cc5

StevenTang1998 authored 3 years ago

* Add MVP model

* Update README

* Remove useless module

* Update docs

* Fix bugs in tokenizer

* Remove useless test

* Remove useless module

* Update vocab

* Remove specifying

* Remove specifying

* Add #Copied ... statement

* Update paper link

* Remove useless TFMvp

* Add #Copied ... statement

* Fix style in test mvp model

* Fix some typos

* Fix properties of unset special tokens in non verbose mode

* Update paper link

* Update MVP doc

* Update MVP doc

* Fix README

* Fix typos in docs

* Update docs

3cff4cc5

Skip failing test until they are fixed. · 8f400775
Sylvain Gugger authored 3 years ago

8f400775
Remove imports and use forward references in ONNX feature (#17926) · 47b91651
Sylvain Gugger authored 3 years ago

47b91651
Fix job links in Slack report (#17892) · 5cdfff5d
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
5cdfff5d

TF implementation of RegNets (#17554) · a7eba831

Aritra Roy Gosthipaty authored 3 years ago


* chore: initial commit

Copied the torch implementation of regnets and porting the code to tf step by step. Also introduced an output layer which was needed for regnets.

* chore: porting the rest of the modules to tensorflow

did not change the documentation yet, yet to try the playground on the model

* Fix initilizations (#1)

* fix: code structure in few cases.

* fix: code structure to align tf models.

* fix: layer naming, bn layer still remains.

* chore: change default epsilon and momentum in bn.

* chore: styling nits.

* fix: cross-loading bn params.

* fix: regnet tf model, integration passing.

* add: tests for TF regnet.

* fix: code quality related issues.

* chore: added rest of the files.

* minor additions..

* fix: repo consistency.

* fix: regnet tf tests.

* chore: reorganize dummy_tf_objects for regnet.

* chore: remove checkpoint var.

* chore: remov unnecessary files.

* chore: run make style.

* Update docs/source/en/model_doc/regnet.mdx

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* chore: PR feedback I.

* fix: pt test. thanks to @ydshieh.

* New adaptive pooler (#3)

* feat: new adaptive pooler

Co-authored-by: @Rocketknight1

* chore: remove image_size argument.

Co-authored-by: matt <rocketknight1@gmail.com>

Co-authored-by: matt <rocketknight1@gmail.com>

* Empty-Commit

* chore: remove image_size comment.

* chore: remove playground_tf.py

* chore: minor changes related to spacing.

* chore: make style.

* Update src/transformers/models/regnet/modeling_tf_regnet.py

Co-authored-by: amyeroberts <aeroberts4444@gmail.com>

* Update src/transformers/models/regnet/modeling_tf_regnet.py

Co-authored-by: amyeroberts <aeroberts4444@gmail.com>

* chore: refactored __init__.

* chore: copied from -> taken from./g

* adaptive pool -> global avg pool, channel check.

* chore: move channel check to stem.

* pr comments - minor refactor and add regnets to doc tests.

* Update src/transformers/models/regnet/modeling_tf_regnet.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* minor fix in the xlayer.

* Empty-Commit

* chore: removed from_pt=True.

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: matt <rocketknight1@gmail.com>
Co-authored-by: amyeroberts <aeroberts4444@gmail.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

a7eba831

TF: XLA beam search + most generation-compatible models are now also... · e6d27ca5

Joao Gante authored 3 years ago

TF: XLA beam search + most generation-compatible models are now also XLA-generate-compatible (#17857)

* working beam search 

* XLA generation compatible with ALL classes

* add xla generation slow test

e6d27ca5

Add missing comment quotes (#17379) · b8142753
Leon Derczynski authored 3 years ago

b8142753
Remove render tags (#17897) · e113c5cb
NielsRogge authored 3 years ago
```
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
e113c5cb
Fix the Conda package build (#16737) · 90415475
Santiago Castro authored 3 years ago
```
* Fix the Conda package build

* Update build.sh

* Update release-conda.yml
```
90415475
Remove DT_DOUBLE from the T5 graph (#17891) · babd7b1a
Michal Szutenberg authored 3 years ago

babd7b1a
Compute min_resolution in prepare_image_inputs (#17915) · 6aae59d0
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6aae59d0

28 Jun, 2022 1 commit

Fixing a regression with `return_all_scores` introduced in #17606 (#17906) · 776855c7

Nicolas Patry authored 3 years ago

Fixing a regression with `return_all_scores` introduced in #17606

- The legacy test actually tested `return_all_scores=False` (the actual
  default) instead of `return_all_scores=True` (the actual weird case).

This commit adds the correct legacy test and fixes it.

Tmp legacy tests.

Actually fix the regression (also contains lists)

Less diffed code.

776855c7