Commits · 4f8361afe7b411ae2956d59a761264eef8db6ad8 · 某某某 / transformers-new

30 Jun, 2022 2 commits
- Unifying training argument type annotations (#17934) · 4f8361af
  Jannis Born authored 3 years ago
```
* doc: Unify training arg type annotations

* wip: extracting enum type from Union

* blackening
```
  4f8361af
- Fix GPT-NeoX-20B past handling, attention computation (#17811) · 205bc415
  Jason Phang authored 3 years ago
```
* Fix GPT-NeoX-20B past handling, swap attention computation to hopefully avoid NaN, update docs

* 20B tests
```
  205bc415
29 Jun, 2022 22 commits

Crystina authored 3 years ago


* first draft adding Flax-t5-encoder and Flax-mt5-encoder

* imports

* after make fixup

* flax t5 encoder test

* black on test

* make fix-copies

* clean

* all_model_classes -> tuple

* clean test

* is_encoder_decoder=False in t5-enc tester

* remove file docstring before FlaxT5Encoder

* black

* isort

* commit suggestions on src/transformers/models/t5/modeling_flax_t5.py

Co-authored-by: Suraj Patil <surajp815@gmail.com>

* commit suggestions on src/transformers/models/t5/modeling_flax_t5.py

Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestions from code review

Co-authored-by: Suraj Patil <surajp815@gmail.com>

* remove _get_encoder_module

* self.decoder_seq_length -> self.encoder_seq_length as t5-enc does not have decoder

* bugfix - self.module_class is class itself, not instance;

* docs for mt5 and t5

* call -> __call__ in t5 doc

* FlaxMT5EncoderModel to TYPE_HINT

* run doc-builder to allow change the files

Co-authored-by: Suraj Patil <surajp815@gmail.com>

692e61e9

Fix #17893, removed dead code (#17917) · eb1493b1

Clémentine Fourrier authored 3 years ago

* Removed dead position_id code, fix #17893

* Removed unused var

* Now ignores removed (dead) dict key for backward comp

eb1493b1

add MobileViT model (#17354) · fbc7598b

Matthijs Hollemans authored 3 years ago


* add MobileViT

* fixup

* Update README.md

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* remove empty line

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* use clearer variable names

* rename to MobileViTTransformerLayer

* no longer inherit from nn.Sequential

* fixup

* fixup

* not sure why this got added twice

* rename organization for checkpoints

* fix it up

* Update src/transformers/models/mobilevit/__init__.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/models/mobilevit/test_modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/mobilevit/modeling_mobilevit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* code style improvements

* fixup

* Update docs/source/en/model_doc/mobilevit.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/mobilevit.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/mobilevit/configuration_mobilevit.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* download labels from hub

* rename layers

* rename more layers

* don't compute loss in separate function

* remove some nn.Sequential

* replace nn.Sequential with new MobileViTTransformer class

* replace nn.Sequential with MobileViTMobileNetLayer

* fix pruning since model structure changed

* fixup

* fix doc comment

* remove custom resize from feature extractor

* fix ONNX import

* add to doc tests

* use center_crop from image_utils

* move RGB->BGR flipping into image_utils

* fix broken tests

* wrong type hint

* small tweaks

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

fbc7598b

Fix prepare_tf_dataset when drop_remainder is not supplied (#17950) · 5feac3d0
Matt authored 3 years ago

5feac3d0
ExplicitEnum subclass str (JSON dump compatible) (#17933) · bc019b0e
Bram Vanroy authored 3 years ago
```
* ExplicitEnum subclass str (JSON dump compatible)

* allow union if one of the types is str
```
bc019b0e
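
The "JSON dump compatible" note in the commit above can be illustrated with the standard library alone. This is a minimal sketch, not the repository's code: `PlainStrategy`, `StrStrategy`, and `try_dump` are hypothetical names made up for the illustration. It assumes the point of the change is that an enum which also subclasses `str` is serialized by `json.dumps` as its string value, while a plain `Enum` member is rejected.

```python
import json
from enum import Enum


class PlainStrategy(Enum):
    # Hypothetical plain enum: members are not instances of str.
    STEPS = "steps"
    EPOCH = "epoch"


class StrStrategy(str, Enum):
    # Hypothetical enum that also subclasses str: members are real strings.
    STEPS = "steps"
    EPOCH = "epoch"


def try_dump(value):
    """Return the JSON text for {"strategy": value}, or the error message."""
    try:
        return json.dumps({"strategy": value})
    except TypeError as err:
        return f"TypeError: {err}"


plain_result = try_dump(PlainStrategy.STEPS)  # fails: not JSON serializable
str_result = try_dump(StrStrategy.STEPS)      # succeeds: dumped as a string

print(plain_result)
print(str_result)
```

Because the members of the `str`-based enum compare equal to their values (`StrStrategy.STEPS == "steps"`), a config holding them can be dumped without a custom encoder.
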
PyTorch 1.12.0 for scheduled CI (#17949) · b089cca3
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b089cca3
OPT - Fix Softmax NaN in half precision mode (#17437) · d444edb3
Younes Belkada authored 3 years ago

d444edb3
Use explicit torch version in deepspeed CI (#17942) · 9fe2403b
Yih-Dar authored 3 years ago
```
* use explicit torch version

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
9fe2403b
fix regexes with escape sequence (#17943) · 4c722e9e
Stas Bekman authored 3 years ago

4c722e9e
Fix all is_torch_tpu_available issues (#17936) · 7c4c6f60
Zachary Mueller authored 3 years ago
```
* Fix all is_torch_tpu_available 
```
7c4c6f60

Fix img seg tests (load checkpoints from `hf-internal-testing`) (#17939) · 77b76672

Mishig Davaadorj authored 3 years ago

* Revert "Skip failing test until they are fixed."

This reverts commit 8f400775.

* Use `tiny-detr` checkpts from `hf-internal-testing`

77b76672

Add MVP model (#17787) · 3cff4cc5

StevenTang1998 authored 3 years ago

* Add MVP model

* Update README

* Remove useless module

* Update docs

* Fix bugs in tokenizer

* Remove useless test

* Remove useless module

* Update vocab

* Remove specifying

* Remove specifying

* Add #Copied ... statement

* Update paper link

* Remove useless TFMvp

* Add #Copied ... statement

* Fix style in test mvp model

* Fix some typos

* Fix properties of unset special tokens in non verbose mode

* Update paper link

* Update MVP doc

* Update MVP doc

* Fix README

* Fix typos in docs

* Update docs

3cff4cc5

Skip failing test until they are fixed. · 8f400775
Sylvain Gugger authored 3 years ago

8f400775
Remove imports and use forward references in ONNX feature (#17926) · 47b91651
Sylvain Gugger authored 3 years ago

47b91651
Fix job links in Slack report (#17892) · 5cdfff5d
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
5cdfff5d

TF implementation of RegNets (#17554) · a7eba831

Aritra Roy Gosthipaty authored 3 years ago


* chore: initial commit

Copied the torch implementation of regnets and porting the code to tf step by step. Also introduced an output layer which was needed for regnets.

* chore: porting the rest of the modules to tensorflow

did not change the documentation yet, yet to try the playground on the model

* Fix initializations (#1)

* fix: code structure in few cases.

* fix: code structure to align tf models.

* fix: layer naming, bn layer still remains.

* chore: change default epsilon and momentum in bn.

* chore: styling nits.

* fix: cross-loading bn params.

* fix: regnet tf model, integration passing.

* add: tests for TF regnet.

* fix: code quality related issues.

* chore: added rest of the files.

* minor additions..

* fix: repo consistency.

* fix: regnet tf tests.

* chore: reorganize dummy_tf_objects for regnet.

* chore: remove checkpoint var.

* chore: remove unnecessary files.

* chore: run make style.

* Update docs/source/en/model_doc/regnet.mdx

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* chore: PR feedback I.

* fix: pt test. thanks to @ydshieh.

* New adaptive pooler (#3)

* feat: new adaptive pooler

Co-authored-by: @Rocketknight1

* chore: remove image_size argument.

Co-authored-by: matt <rocketknight1@gmail.com>

Co-authored-by: matt <rocketknight1@gmail.com>

* Empty-Commit

* chore: remove image_size comment.

* chore: remove playground_tf.py

* chore: minor changes related to spacing.

* chore: make style.

* Update src/transformers/models/regnet/modeling_tf_regnet.py

Co-authored-by: amyeroberts <aeroberts4444@gmail.com>

* Update src/transformers/models/regnet/modeling_tf_regnet.py

Co-authored-by: amyeroberts <aeroberts4444@gmail.com>

* chore: refactored __init__.

* chore: copied from -> taken from./g

* adaptive pool -> global avg pool, channel check.

* chore: move channel check to stem.

* pr comments - minor refactor and add regnets to doc tests.

* Update src/transformers/models/regnet/modeling_tf_regnet.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* minor fix in the xlayer.

* Empty-Commit

* chore: removed from_pt=True.

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: matt <rocketknight1@gmail.com>
Co-authored-by: amyeroberts <aeroberts4444@gmail.com>
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

a7eba831

TF: XLA beam search + most generation-compatible models are now also... · e6d27ca5

Joao Gante authored 3 years ago

TF: XLA beam search + most generation-compatible models are now also XLA-generate-compatible (#17857)

* working beam search 

* XLA generation compatible with ALL classes

* add xla generation slow test

e6d27ca5

Add missing comment quotes (#17379) · b8142753
Leon Derczynski authored 3 years ago

b8142753
Remove render tags (#17897) · e113c5cb
NielsRogge authored 3 years ago
```
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
```
e113c5cb
Fix the Conda package build (#16737) · 90415475
Santiago Castro authored 3 years ago
```
* Fix the Conda package build

* Update build.sh

* Update release-conda.yml
```
90415475
Remove DT_DOUBLE from the T5 graph (#17891) · babd7b1a
Michal Szutenberg authored 3 years ago

babd7b1a
Compute min_resolution in prepare_image_inputs (#17915) · 6aae59d0
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
6aae59d0

28 Jun, 2022 14 commits

Fixing a regression with `return_all_scores` introduced in #17606 (#17906) · 776855c7

Nicolas Patry authored 3 years ago

Fixing a regression with `return_all_scores` introduced in #17606

- The legacy test actually tested `return_all_scores=False` (the actual
  default) instead of `return_all_scores=True` (the actual weird case).

This commit adds the correct legacy test and fixes it.

Tmp legacy tests.

Actually fix the regression (also contains lists)

Less diffed code.

776855c7

Pin PyTorch in requirements as well · 5f1e67a5
Sylvain Gugger authored 3 years ago

5f1e67a5
Pin PyTorch while we fix compatibility with 1.12 · 5a3d0cbd
Sylvain Gugger authored 3 years ago

5a3d0cbd

Adding GroupViT Models (#17313) · 6c8f4c9a

Jerry Jiarui XU authored 3 years ago


* add group vit and fixed test (except slow)

* passing slow test

* addressed some comments

* fixed test

* fixed style

* fixed copy

* fixed segmentation output

* fixed test

* fixed relative path

* fixed copy

* add ignore non auto configured

* fixed docstring, add doc

* fixed copies

* Apply suggestions from code review

merge suggestions

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* resolve comment, renaming model

* delete unused attr

* use fix copies

* resolve comments

* fixed attn

* remove unused vars

* refactor tests

* resolve final comments

* add demo notebook

* fixed inconsistent default

* Apply suggestions from code review

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* rename stage->stages

* Create single GroupViTEncoderLayer class

* Update conversion script

* Simplify conversion script

* Remove cross-attention class in favor of GroupViTAttention

* Convert other model as well, add processor to conversion script

* addressing final comment

* fixed args

* Update src/transformers/models/groupvit/modeling_groupvit.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

6c8f4c9a

Mrbean/codegen onnx (#17903) · b424f0b4
mrbean authored 3 years ago

b424f0b4
Add ONNX support for DETR (#17904) · 76d13de5
regisss authored 3 years ago

76d13de5
In `group_texts` function, drop last block if smaller than `block_size` (#17908) · bfcd5743
Bill Ray authored 3 years ago

bfcd5743
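
The behaviour named in the commit title above, dropping a final block that is shorter than `block_size`, can be sketched as follows. This is a hedged illustration and not the code from the repository: only the names `group_texts` and `block_size` come from the commit title, while the dict-of-token-lists input format and the passing of `block_size` as an argument are assumptions.

```python
from itertools import chain


def group_texts(examples, block_size):
    """Concatenate the token lists per key and cut them into full blocks.

    Assumed input: a dict mapping a column name to a list of token lists.
    Any trailing remainder shorter than block_size is dropped.
    """
    if not examples:
        return {}
    # Flatten each column into one long token sequence.
    concatenated = {k: list(chain.from_iterable(v)) for k, v in examples.items()}
    total_length = len(next(iter(concatenated.values())))
    # Round down to a multiple of block_size so every block is full.
    usable_length = (total_length // block_size) * block_size
    return {
        k: [seq[i : i + block_size] for i in range(0, usable_length, block_size)]
        for k, seq in concatenated.items()
    }


batch = {"input_ids": [[1, 2, 3], [4, 5], [6, 7, 8, 9]]}
blocks = group_texts(batch, block_size=4)
print(blocks)
```

With nine tokens and a block size of four, the ninth token is discarded. An input shorter than one block yields no blocks at all under this sketch.
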
Move logic into pixelshuffle layer (#17899) · f71895a6
amyeroberts authored 3 years ago
```
* Move all pixelshuffle logic into layer

* Rename layer

* Use correct input to function
```
f71895a6
Fix loss computation in TFBertForPreTraining (#17898) · 0094565f
Matt authored 3 years ago

0094565f
Pin black to 22.3.0 to benefit from a stable --preview flag (#17918) · 1dfa03f1
Lysandre Debut authored 3 years ago

1dfa03f1
[M2M100] update conversion script (#17916) · 9eec4e93
Suraj Patil authored 3 years ago

9eec4e93

Fix PyTorch/TF Auto tests (#17895) · db2644b9

Yih-Dar authored 3 years ago


* add loading_info

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

db2644b9

Fix `test_number_of_steps_in_training_with_ipex` (#17889) · f717d47f
Yih-Dar authored 3 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f717d47f
Update expected values in constrained beam search tests (#17887) · 0b0dd977
Yih-Dar authored 3 years ago
```
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
0b0dd977

27 Jun, 2022 2 commits
- Fix bug in gpt2's (from-scratch) special scaled weight initialization (#17877) · e02037b3
  Andrej authored 3 years ago
```
* only special scale init each gpt2 c_proj weight once, on exact match

* fix double quotes

Co-authored-by: leandro <leandro.vonwerra@spoud.io>
```
  e02037b3
- Update README_zh-hans.md (#17861) · 6dd00f6b
  吉吉 authored 3 years ago
  
  6dd00f6b