Commits · 51227e26ab8fe6d1a19804da697786649f9340e3 · zhusg / transformers-new

29 Jul, 2022 6 commits

Fix TFSegformerForSemanticSegmentation doctest (#18362) · 51227e26
Yih-Dar authored 2 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
51227e26

[FX] Symbolic trace for Bloom (#18356) · 4e2f4a92

* Bloom model can now be traced

* Bloom traced model can be torch scripted and serialized

* Bloom can be traced with variable keyword arguments

* Enable XLNet support

* Disable XLNet for now

4e2f4a92

Fix some doctests (#18359) · 1763770b

Yih-Dar authored 2 years ago


* Fix some doctests

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

1763770b

Replace `as_target` context managers by direct calls (#18325) · 986526a0

Sylvain Gugger authored 2 years ago


* Preliminary work on tokenizers

* Quality + fix tests

* Treat processors

* Fix pad

* Remove all uses of  in tests, docs and examples

* Replace all as_target_tokenizer

* Fix tests

* Fix quality

* Update examples/flax/image-captioning/run_image_captioning_flax.py

Co-authored-by: amyeroberts <amy@huggingface.co>

* Style

Co-authored-by: amyeroberts <amy@huggingface.co>

986526a0

Fix OwlViT torchscript tests (#18347) · a64bcb56
Yih-Dar authored 2 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a64bcb56
[Docs] Fix Speech Encoder Decoder doc sample (#18346) · a4ee463d
Sanchit Gandhi authored 2 years ago
```
* [Docs] Fix Speech Encoder Decoder doc sample

* improve pre-processing comment

* make style
```
a4ee463d

28 Jul, 2022 10 commits

Migrate metrics used in flax examples to Evaluate (#18348) · da503ea0

Vijay S Kalmath authored 2 years ago

Currently, tensorflow examples use the `load_metric` function from the
Datasets library; this commit migrates the call to the `load` function
from the Evaluate library.

da503ea0

Migrate metric to Evaluate library for tensorflow examples (#18327) · a2586795

Vijay S Kalmath authored 2 years ago

* Migrate metric to Evaluate library in tf examples

Currently, tensorflow examples use the `load_metric` function from the
Datasets library; this commit migrates the call to the `load` function
from the Evaluate library.

Fix for #18306

* Migrate metric to Evaluate library in tf examples

Currently, tensorflow examples use the `load_metric` function from the
Datasets library; this commit migrates the call to the `load` function
from the Evaluate library.

Fix for #18306

* Migrate `metric` to Evaluate for all tf examples

Currently, tensorflow examples use the `load_metric` function from the
Datasets library; this commit migrates the call to the `load` function
from the Evaluate library.

a2586795

[BLOOM] Deprecate `position_ids` (#18342) · 7b090876
Thomas Wang authored 2 years ago

7b090876
Include tensorflow-aarch64 as a candidate (#18345) · 9c336657
Ankur Goyal authored 2 years ago
```
Co-authored-by: Ankur Goyal <ankur@impira.com>
```
9c336657
Remove Flax OPT from doctest for now (#18338) · b53dab60
Yih-Dar authored 2 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
b53dab60
Fix codeparrot deduplication - ignore whitespaces (#18023) · 286a18fa
Loubna Ben Allal authored 2 years ago
```
* ignore whitespaces for hash

* reformat code

* Update README.md
```
286a18fa
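
The idea behind "ignore whitespaces for hash" can be sketched as follows. This is a minimal illustration, not the CodeParrot script itself: the names `WHITESPACE`, `content_hash` and `deduplicate` are hypothetical, and the choice of MD5 is an assumption made for the example.

```python
import hashlib
import re
from typing import List, Set

# Matches any run of whitespace (spaces, tabs, newlines).
WHITESPACE = re.compile(r"\s+")


def content_hash(text: str) -> str:
    """Return a hex digest of `text` computed after removing all whitespace."""
    stripped = WHITESPACE.sub("", text)
    return hashlib.md5(stripped.encode("utf-8")).hexdigest()


def deduplicate(files: List[str]) -> List[str]:
    """Keep the first file seen for each whitespace-insensitive hash."""
    seen: Set[str] = set()
    unique = []
    for text in files:
        digest = content_hash(text)
        if digest not in seen:
            seen.add(digest)
            unique.append(text)
    return unique
```

With this approach, two files that differ only in indentation or line breaks hash to the same value and count as duplicates. The trade-off is that tokens separated only by whitespace are also merged before hashing.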
Update automatic_speech_recognition.py (#18339) · 5d1fed07
bhuang authored 2 years ago

5d1fed07
Updated _toctree.yml (#18337) · 985c7e3a
Nicola Procopio authored 2 years ago

985c7e3a

updated translation (#18333) · a8e27957

Edoardo Federici authored 2 years ago

Left the term "fine-tuning" untranslated, since there is no correct translation into Italian and the English term is generally used. The same was done with some terms like "learning rate".

a8e27957

fixed typo (#18331) · 1e380c7d
Edoardo Federici authored 2 years ago

1e380c7d

27 Jul, 2022 19 commits

Update feature extractor docs (#18324) · 96be1b7f

Steven Liu authored 2 years ago

As pointed out by @NielsRogge, a feature extractor is used to prepare inputs for a model with a single modality rather than multimodal models.

96be1b7f

start from 1.12, torch_ccl is renamed as oneccl_bindings_for_pytorch … (#18229) · 2b81f72b

Wang, Yi authored 2 years ago


* starting from 1.12, torch_ccl is renamed to oneccl_bindings_for_pytorch and must be imported before use

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* add doc for perf_train_cpu_many

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* update doc

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

2b81f72b
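
One way to cope with a renamed package is to look for the new name first and fall back to the old one. The sketch below only assumes the two module names given in the commit message; the helper `find_ccl_module` is hypothetical and is not the code added by this commit.

```python
import importlib.util
from typing import Optional

# Candidate module names, newest first. Per the commit message, torch_ccl is
# renamed to oneccl_bindings_for_pytorch starting from 1.12.
CCL_MODULE_NAMES = ("oneccl_bindings_for_pytorch", "torch_ccl")


def find_ccl_module() -> Optional[str]:
    """Return the first installed candidate module name, or None if neither is found."""
    for name in CCL_MODULE_NAMES:
        if importlib.util.find_spec(name) is not None:
            return name
    return None
```

A caller could then pass the returned name to `importlib.import_module` before using the backend, and report a clear error when the result is `None`.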

Add swin transformer v2 (#17469) · e87ac9d1

Ritik Nandwal authored 2 years ago


* Add files generated using transformers-cli add-new-model-like command

* Add changes for swinv2 attention and forward method

* Add fixes

* Add modifications for weight conversion and remaining args in swin model

* Add changes for patchmerging

* Add changes for SwinV2selfattention

* Update conversion script

* Add final fixes for the swin_v2 model

* Add changes for conversion script for pretrained window size case

* Add pretrained window size value from config in SwinV2Encoder class

* Make fixup

* Add swinv2 to models_not_in_readme to utils/check_copies.py

* Modify Swinv2v2 to Swin Transformer V2

* Remove copied from, to run make fixup command

* Add updates to swinv2tf from main branch

* Add pretrained_window_size to config, to make tests pass

* Add modified weights from nandwalritik profile for swinv2

* Update model weights from swinv2 from nandwalritik profile

* Add fix for build_pr_documentation CI fix

* Add fixes for weight conversion

* Add change to make input with padding work

* Add fixes for test cases

* Add few changes from swin to swinv2 to pass test cases

* Remove tests for tensorflow as swinv2 for TF is not added yet

* Override test_pt_tf_model_equivalence function as TF implementation for swinv2 is not added yet

* Add modeling_tf_swinv2 to _ignore_modules as test file is removed for this one right now.

* Update docs url for swinv2 in README.md

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Undo changes for check_repo

* Update url in readme.md

* Remove overridden function to test pt_tf_model_equivalence

* Remove TF model imports for Swinv2 as it's not implemented in this PR

* Add changes for index.mdx

* Add swinv2 paper link, abstract and contributor details

* Rename cpb_mlp to continous_position_bias_mlp

* Add tips for swinv2 model

* Update src/transformers/models/swinv2/configuration_swinv2.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/swinv2/configuration_swinv2.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Fix indentation for docstring example in src/transformers/models/swinv2/configuration_swinv2.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update import order in src/transformers/models/swinv2/configuration_swinv2.py

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add copyright statements in weights conversion script.

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Remove Swinv2 from models_not_in_readme

* Reformat code

* Remove TF implementation file for swinv2

* Update start docstring.

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add changes for docstring

* Update orgname for weights to microsoft

* Remove to_2tuple function

* Add copied from statements wherever applicable

* Add copied from to Swinv2ForMaskedImageModelling class

* Reformat code.

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add unittest.skip(with reason.) for test_inputs_embeds test case.

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add updates for test_modeling_swinv2.py

* Add @unittest.skip() annotation for clarity to create_and_test_config_common_properties function

* Add continuous_position_bias_mlp parameter to conversion script

* Add test for testing masked_image_modelling for swinv2

* Update Swinv2 to Swin Transformer v2 in docs/source/en/model_doc/swinv2.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update Swinv2 to Swin Transformer v2 in docs/source/en/model_doc/swinv2.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/swinv2.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update docs/source/en/model_doc/swinv2.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add suggested changes

* Add copied from to forward methods of Swinv2Stage and Swinv2Encoder

* Add push_to_hub flag to weight conversion script

* Change order of Swinv2DropPath class

* Add id2label mapping for imagenet 21k

* Add updated url for SwinV2 functions and classes used in implementation

* Update input_feature dimensions format, mentioned in comments.

Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>

* Add suggested changes for modeling_swin2.py

* Update docs

* Remove create_and_test_config_common_properties function, as test_model_common_attributes is sufficient.

* Fix indentation.

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add changes for making Nit objects in code style

* Add suggested changes

* Add suggested changes for test_modelling_swinv2

* make fix-copies

* Update docs/source/en/model_doc/swinv2.mdx

Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Alara Dirik <8944735+alaradirik@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

e87ac9d1

Dev version · c89a592e
Lysandre authored 2 years ago

c89a592e

[Flax] Fix incomplete batches in example scripts (#17863) · 7490a97c

Sanchit Gandhi authored 2 years ago

* [Flax] Fix incomplete batches in example scripts

* fix dataloader batching

* convert jnp batch idxs to np array

* add missing `pad_shard_unpad` to final prediction generate step

* only `pad_shard_unpad` at inference time

* merge conflicts

* remove incomplete batch step from eval

* fix run_qa.py

* add `pad_shard_unpad` to run_flax_ner.py

* add `pad_shard_unpad` to run_flax_glue.py

* add `pad_shard_unpad` to run_image_classification.py

* make style

* fix mlm flax eval batches

* remove redundant imports

7490a97c
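
The pad, shard, unpad pattern named in this commit can be illustrated in plain Python. This is a sketch of the general idea only, using lists instead of arrays; `pad_shard_unpad_sketch` and its parameters are hypothetical and do not reproduce the `pad_shard_unpad` helper the example scripts use.

```python
from typing import Callable, List


def pad_shard_unpad_sketch(
    fn: Callable[[List[int]], List[int]], num_devices: int, pad_value: int = 0
) -> Callable[[List[int]], List[int]]:
    """Wrap `fn` so it accepts batches whose size is not a multiple of `num_devices`."""

    def wrapped(batch: List[int]) -> List[int]:
        n = len(batch)
        # Pad so the batch divides evenly across devices.
        padding = (num_devices - n % num_devices) % num_devices
        padded = list(batch) + [pad_value] * padding
        per_device = len(padded) // num_devices
        # Shard: one equally sized slice per device.
        shards = [
            padded[i * per_device : (i + 1) * per_device] for i in range(num_devices)
        ]
        # Apply the per-shard function, then flatten the results.
        outputs = [y for shard in shards for y in fn(shard)]
        # Unpad: drop the results that correspond to padding.
        return outputs[:n]

    return wrapped
```

For example, wrapping a function that doubles each element with `num_devices=4` and calling it on a batch of five items pads the batch to eight, processes four shards of two, and returns only the first five results. This keeps the final incomplete batch in evaluation instead of dropping it.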

Owlvit test fixes (#18303) · 9caf68a6

Alara Dirik authored 2 years ago

* fix owlvit test assertion errors

* fix gpu test error

* remove redundant lines

* fix styling

9caf68a6

Fix sacremoses soft dependency for Transformers XL (#18321) · 0077360d
Sylvain Gugger authored 2 years ago
```
* Fix sacremoses soft dependency for Transformers XL

* Add function to the submodule init
```
0077360d
sentencepiece shouldn't be required for the fast LayoutXLM tokenizer (#18320) · 5c5676cd
Lysandre Debut authored 2 years ago

5c5676cd
Remove all uses of six (#18318) · cf32b2ee
Sylvain Gugger authored 2 years ago
```
* Remove all uses of six

* fix quality
```
cf32b2ee
Generalize decay_mask_fn to apply mask to all LayerNorm params (#18273) · 170fcaa6
Duong A. Nguyen authored 2 years ago
```
* generalize decay_mask_fn to find all layernorm params

* fixup

* generalising decay_mask_fn
```
170fcaa6
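
A weight-decay mask of this kind can be sketched over a flattened parameter dictionary keyed by path tuples. The example below is an illustration under stated assumptions: the candidate name fragments in `LAYER_NORM_NAMES`, the helper names, and the per-component matching rule are all hypothetical, not the code from the example scripts.

```python
from typing import Dict, Tuple

Path = Tuple[str, ...]

# Assumed name fragments that identify a LayerNorm module in a parameter path.
LAYER_NORM_NAMES = ("layernorm", "layer_norm", "ln")


def is_layer_norm(path: Path) -> bool:
    """True if any component of the path looks like a LayerNorm module name."""
    return any(name in part.lower() for part in path for name in LAYER_NORM_NAMES)


def decay_mask(flat_params: Dict[Path, object]) -> Dict[Path, bool]:
    """Map each parameter path to True if weight decay should apply to it.

    Biases and all LayerNorm parameters are excluded from weight decay.
    """
    return {
        path: path[-1] != "bias" and not is_layer_norm(path)
        for path in flat_params
    }
```

Matching on a list of name fragments rather than one hard-coded module name is what lets the mask cover LayerNorm parameters however a given model names them.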
fix loading from pretrained for sharded model with `torch_dtype="auto"` (#18061) · 83d2d745
Nouamane Tazi authored 2 years ago

83d2d745
fix module order (#18312) · 7996ef74
Younes Belkada authored 2 years ago
```
- put gelu before 4h to h
```
7996ef74

Fixes torch jit tracing for LayoutLMv2 model (re-open) (#18313) · 70e7d1d6

Mikkel Denker authored 2 years ago

* Fixes torch jit tracing for LayoutLMv2 model.
PyTorch seems to reuse memory for input_shape, which caused a mismatch in shapes later in the forward pass.

* Fixed code quality

* avoid unneeded allocation of vector for shape

70e7d1d6

Update CodeParrot readme to include training in Megatron (#17798) · 1d71ad89

Loubna Ben Allal authored 2 years ago


* add info about megatron training

* upload models and datasets from CodeParrot organization

* upload models and datasets from CodeParrot organization

* Update examples/research_projects/codeparrot/README.md

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* Update examples/research_projects/codeparrot/README.md

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

* fix typo and add comment about codeparrot vs megatron

Co-authored-by: Leandro von Werra <lvwerra@users.noreply.github.com>

1d71ad89

[XLA] Improve t5 model performance (#18288) · d5610b53
Yanming Wang authored 2 years ago

d5610b53
Apply type correction to `TFSwinModelOutput` (#18295) · e318cda9
Seunghwan Hong authored 2 years ago
```
Signed-off-by: Seunghwan Hong <seunghwan@scatterlab.co.kr>
```
e318cda9

[EncoderDecoder] Improve docs (#18271) · ccd4180f

NielsRogge authored 2 years ago


* Improve docs

* Improve docs of speech one as well

* Apply suggestions from code review

Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

ccd4180f

Remove duplicated line (#18310) · 5dfec704

Manuel R. Ciosici authored 2 years ago

Removes a duplicated instantiation of device. I removed the second instance of the line to maintain code alignment with the GPT-J implementation of forward.

5dfec704

[DETR] Improve code examples (#18262) · 47c2af09

NielsRogge authored 2 years ago


* Improve doc test

* Improve code example of segmentation model

* Apply suggestion

* Update src/transformers/models/detr/modeling_detr.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

47c2af09

26 Jul, 2022 5 commits

patch for smddp import (#18244) · ee67e7ad
Carolyn Wang authored 2 years ago
```
* add import

* format
```
ee67e7ad

Fix Sylvain's nits on the original KerasMetricCallback PR (#18300) · 68097dcc

Matt authored 2 years ago


* Fix Sylvain's nits on the original PR

* Update src/transformers/keras_callbacks.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Re-add "optional" to docstring

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

68097dcc

Add PYTEST_TIMEOUT for CircleCI test jobs (#18251) · 66491331
Yih-Dar authored 2 years ago
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
66491331

Add Spanish translation of custom_models.mdx (#17807) · a5d50483

Ian Castillo authored 2 years ago

* Update index

* Translate to Spanish two sections from custom_models

* Translate to Spanish custom models documentation

* Fixing typos and grammatical errors

* Add requested changes from reviewer

a5d50483

Add Italian translation of sharing_custom_models.mdx (#17631) · 7ea7eba3

Federico Panero authored 2 years ago


* work in progress: custom_models

* Update custom_models.mdx

* Update custom_models.mdx

* Update _toctree.yml

* Update _toctree.yml

* Update custom_models.mdx

* Update custom_models.mdx

* Update _toctree.yml

* Update _toctree.yml

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

7ea7eba3