- 27 Feb, 2021 2 commits
- 24 Feb, 2021 2 commits
- abhishek thakur authored
* convbert conversion test * fin * fin * fin * clean up tf<->pt conversion * remove from_pt
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
- 09 Feb, 2021 5 commits
- Sylvain Gugger authored
- Suraj Patil authored
* fix rag generate and tests * put back adjust_logits_during_generation * tests are okay
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
- Patrick von Platen authored
- Patrick von Platen authored
* add wav2vec2CTC and deprecate for maskedlm * remove from docs
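A rough usage sketch of the new CTC head (the checkpoint name and the random waveform below are illustrative placeholders, not taken from the commit): `Wav2Vec2ForCTC` maps `input_values` to per-frame logits, and a greedy argmax plus the tokenizer's CTC-aware decoding yields a transcription.

```python
import torch
from transformers import Wav2Vec2Tokenizer, Wav2Vec2ForCTC

name = "facebook/wav2vec2-base-960h"  # assumed fine-tuned checkpoint, for illustration only
tokenizer = Wav2Vec2Tokenizer.from_pretrained(name)
model = Wav2Vec2ForCTC.from_pretrained(name)

waveform = torch.randn(16000).tolist()  # stand-in for 1 s of real 16 kHz audio
input_values = tokenizer(waveform, return_tensors="pt").input_values

with torch.no_grad():
    logits = model(input_values).logits          # (batch, frames, vocab_size)
predicted_ids = torch.argmax(logits, dim=-1)     # greedy CTC decoding
print(tokenizer.batch_decode(predicted_ids))
```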
- 08 Feb, 2021 3 commits
- Anthony MOI authored
- Patrick von Platen authored
* Bump minimum Jax requirement to 0.2.8 * update table
- 04 Feb, 2021 12 commits
- Sylvain Gugger authored
- Sylvain Gugger authored
- Sylvain Gugger authored
* Authorize last version of tokenizer * Update version table * Fix conversion of spm tokenizers and fix some hub links * Bump tokenizers version to 0.10.1rc1 * Add script to check tokenizers conversion with XNLI * Add some more mask_token lstrip support * Must modify mask_token in slow tokenizers too * Keep using the old method for Pegasus * add missing import
Co-authored-by: Anthony MOI <m.anthony.moi@gmail.com>
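As a hedged illustration of the `mask_token` lstrip behaviour mentioned above (the tokenizer class and checkpoint are assumptions for the example, not named in the commit): declaring the mask token as an `AddedToken` with `lstrip=True` lets it absorb the whitespace on its left, so masked text tokenizes consistently.

```python
from transformers import AddedToken, AlbertTokenizer

# Assumed SentencePiece-based checkpoint, purely for illustration
tok = AlbertTokenizer.from_pretrained(
    "albert-base-v2",
    mask_token=AddedToken("[MASK]", lstrip=True, rstrip=False),
)
print(tok.tokenize("Paris is the [MASK] of France."))
```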
- Nicolas Patry authored
`encoder_no_repeat_ngram_size` from their config.
- Stas Bekman authored
* trainer fixes * don't switch the model just for deepspeed and mp * correct the fix
- Daniel Stancl authored
* Replace `attn_weights = attn_wegihts = tf.reshape(...)` with `attn_weights = tf.reshape(...)`, thus removing the unintentional double assignment.
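For clarity, a toy illustration of the removed pattern (not the actual model code): chained assignment binds both names, so the misspelled `attn_wegihts` silently created an extra, unused variable.

```python
import tensorflow as tf

scores = tf.random.uniform((2, 3, 4))

# Before: chained assignment also binds an unintended, unused name
attn_weights = attn_wegihts = tf.reshape(scores, (6, 4))

# After: a single, intentional assignment
attn_weights = tf.reshape(scores, (6, 4))
```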
- Sylvain Gugger authored
- Nicolas Patry authored
Adding the new `encoder_no_repeat_ngram_size` to `generate`. Blenderbot results seemed off compared to the original ParlAI script (`https://parl.ai/projects/recipes/`): notably, the model seemed to repeat a lot of what was said during the conversation. The actual problem was that ParlAI's `no_repeat_ngram_size` applies to the `encoder_input_ids`, whereas HF's `no_repeat_ngram_size` applies to the previously generated ids (within the decoder). Blenderbot's conversation history lives in the `encoder` part, which explains why HF's implementation had the repetitions. This fix focused on Blenderbot (*not* Blenderbot-small) and added tests for both because they are quite different in configuration. This change includes:
- Adding a new `EncoderNoRepeatNGramLogitsProcessor`.
- Adding 1 new arg to `generate` (`encoder_no_repeat_ngram_size`).
- Adding 1 new config parameter, `encoder_no_repeat_ngram_size`.
- Adding 2 tests: one for the pipeline (high level, with inputs that exhibited the repeat behavior) and one low-level test for `EncoderNoRepeatNGramLogitsProcessor`.
- Factoring `NoRepeatNGramLogitsProcessor` so that its logic could be reused.
Further work:
- The Blenderbot conversational pipeline still does not behave correctly, as the way input is prepared within the pipeline is still incorrect (follow-up PR).
- Blenderbot allows the bot to have personas, which is done by prepending "your persona: XXXX" to the input; this could be explored too in a follow-up PR.
@patrickvonplaten @LysandreJik
* Update src/transformers/generation_logits_process.py
* Update src/transformers/generation_utils.py
* Update src/transformers/generation_utils.py
* Update src/transformers/configuration_utils.py
* Doc quality. * Fixing test. * Last fixes. * Fixing to account for batch_size.
* Update src/transformers/configuration_utils.py
* Update src/transformers/generation_utils.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
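A minimal usage sketch of the new argument (checkpoint and n-gram size are illustrative): `encoder_no_repeat_ngram_size` blocks the decoder from copying n-grams that appear in the encoder input (e.g. the Blenderbot conversation history), while `no_repeat_ngram_size` only constrains n-grams the decoder has already generated.

```python
from transformers import BlenderbotTokenizer, BlenderbotForConditionalGeneration

name = "facebook/blenderbot-400M-distill"  # assumed checkpoint, for illustration
tokenizer = BlenderbotTokenizer.from_pretrained(name)
model = BlenderbotForConditionalGeneration.from_pretrained(name)

inputs = tokenizer("My friends are cool but they eat too many carbs.", return_tensors="pt")
reply_ids = model.generate(
    **inputs,
    encoder_no_repeat_ngram_size=3,  # no 3-gram from the input/history may be copied
    no_repeat_ngram_size=3,          # no 3-gram already generated may be repeated
)
print(tokenizer.batch_decode(reply_ids, skip_special_tokens=True))
```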
- Lysandre Debut authored
- Daniel Hug authored
- demSd authored
* initialize bart4causalLM * create BartDecoderWrapper, setters/getters * delete spaces * forward and additional methods * update cache function, loss function, remove ngram* params in data class. * add bartcausallm, bartdecoder testing * correct bart for causal lm * remove at * add mbart as well * up * fix typo * up * correct * add pegasusforcausallm * add blenderbotforcausallm * add blenderbotsmallforcausallm * add marianforcausallm * add test for MarianForCausalLM * add Pegasus test * add BlenderbotSmall test * add blenderbot test * fix a fail * fix an import fail * a fix * fix * Update modeling_pegasus.py * fix models * fix inputs_embeds setting getter * adapt tests * correct repo utils check * finish test improvement * fix tf models as well * make style * make fix-copies * fix copies * run all tests * last changes * fix all tests
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
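A hedged sketch of the new decoder-only heads (the checkpoint name is illustrative): `BartForCausalLM` runs the BART decoder standalone as an autoregressive language model, and the sibling `*ForCausalLM` classes listed above work the same way.

```python
from transformers import BartTokenizer, BartForCausalLM

name = "facebook/bart-base"  # assumed checkpoint; only the decoder weights are used
tokenizer = BartTokenizer.from_pretrained(name)
model = BartForCausalLM.from_pretrained(name)
assert model.config.is_decoder  # the wrapper forces decoder-only behaviour

inputs = tokenizer("Hello, my dog is", return_tensors="pt")
outputs = model(**inputs, labels=inputs["input_ids"])  # next-token LM loss over the input
print(outputs.loss, outputs.logits.shape)               # logits: (batch, seq_len, vocab_size)
```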
- Sylvain Gugger authored
- 03 Feb, 2021 11 commits
- Stefan Schweter authored
- sandip authored
- sandip authored
- sandip authored
- sandip authored
* TF DistilBERT integration test * Update test_modeling_tf_distilbert.py
- sandip authored
* TF Albert integration test * TF Albert integration test added
- Suraj Patil authored
- yylun authored
* fix steps_in_epoch variable when using max_steps * redundant sentence * Revert "redundant sentence" This reverts commit ad5c0e9b6e66d65732dee2239cdc9c76dfa0dc5a. * remove redundant sentence
Co-authored-by: wujindou <wujindou@sogou-inc.com>
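A minimal sketch of the corrected logic (a standalone helper for illustration; the real code lives inside `Trainer.train` with its own variable names): when the training dataset has no usable length, the per-epoch step count has to be derived from `max_steps` instead of the dataloader.

```python
from typing import Optional

def compute_steps_in_epoch(dataloader_len: Optional[int], max_steps: int,
                           gradient_accumulation_steps: int) -> int:
    """Number of batches the inner loop runs per 'epoch'.

    With a sized dataset the dataloader length is authoritative; with an
    iterable dataset (no length) max_steps drives the loop, and each optimizer
    update consumes `gradient_accumulation_steps` batches.
    """
    if dataloader_len is not None:
        return dataloader_len
    return max_steps * gradient_accumulation_steps

print(compute_steps_in_epoch(None, max_steps=1000, gradient_accumulation_steps=2))  # 2000
```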
- Julien Plu authored
* Fix Longformer and LED * Add a test for graph execution with inputs_embeds * Apply style
- Stas Bekman authored
bleach looks like a vulnerability and isn't really used anywhere in the code, so it might as well be removed completely from the deps. https://github.com/huggingface/transformers/security/dependabot/examples/research_projects/lxmert/requirements.txt/bleach/open
- abhishek thakur authored
- 02 Feb, 2021 5 commits
- Daniel Stancl authored
* Add {decoder_,}head_mask to LED * Fix create_custom_forward signature in encoder * Add head_mask to Longformer * Add head_mask to Longformer to fix dependencies of LED on Longformer * Not working yet * Add one missing input in modeling_longformer.py * make fix-copies
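A hedged sketch of the new arguments (the checkpoint and the choice of masked head are illustrative): `head_mask` / `decoder_head_mask` carry one row per layer and one column per attention head, and a 0.0 entry disables that head for the forward pass.

```python
import torch
from transformers import LEDTokenizer, LEDForConditionalGeneration

name = "allenai/led-base-16384"  # assumed checkpoint, for illustration
tokenizer = LEDTokenizer.from_pretrained(name)
model = LEDForConditionalGeneration.from_pretrained(name)

inputs = tokenizer("Summarize: the quick brown fox jumps over the lazy dog.", return_tensors="pt")
head_mask = torch.ones(model.config.encoder_layers, model.config.encoder_attention_heads)
head_mask[0, 0] = 0.0  # mask head 0 of encoder layer 0
decoder_head_mask = torch.ones(model.config.decoder_layers, model.config.decoder_attention_heads)

outputs = model(
    **inputs,
    decoder_input_ids=inputs["input_ids"],
    head_mask=head_mask,
    decoder_head_mask=decoder_head_mask,
)
print(outputs.logits.shape)
```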
- Patrick von Platen authored
* add raw scaffold * implement feat extract layers * make style * remove + * correctly convert weights * make feat extractor work * make feature extraction proj work * run forward pass * finish forward pass * Successful decoding example * remove unused files * more changes * add wav2vec tokenizer * add new structure * fix run forward * add other layer norm architecture * finish 2nd structure * add model tests * finish tests for tok and model * clean-up * make style * finish docstring for model and config * make style * correct docstring * correct tests * change checkpoints to fairseq * fix examples * finish wav2vec2 * make style * apply sylvains suggestions * apply lysandres suggestions * change print to log.info * re-add assert statement * add input_values as required input name * finish wav2vec2 tokenizer * Update tests/test_tokenization_wav2vec2.py * apply sylvains suggestions
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
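A rough sketch of the new model's forward pass (the checkpoint name and the random waveform are placeholders): the tokenizer turns a raw 16 kHz waveform into `input_values`, and `Wav2Vec2Model` returns one hidden state per (roughly) 20 ms frame.

```python
import torch
from transformers import Wav2Vec2Tokenizer, Wav2Vec2Model

name = "facebook/wav2vec2-base-960h"  # assumed fairseq-converted checkpoint
tokenizer = Wav2Vec2Tokenizer.from_pretrained(name)
model = Wav2Vec2Model.from_pretrained(name)

waveform = torch.randn(16000).tolist()  # stand-in for 1 s of 16 kHz audio
input_values = tokenizer(waveform, return_tensors="pt").input_values  # shape (1, 16000)

with torch.no_grad():
    hidden_states = model(input_values).last_hidden_state
print(hidden_states.shape)  # roughly (1, 49, 768) for a 1 s clip with the base model
```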
- Sylvain Gugger authored
- Stefan Schweter authored
- Sylvain Gugger authored