Commits · smangrul/fix-auto-batch-finder-trainer-issue · zhusg / transformers-new

07 Jun, 2023 17 commits
- Merge branch 'main' into smangrul/fix-auto-batch-finder-trainer-issue · 22a8987b
  Sourab Mangrulkar authored 2 years ago
  
  22a8987b
- fix executable batch size issue (#24067) · 12298cb6
  Sourab Mangrulkar authored 2 years ago
```
* fix executable batch size issue

* fix

* undo
```
  12298cb6
- undo · d5830355
  Sourab Mangrulkar authored 2 years ago
  
  d5830355
- Update delete_doc_comment_trigger.yml (#24084) · ef010071
  Mishig authored 2 years ago
```
fix base workflow name
```
  ef010071
- fix · 5c02c8a2
  Sourab Mangrulkar authored 2 years ago
  
  5c02c8a2
- Fix expected value in tests of the test fetcher (#24077) · 89b00eef
  Sylvain Gugger authored 2 years ago
```
* Fix expected value in tests of the test fetcher

* Fix trigger for repo util tests
```
  89b00eef
- [doc build] Use secrets (#24079) · 5c9394b5
  Mishig authored 2 years ago
  
  5c9394b5
- Make the TF dummies even smaller (#24071) · 1fc832b4
  Matt authored 2 years ago
```
* Let's see if we can use the smallest possible dummies

* Make GPT-2's dummies a little longer

* Just use (1,2) as the default shape

* Update other dummies in sync

* Correct imports for Keras 2.13

* Shrink the Wav2Vec2 dummies
```
  1fc832b4
- Be nice to TF (#24076) · 092c14c3
  Yih-Dar authored 2 years ago
```
* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  092c14c3
- [`bnb`] Fix bnb skip modules (#24043) · 47952192
  Younes Belkada authored 2 years ago
```
* fix skip modules test

* oops

* address comments
```
  47952192
- Fix `is_optimum_neuron_available` (#23961) · a1160185
  Michael Benayoun authored 2 years ago
```
Fix is_optimum_neuron_available
```
  a1160185
- [`Hub`] Add `safe_serialization` in push_to_hub (#24074) · 6b548129
  Younes Belkada authored 2 years ago
```
add `safe_serialization` in push_to_hub
```
  6b548129
- Support PEFT models when saving the model using trainer (#24073) · 6daf7c31
  Younes Belkada authored 2 years ago
```
* support PEFT models when saving the model using trainer

* fixup
```
  6daf7c31
- Add support for non-rust implemented tokenization for `__getitem__` method. (#24039) · 1e4a7737
  YangLiu authored 2 years ago
```
* Add support for non-rust implemented tokenization for `__getitem__` method.

* Update for error message on adding new sub-branch for `__item__` method.

---------

Co-authored-by: liuyang17 <liuyang17@zhihu.com>
```
  1e4a7737
- [Wav2Vec2] Fix torch srcipt (#24062) · 52972e70
  Patrick von Platen authored 2 years ago
```
* [Wav2Vec2] Fix torch srcipt

* fix more
```
  52972e70
- Generate: increase left-padding test atol (#23448) · 612b2a1a
  Joao Gante authored 2 years ago
```
increase atol
```
  612b2a1a
- fix executable batch size issue · 5eda3a67
  Sourab Mangrulkar authored 2 years ago
  
  5eda3a67
06 Jun, 2023 17 commits

Remote code improvements (#23959) · f1660d7e

Sylvain Gugger authored 2 years ago


* Fix model load when it has both code on the Hub and locally

* Add input check with timeout

* Add tests

* Apply suggestions from code review

Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

* Some non-saved stuff

* Add feature extractors

* Add image processor

* Add model

* Add processor and tokenizer

* Reduce timeout

---------

Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

f1660d7e

Fix device placement for model-parallelism in generate for encoder/de… (#24025) · 60825f2c
Sylvain Gugger authored 2 years ago
```
* Fix device placement for model-parallelism in generate for encoder/decoders

* Remove debug statements
```
60825f2c
bring back `filtered_test_list_cross_tests.txt` (#24055) · 02d255db
Yih-Dar authored 2 years ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
02d255db

Use new parametrization based weight norm if available (#24030) · bc9ecef9

Edward Z. Yang authored 2 years ago

* Use new parametrization based weight norm if available

See https://github.com/pytorch/pytorch/pull/103001



Signed-off-by: Edward Z. Yang <ezyang@meta.com>

* handle copies

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

* black

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

---------

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

bc9ecef9

Move TF building to an actual build() method (#23760) · 4a55e478

Matt authored 2 years ago

* A fun new PR where I break the entire codebase again

* A fun new PR where I break the entire codebase again

* Handle cross-attention

* Move calls to model(model.dummy_inputs) to the new build() method

* Seeing what fails with the build context thing

* make fix-copies

* Let's see what fails with new build methods

* Fix the pytorch crossload build calls

* Fix the overridden build methods in vision_text_dual_encoder

* Make sure all our build methods set self.built or call super().build(), which also sets it

* make fix-copies

* Remove finished TODO

* Tentatively remove unneeded (?) line

* Transpose b in deberta correctly and remove unused threading local

* Get rid of build_with_dummies and all it stands for

* Rollback some changes to TF-PT crossloading

* Correctly call super().build()

4a55e478

Oops, missed one (#24054) · cbf6bc23
Zachary Mueller authored 2 years ago
```
Oops
```
cbf6bc23

Reduce memory usage in TF building (#24046) · 7203ea67

Matt authored 2 years ago

* Make the default dummies (2, 2) instead of (3, 3)

* Fix for Funnel

* Actually fix Funnel

7203ea67

Act on deprecations in Accelerate no_trainer examples (#24053) · 072188d6
Zachary Mueller authored 2 years ago
```
Act on deprecation
```
072188d6
Tiny fix for `check_self_hosted_runner.py` (#24052) · ff4c0fc7
Yih-Dar authored 2 years ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
ff4c0fc7

Add TimmBackbone model (#22619) · a717e031

amyeroberts authored 2 years ago


* Add test_backbone for convnext

* Add TimmBackbone model

* Add check for backbone type

* Tidying up - config checks

* Update convnextv2

* Tidy up

* Fix indices & clearer comment

* Exceptions for config checks

* Correclty update config for tests

* Safer imports

* Safer safer imports

* Fix where decorators go

* Update import logic and backbone tests

* More import fixes

* Fixup

* Only import all_models if torch available

* Fix kwarg updates in from_pretrained & main rebase

* Tidy up

* Add tests for AutoBackbone

* Tidy up

* Fix import error

* Fix up

* Install nattan in doc_test_job

* Revert back to setting self._out_xxx directly

* Bug fix - out_indices mapping from out_features

* Fix tests

* Dont accept output_loading_info for Timm models

* Set out_xxx and don't remap

* Use smaller checkpoint for test

* Don't remap timm indices - check out_indices based on stage names

* Skip test as it's n/a

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Cleaner imports / spelling is hard

---------

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

a717e031

Modification of one text example file should trigger said test (#24051) · b8935980
Sylvain Gugger authored 2 years ago

b8935980
Prevent ZeroDivisionError on `trainer.evaluate` if model and dataset are tiny (#24049) · 02fe3af2
Tom Aarsen authored 2 years ago
```
Prevent ZeroDivisionError if evaluation is too quick
```
02fe3af2
Use TruncatedNormal from Keras initializers (#24036) · d924390d
Roy Hvaara authored 2 years ago
```
Co-authored-by: Andrey Voynov <avoin@google.com>
```
d924390d
Fixing single candidate_label return. (#24023) · c2e3fa0b
Nicolas Patry authored 2 years ago

c2e3fa0b

Add check for tied parameters (#24029) · 6307312d

Marc Sun authored 2 years ago

* Add check for tied parameters

* Fix style

* fix style

* Fix versioning

* Change if to elif

6307312d

[i18n-KO] Translated `bertology.mdx` to Korean (#23968) · 7da3ce04

Wonhyeong Seo authored 2 years ago


* docs: ko: `bertology.mdx`

* feat: nmt draft

* fix: manual edits

* fix: resolve suggestions

Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

---------

Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>

7da3ce04

[i18n-KO] Translated `language-modeling.mdx` (#23969) · c9385976

Wonhyeong Seo authored 2 years ago


* docs: ko: `language_modeling.mdx`

* feat: nmt draft

* fix: manual edits

* fix: add inline toc

* fix: typo in toc_tree.yml

* fix: resolve suggestions

Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>

---------

Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>

c9385976

05 Jun, 2023 6 commits

Pin `deepspeed` to `0.9.2` for now (#24024) · 7631db0f
Yih-Dar authored 2 years ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
7631db0f

Fix `MobileViTV2` checkpoint name (#24018) · 17846646

Yih-Dar authored 2 years ago


* fix

* fix

* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

17846646

[i18n-KO] Translated `tasks_explained.mdx` to Korean (#23844) · 649ffbf5

Hyeonseo Yun authored 2 years ago


* docs: ko: tasks_explained.mdx

* feat: nmt and manual edit `tasks_explained.mdx`

* revised: resolve suggestions task_explained.mdx

* fixed: added draft of reference docs

Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com>

* revised: resolve suggestions(voca, spell check) task_explained.mdx

Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com>

* revised: remove duplicate sentence in task_explained.mdx

* fixed: remove draft of reference docs

- I think it will be confusing in the translation process.
- This issue is included in #23971.

---------

Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com>
Co-authored-by: Nayeon Han <nayeon2.han@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>

649ffbf5

TensorBoard callback no longer adds hparams (#23999) · 2872f967
Brian Yu authored 2 years ago
```
tensorboard callback no longer adds hparams
```
2872f967

Pix2Struct: fix wrong broadcast axis of attention mask in visual encoder (#23976) · 44bd590a

Jungwoo Park authored 2 years ago


* fix wrong broadcast axis of attention mask in visual encoder

* fix slow tests

---------

Co-authored-by: younesbelkada <younesbelkada@gmail.com>

44bd590a

expose safe_serialization argument in the pipeline API (#23775) · 7824fa43

Yessen Kanapin authored 2 years ago


expose safe_serialization argument of PreTrainedModel and TFPreTrainedModel in the save_pretrained of the pipeline api

Co-authored-by: Yessen Kanapin <yessen@deepinfra.com>

7824fa43