Commits · 01a61305717b39499692383cd9b0d46bf2bb2e12 · 某某某 / transformers-new

29 Jan, 2024 13 commits

use scripts · 01a61305
ydshieh authored 1 year ago

01a61305
Use env.NUM_SLICES · 818ecbdd
ydshieh authored 1 year ago

818ecbdd
Add comment · 3450abbf
ydshieh authored 1 year ago

3450abbf
update / add new workflow files · 0731f406
ydshieh authored 1 year ago

0731f406
fix · e152ddd1
ydshieh authored 1 year ago

e152ddd1
fix · c3cbabd4
ydshieh authored 1 year ago

c3cbabd4
avoid using job name · 8d6dd076
ydshieh authored 1 year ago

8d6dd076

Enable Gradient Checkpointing in Deformable DETR (#28686) · 0548af54

Nate Cibik authored 1 year ago

* Enabled gradient checkpointing in Deformable DETR

* Enabled gradient checkpointing in Deformable DETR encoder

* Removed # Copied from headers in modeling_deta.py to break dependence on Deformable DETR code

0548af54

PatchtTST and PatchTSMixer fixes (#28083) · f72c7c22

Wesley Gifford authored 1 year ago

* 🐛

 fix .max bug

* remove prediction_length from regression output dimensions

* fix parameter names, fix output names, update tests

* ensure shape for PatchTST

* ensure output shape for PatchTSMixer

* update model, batch, and expected for regression distribution test

* update test expected

Signed-off-by: Wesley M. Gifford <wmgifford@us.ibm.com>

* Update tests/models/patchtst/test_modeling_patchtst.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/patchtst/test_modeling_patchtst.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/patchtst/test_modeling_patchtst.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/patchtsmixer/modeling_patchtsmixer.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/patchtsmixer/test_modeling_patchtsmixer.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/patchtsmixer/test_modeling_patchtsmixer.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* standardize on patch_length

Signed-off-by: Wesley M. Gifford <wmgifford@us.ibm.com>

* Update tests/models/patchtsmixer/test_modeling_patchtsmixer.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/patchtsmixer/test_modeling_patchtsmixer.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Make arguments more explicit

Signed-off-by: Wesley M. Gifford <wmgifford@us.ibm.com>

* adjust prepared inputs

Signed-off-by: Wesley M. Gifford <wmgifford@us.ibm.com>

---------

Signed-off-by: Wesley M. Gifford <wmgifford@us.ibm.com>
Co-authored-by: Wesley M. Gifford <wmgifford@us.ibm.com>
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

f72c7c22

[Docs] Fix Typo in English & Japanese CLIP Model Documentation (TMBD -> TMDB) (#28751) · 3a08cc48
Vinyzu authored 1 year ago
```
* [Docs] Fix Typo in English CLIP model_doc

* [Docs] Fix Typo in Japanese CLIP model_doc
```
3a08cc48
Fix input data file extension in examples (#28741) · 39fa4009
Klaus Hipp authored 1 year ago

39fa4009

Fix `DepthEstimationPipeline`'s docstring (#28733) · 5649c0cb

Yih-Dar authored 1 year ago


* fix

* fix

* Fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5649c0cb

Add serialization logic to pytree types (#27871) · 243e186e
Angela Yi authored 1 year ago
```
* Add serialized type name to pytrees

* Modify context

* add serde test
```
243e186e

28 Jan, 2024 1 commit
- [`Siglip`] protect from imports if sentencepiece not installed (#28737) · f1cc6157
  amyeroberts authored 1 year ago
```
[Siglip] protect from imports if sentencepiece not installed
```
  f1cc6157
27 Jan, 2024 2 commits
- Generate: deprecate old src imports (#28607) · 03cc1777
  Joao Gante authored 1 year ago
  
  03cc1777
- Falcon: removed unused function (#28605) · a28a7699
  Joao Gante authored 1 year ago
  
  a28a7699
26 Jan, 2024 12 commits

[Flax] Update no init test for Flax v0.7.1 (#28735) · de13a951
Sanchit Gandhi authored 1 year ago

de13a951
[docs] Fix datasets in guides (#28715) · abe0289e
Steven Liu authored 1 year ago
```
* change datasets

* fix
```
abe0289e

Unpin pydantic (#28728) · f8b7c434

Yih-Dar authored 1 year ago


* try pydantic v2

* try pydantic v2

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

f8b7c434

fix: suppress `GatedRepoError` to use cache file (fix #28558). (#28566) · 3aea38ce

Scruel Tao authored 1 year ago

* fix: suppress `GatedRepoError` to use cache file (fix #28558).

* move condition_to_return parameter back to outside.

3aea38ce

Stop confusing the TF compiler with ModelOutput objects (#28712) · 708b19eb

Matt authored 1 year ago

* Stop confusing the TF compiler with ModelOutput objects

* Stop confusing the TF compiler with ModelOutput objects

708b19eb

Fix `weights_only` (#28725) · a638de19
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a638de19

Initialize _tqdm_active with hf_hub_utils.are_progress_bars_disabled(… (#28717) · d6ac8f4a

Shukant Pal authored 1 year ago

Initialize _tqdm_active with hf_hub_utils.are_progress_bars_disabled() to respect HF_HUB_DISABLE_PROGRESS_BARS

It seems like enable_progress_bar() and disable_progress_bar() sync up with huggingface_hub, but the initial value is always True. This changes will make sure the user's preference is respected implicity on initialization.

d6ac8f4a

[`docs`] Update preprocessing.md (#28719) · 3a46e30d

D authored 1 year ago

* Update preprocessing.md

adjust ImageProcessor link to working target (same as in lower section of file)

* Update preprocessing.md

3a46e30d

fix: corrected misleading log message in save_pretrained function (#28699) · 1f47a24a
Turetskii Mikhail authored 1 year ago

1f47a24a

support PeftMixedModel signature inspect (#28321) · bbe30c69

Facico authored 1 year ago


* support PeftMixedModel signature inspect

* import PeftMixedModel only peft>=0.7.0

* Update src/transformers/trainer.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/trainer.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/trainer.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/trainer.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/trainer.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/trainer.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* fix styling

* Update src/transformers/trainer.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/trainer.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* style fixup

* fix note

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

bbe30c69

Fix duplicate & unnecessary flash attention warnings (#28557) · 8eb74c1c

fxmarty authored 1 year ago


* fix duplicate & unnecessary flash warnings

* trigger ci

* warning_once

* if/else order

---------

Co-authored-by: Your Name <you@example.com>

8eb74c1c

Don't fail when `LocalEntryNotFoundError` during `processor_config.json` loading (#28709) · 142ce683
Yih-Dar authored 1 year ago
```
* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
142ce683

25 Jan, 2024 6 commits

[`docs`] Improve visualization for vertical parallelism (#28583) · 28751958

Peter Götz authored 1 year ago

The documentation says "We refer to this Model parallelism as “Vertical” because of how models are typically visualized.", but then visualizes the model horizontally. This change visualizes the model indeed vertically.

28751958

[`Vilt`] align input and model dtype in the ViltPatchEmbeddings forward pass (#28633) · 4cbd876e
Fanli Lin authored 1 year ago
```
align dtype
```
4cbd876e

Update question_answering.md (#28694) · 24f1a00e

Yusuf authored 1 year ago

fix typo:

from:

 "model = TFAutoModelForQuestionAnswering("distilbert-base-uncased")"

to:
model = TFAutoModelForQuestionAnswering.from_pretrained("distilbert-base-uncased")

24f1a00e

Improve Backbone API docs (#28666) · 20000956
Merve Noyan authored 1 year ago
```
Update backbones.md
```
20000956
[`chore`] Add missing space in warning (#28695) · 7fa4b36e
Tom Aarsen authored 1 year ago
```
Add missing space in warning
```
7fa4b36e

Add Depth Anything (#28654) · 963db81a

NielsRogge authored 1 year ago

* First draft

* More improvements

* More improvements

* More improvements

* More improvements

* Add docs

* Remove file

* Add copied from

* Address comments

* Address comments

* Address comments

* Fix style

* Update docs

* Convert all checkpoints, add integration test

* Rename checkpoints

* Add pretrained backbone attributes

* Fix default config

* Address comment

* Add figure to docs

* Fix bug thanks to @xenova

* Update conversion script

* Fix integration test

963db81a

24 Jan, 2024 6 commits

[docs] Fix doc format (#28684) · f40b87de
Steven Liu authored 1 year ago
```
* fix hfoptions

* revert changes to other files

* fix
```
f40b87de

improve efficient training on CPU documentation (#28646) · 8278b153

Fanli Lin authored 1 year ago


* update doc

* revert

* typo fix

* refine

* add dtypes

* Update docs/source/en/perf_train_cpu.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/perf_train_cpu.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/perf_train_cpu.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* no comma

* use avx512-vnni

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

8278b153

Improved type hinting for all attention parameters (#28479) · 5d29530e

nakranivaibhav authored 1 year ago

* Changed type hinting for all attention inputs to 'Optional[Tuple[torch.FloatTensor,...]] = None'

* Fixed the ruff formatting issue

* fixed type hinting for all hidden_states to 'Optional[Tuple[torch.FloatTensor, ...]] = None'

* Changed type hinting in these 12 scripts modeling_dpr.py,modeling_nat.py,idefics/vision.py,modeling_tf_dpr.py,modeling_luke.py,modeling_swin.py,modeling_tf_swin.py,modeling_blip.py,modeling_tf_blip.py,modeling_donut_swin.py,modeling_dinat.py,modeling_swinv2.py

* test fail update

* fixed type hinting for these 15 scripts modeling_xlnet.py,modeling_tf_xlnet.py,modeling_led.py,modeling_tf_led.py,modleing_rwkv.py,modeling_dpt.py,modeling_tf_cvt.py,modeling_clip.py,modeling_flax_clip.py,modeling_tf_clip.py,modeling_longformer.py,modeling_tf_longformer.py,modeling_siglip.py,modeling_clap.py,modeling_git.py

* Changed type hinting in these 12 scripts modeling_dpr.py,modeling_nat.py,idefics/vision.py,modeling_tf_dpr.py,modeling_luke.py,modeling_swin.py,modeling_tf_swin.py,modeling_blip.py,modeling_tf_blip.py,modeling_donut_swin.py,modeling_dinat.py,modeling_swinv2.py

* test fail update

* Removed the myvenv file

* Fixed type hinting for these 8 scripts modeling_tvlt.py,modeling_sam.py,modeling_tf_sam.py,modeling_tvp.py,modeling_rag.py,modeling_tf_rag.py,modeling_tf_xlm.py,modeling_xlm.py

5d29530e

[docs] DeepSpeed (#28542) · 738ec75c

Steven Liu authored 1 year ago

* config

* optim

* pre deploy

* deploy

* save weights, memory, troubleshoot, non-Trainer

* done

738ec75c

Add back in generation types (#28681) · bb6aa8bc
amyeroberts authored 1 year ago

bb6aa8bc

Use save_safetensor to disable safe serialization for XLA (#28669) · 0549000c

jeffhataws authored 1 year ago

* Use save_safetensor to disable safe serialization for XLA

https://github.com/huggingface/transformers/issues/28438

* Style fixup

0549000c