Commits · 32b08742a58b43a5a905a28e434b8f67321be024 · 某某某 / transformers-new

13 Apr, 2023 9 commits
- `DocumentQuestionAnsweringPipeline` only for fast ⚡ tokenizers (#22745) · 32b08742
  Yih-Dar authored 2 years ago
```
* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  32b08742
- 🌐 [i18n-KO] Translated `training.mdx` to Korean (#22670) · 4def2fe9
  Gabriel Yang authored 2 years ago
```
translate training doc to Korean
```
  4def2fe9
- Change `torch_dtype` to `str` when `saved_model=True` in `save_pretrained` for TF models (#22740) · 7df13432
  Yih-Dar authored 2 years ago
```
* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  7df13432
- [Pix2struct] Simplify generation (#22527) · 8eb38f63
  NielsRogge authored 2 years ago
```
* Add model to doc tests

* Remove generate and replace by prepare_inputs_for_generation

* More fixes

* Remove print statements

* Update integration tests

* Fix generate

* Remove model from auto mapping

* Use auto processor

* Fix integration tests

* Fix test

* Add inference code snippet

* Remove is_encoder_decoder

* Update docs

* Remove notebook link
```
  8eb38f63
- Make vilt, switch_transformers compatible with model parallelism (#22703) · 95e70575
  Rinat authored 2 years ago
```
* Update modeling_vilt.py

Vilt compatible with model parallelism

* Update modeling_switch_transformers.py

switch_transformers compatible with model parallelism
```
  95e70575
- Indexing fix for gpt_bigcode (#22737) · 89087597
  Joel Lamy-Poirier authored 2 years ago
```
Fix indexing
```
  89087597
- [Doctest] Add configuration_mvp.py (#22735) · 7ade6ef7
  Elabonga Atuo authored 2 years ago
```
* added configuration file for mvp model

* added configuration_mvp.py line to file
```
  7ade6ef7
- [Doctest] Add configuration_m2m_100.py (#22733) · 51007976
  Elabonga Atuo authored 2 years ago
```
m2m-100-config for doctest
```
  51007976
- v4.29.0.dev0 · 888c4a2a
  Sylvain Gugger authored 2 years ago
  
  888c4a2a
12 Apr, 2023 11 commits

Fix docstrings for TF BLIP (#22618) · 50f82e12

Matt authored 2 years ago

* Fix docstrings for TFBLIP

* Fix missing line in TF port!

* Use values from torch tests now other bugs fixed

* Use values from torch tests now other bugs fixed

* Fix doctest string

50f82e12

Update warning levels (#22727) · ce06e478

NielsRogge authored 2 years ago

* Use different level

* Remove futurewarning

* Use warning_once

* Update copies

ce06e478

add fast support and option (#22724) · 98581954

Arthur authored 2 years ago


* add fast support and option

* update based on review

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/llama/convert_llama_weights_to_hf.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* nit

* add print

* fixup

---------

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

98581954

`torch.distributed` group initialization for `torch_neuron` disabled when `optimum-neuron` is installed (#22728) · 10fab90f

Michael Benayoun authored 2 years ago

* Make the process group initialization not happen if optimum_neuron is installed

* Add warning

* Remove list and added warning

10fab90f

[tests] switch to torchrun (#22712) · 1306b7d3
Stas Bekman authored 2 years ago

1306b7d3

Modify pipeline_tutorial.mdx (#22726) · d87ef00c

ARKA1112 authored 2 years ago

`generator(model="openai/whisper-large")` always raises an error. As the error says, the generator expects an input, such as the .flac file above. The generator object also has no parameter called `model`. Some parameters, such as `batch_size`, can be passed to the generator, but I believe a model has to be passed when instantiating the pipeline, not as a parameter to the instance.

I believe the correct call should be:

generator = pipeline(model="openai/whisper-large", device=0)

d87ef00c

[`bnb`] Let's make serialization of int8 models possible (#22177) · 370f0ca1

Younes Belkada authored 2 years ago


* make serialization of int8 models possible

* make fixup

* add docs

* add ability to push to hub and save pretrained

* fixes

* more addition

* more tests

* fix issues

* change variable

* clearer message

* adapt from suggestions

* few fixes

* remove unused function

* Update src/transformers/utils/quantization_config.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address last comments

* last warning

* clarify doc

* protect import

* Update src/transformers/modeling_utils.py

* Apply suggestions from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

370f0ca1

add model resources for CPMAnt (new) (#20906) · 523ca4e0

pioliverse authored 2 years ago


* resolve conflicts

* rebase and make style

* test

* test

* test

* rebase and make style

* rebase and make style

* tests

* tests

* rewrite some functions

* rebase and make style

* fix load_tf_weights_in_cpmant

* reformat some unrelated files

* upgrade quality

* fix some bugs & docstring

* add models and tests

* solve conflicts

* resolve conflicts

* resolve conflicts

* resolve conflicts

* resolve conflicts

* tests

* resolve conflicts

* resolve conflicts

* fix load_tf_weights_in_cpmant

* reformat some unrelated files

* upgrade quality

* fix some bugs & docstring

* save resolution

* make style

* delete redefinition code

* reformat function

* reformat

* resolve conflicts

* resolve conflicts

* resolve conflicts

* resolve conflicts

* resolve conflicts

* tests

* resolve conflicts

* resolve conflicts

* fix load_tf_weights_in_cpmant

* reformat some unrelated files

* upgrade quality

* resolve conflicts

* resolve conflicts

* resolve conflicts

* resolve conflicts

* resolve conflicts

* fix load_tf_weights_in_cpmant

* reformat some unrelated files

* upgrade quality

* resolve conflicts

* make style

* fix bugs and refactor

* modify docstrings and make style

* unify import format in __init__.py

* fix import-altclp bug

* fix copies to update index.md

* fix unused config parameters

* fix unused config parameters

* fix unused config parameters

* update README_ja.md

* dummy commit for unit test

* fix attention mask

* add CPMAntTokenizer&-Fast to auto-mapping

* drop redundant changes in README_ko

* fix defaults in docstring

* fix use_cache and some docstring

* add missing args in tokenizer

* modify tester inheritance

* add is_jieba_available

* fix some bugs

* make style and fix-copies

* add doctests

* skip integration tests

* add is_jieba_available

* fix bugs in common tests

* adjust docstrings and make style

* add argument docstring

* adjust code to some specifications

* make style and fix-copies

* add fast tokenization test

* dummy commit for unit test

* dummy commit for unit test

* dummy commit for unit test

* normalize some comments and names

* Bert->CPMAnt

* camel names and drop redundant codes

* make style and fix-copies

* add CpmTokenizerFast _import_structure

* drop cpmanttokenizerfast in model_doc

* fix some problems

* fix CPMAnt tokenization for common test

* make style and fixup

* fix copies and fixup

* fix bugs in tokenization test

* dummy commit for connection failure in unittest

* fix copies

* drop trailing comma

* fix decorator in tests

* dummy commit for connection failure in unittest

---------

Co-authored-by: Gong Baitao <gongbaitao11@gmail.com>

523ca4e0

Added parallel device usage for GPT-J (#22713) · 17503b00
jprivera44 authored 2 years ago

17503b00
remove wrong doc in readme (#22723) · b76e6ebd
Arthur authored 2 years ago

b76e6ebd
Update input values for docstring (#22631) · 5a71977b
amyeroberts authored 2 years ago

5a71977b

11 Apr, 2023 7 commits
- Fix decorator order (#22708) · fe1f5a63
  Yih-Dar authored 2 years ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  fe1f5a63
- Replace -100s in predictions by the pad token (#22693) · 1b1867d8
  Sylvain Gugger authored 2 years ago
```
* Replace -100s in predictions by the pad token

* Style

* Try to catch them all
```
  1b1867d8
- Remove 2 failing ONNX conversion tests (#22660) · ff73deeb
  Yih-Dar authored 2 years ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  ff73deeb
- Clarify stride option (#22684) · 06b05d45
  Luc CAILLIAU authored 2 years ago
```
* Clarify stride option

* formatting
```
  06b05d45
- Enable naive Pipeline Parallelism training for Gpt neox japanese and san japanese (#22702) · 0224aaf6
  Mayank Agarwal authored 2 years ago
```
Move labels to same device as logits
```
  0224aaf6
- Make it easier to develop without a dev install (#22697) · 28c19ab5
  Sylvain Gugger authored 2 years ago
```
* Make it easier to develop without a dev install

* Remove ugly hack that doesn't work anyway
```
  28c19ab5
- Update some `MarkupLM` tests' expected values (#22667) · 4c01231e
  Yih-Dar authored 2 years ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  4c01231e
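
The "Replace -100s in predictions by the pad token" commit (#22693) above names a small post-processing step. The sketch below illustrates that idea in plain Python. It is not the code from the commit: the function name `replace_ignore_index`, the sample batch, and the pad id `0` are all hypothetical. The assumed motivation is that `-100` is a masking value rather than a real token id, so it has to be swapped out before the ids can be decoded.

```python
IGNORE_INDEX = -100  # conventional "ignore this position" value; not a real token id


def replace_ignore_index(predictions, pad_token_id, ignore_index=IGNORE_INDEX):
    """Return a copy of `predictions` with every `ignore_index` entry replaced by `pad_token_id`."""
    return [
        [pad_token_id if token == ignore_index else token for token in row]
        for row in predictions
    ]


# Hypothetical batch of predicted token ids, padded with -100.
batch = [[12, 7, -100, -100], [5, -100, 9, 3]]
cleaned = replace_ignore_index(batch, pad_token_id=0)
print(cleaned)  # [[12, 7, 0, 0], [5, 0, 9, 3]]
```

The function builds new lists and leaves the input batch unmodified, so the raw predictions remain available for inspection.
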
10 Apr, 2023 8 commits

Model parallelism: Moving labels to same devices as the logits are (#22691) · 151425dd
Shahad Mahmud authored 2 years ago
```
Model parallelism correct labels device
```
151425dd
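
A minimal sketch of the pattern named in this and the neighbouring model-parallelism commits, assuming PyTorch is installed. When a model's layers are spread across several devices, the logits end up on the device of the last layer while the labels may still be on another one, so the labels are moved before the loss is computed. The function name `loss_on_logits_device` and the tensor shapes are illustrative and not taken from the repository.

```python
import torch
import torch.nn.functional as F


def loss_on_logits_device(logits: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Compute cross-entropy after moving `labels` to the device that holds `logits`."""
    # Under model parallelism the two tensors may start on different devices.
    labels = labels.to(logits.device)
    # Flatten (batch, seq, vocab) -> (batch * seq, vocab) and (batch, seq) -> (batch * seq,).
    return F.cross_entropy(logits.view(-1, logits.size(-1)), labels.view(-1))


# Hypothetical shapes: batch of 2, sequence length 3, vocabulary of 5.
logits = torch.randn(2, 3, 5)
labels = torch.randint(0, 5, (2, 3))
loss = loss_on_logits_device(logits, labels)
```

On a single device the `.to(...)` call is a no-op, so the same code path works with or without model parallelism.
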

add GPTNeoXForSequenceClassification (#22671) · 6daa9cb5

Sugawara authored 2 years ago

* add GPTNeoXForSequenceClassification

* move the labels to logits.device (ref: #22561)

* fix

6daa9cb5

use __func__ to check can_generate (#22643) · f74b4020
xinhe authored 2 years ago

f74b4020
Fix quantization docs typo (#22666) · 14fc1a24
Kirill authored 2 years ago

14fc1a24
Make dynamic code work with offline mode (#22661) · 3876fc68
Sylvain Gugger authored 2 years ago
```
* Make dynamic code work with offline mode

* Clean up

* Quality
```
3876fc68
(feat): Moving labels to same device as logits for Deit (#22679) · 98597725
Shikhar Chauhan authored 2 years ago

98597725
Model parallelism: Moving labels to the same device as logits for BridgeTower models (#22676) · 870d91fb
Shahad Mahmud authored 2 years ago
```
BridgeTower Model parallelism logits device for loss calculation
```
870d91fb

Add GPTBigCode model (Optimized GPT2 with MQA from Santacoder & BigCode) (#22575) · e0921c6b

Joel Lamy-Poirier authored 2 years ago


* Add model with cli tool

* Remove unwanted stuff

* Add new code

* Remove inference runner

* Style

* Fix checks

* Test updates

* make fixup

* fix docs

* fix doc

* fix test

* hopefully fix pipeline tests

* refactor

* fix CIs

* add comment

* rename to `GPTBigCodeForCausalLM`

* correct readme

* make fixup + docs

* make fixup

* fixes

* fixes

* Remove pruning

* Remove import

* Doc updates

* More pruning removal

* Combine copies

* Single MQA implementation, remove kv cache pre-allocation and padding

* Update doc

* Revert refactor to match gpt2 style

* Merge back key and value caches, fix some type hints

* Update doc

* Fix position ids with padding (PR 21080)

* Add conversion script temporarily

* Update conversion script

* Remove checkpoint conversion

* New model

* Fix MQA test

* Fix copies

* try fix tests

* FIX TEST!!

* remove `DoubleHeadsModel`

* add MQA tests

* add slow tests

* clean up

* add CPU checker

* final fixes

* fixes

- fix GPU issue
- fixed slow tests
- skip disk offload

* fix final issue

* Simplify and comment baddbmm fix

* Remove unnecessary code

* Transpose tweaks

* Use beta=1 on cpu, improve tests

---------

Co-authored-by: younesbelkada <younesbelkada@gmail.com>

e0921c6b

07 Apr, 2023 5 commits

moved labels to the same device as logits for BLOOM, GPT Neo, GPT NeoX, RoBERTa and VIT models (#22663) · 656e869a

Arun Brahma authored 2 years ago

moved labels to the same device as logits

656e869a

Revert migration of setup to pyproject.toml (#22658) · 6db23af5
Sylvain Gugger authored 2 years ago

6db23af5
Generate: add API warning to streamers (#22659) · 3f96e0b4
Joao Gante authored 2 years ago
```
add API warning
```
3f96e0b4

[OPT] Fix default attention mask size (#22649) · f3341926

Arthur authored 2 years ago

* Fix default attention mask size

* fixup

* add a test to make sure the model works even if no attention mask is provided

* style

f3341926

[tokenization] do not push special file (#22657) · b1b3dc3e

Arthur authored 2 years ago


* do not push special file

* Update src/transformers/tokenization_utils_base.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b1b3dc3e