- 17 Apr, 2023 4 commits
-
-
fpgaminer authored
-
Jungnerd authored
fix: docs: ko: sagemaker anchors and `_toctree.yml`
Co-authored-by: Hyeonseo Yun <0525_hhgus@naver.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>
Co-authored-by: Na Yeon Han <nayeon2.han@gmail.com>
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
Na Yeon Han authored
docs: ko: translated `custom_models.mdx`
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Gabriel Yang <gabrielwithhappy@gmail.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
Yih-Dar authored
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 15 Apr, 2023 1 commit
-
-
bcol authored
-
- 14 Apr, 2023 12 commits
-
-
oscar-garzon authored
-
amyeroberts authored
* Indexing fix - CLIP checkpoint conversion
* Fix up
-
Joao Gante authored
-
Mayank Agarwal authored
* Fix word_ids hyperlink
* Add suggested fix
-
Matt authored
* If EOS is None, don't add it to sequences
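A minimal sketch of the guard this change describes, with illustrative names (not the actual `generate` internals):

```python
def finalize_sequence(token_ids, eos_token_id=None):
    """Append EOS only when one is actually defined."""
    # If EOS is None, don't add it to the sequence
    if eos_token_id is not None:
        token_ids = token_ids + [eos_token_id]
    return token_ids

print(finalize_sequence([101, 2023], eos_token_id=102))  # [101, 2023, 102]
print(finalize_sequence([101, 2023], eos_token_id=None))  # unchanged
```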
-
Sohyun Sim authored
* add ko preprocessing
* translate preprocessing.mdx to korean
* translate preprocessing.mdx
* Update preprocessing.mdx: fixed line 273 as below: "Also, we recommend adding the `sampling_rate` argument to the feature extractor to better debug any silent errors that may occur."
* translate Image part
* translated preprocess.mdx
* Update docs/source/ko/preprocessing.mdx (eight review suggestions, each co-authored by Wonhyeong Seo <wonhseo@kakao.com>)
* Update docs/source/ko/preprocessing.mdx (six further updates)
* fixed translation
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
Yih-Dar authored
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Alexander Ljungberg authored
Fixed string format; better tokenizer message. Before: `Saving a {tokenizer_class} to {tokenizer_path}` After: `Saving a LlamaTokenizerFast to outdir.`
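The bug pattern here is a format string missing its `f` prefix, so the placeholders were printed literally; a small sketch:

```python
tokenizer_class, tokenizer_path = "LlamaTokenizerFast", "outdir"

# Before: a plain string, so the braces are printed verbatim
print("Saving a {tokenizer_class} to {tokenizer_path}")
# -> Saving a {tokenizer_class} to {tokenizer_path}

# After: an f-string interpolates the values
print(f"Saving a {tokenizer_class} to {tokenizer_path}")
# -> Saving a LlamaTokenizerFast to outdir
```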
-
Joao Gante authored
-
Joao Gante authored
-
Sayak Paul authored
* add: tokenizer training script for TF TPU LM training.
* add: script for preparing the TFRecord shards.
* add: sequence of execution to readme.
* remove limit from the tfrecord shard name.
* Add initial train_model.py
* Add basic training arguments and model init
* Get up to the point of writing the data collator
* Pushing progress so far!
* Complete first draft of model training code
* feat: grouping of texts efficiently (see the sketch after this list).
  Co-authored-by: Matt <rocketknight1@gmail.com>
* Add proper masking collator and get training loop working
* fix: things.
* Read sample counts from filenames
* Draft README
* Improve TPU warning
* Use distribute instead of distribute.experimental
* Apply suggestions from code review
  Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
* Modularize loading and add MLM probability as arg
* minor refactoring to better use the cli args.
* readme fillup.
* include tpu and inference sections in the readme.
* table of contents.
* parallelize maps.
* polish readme.
* change script name to run_mlm.py
* address PR feedback (round I).
---------
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
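For the "grouping of texts efficiently" step, the usual approach in the LM example scripts is to concatenate tokenized documents and re-chunk them into fixed-size blocks; a sketch under that assumption (not the exact script code):

```python
def group_texts(examples, block_size=128):
    """Concatenate tokenized texts and split them into fixed-size blocks."""
    # Flatten each feature (e.g. input_ids) across all examples
    concatenated = {k: sum(examples[k], []) for k in examples}
    total_length = len(concatenated[next(iter(examples))])
    # Drop the ragged tail so every block has exactly block_size tokens
    total_length = (total_length // block_size) * block_size
    return {
        k: [t[i : i + block_size] for i in range(0, total_length, block_size)]
        for k, t in concatenated.items()
    }

batch = {"input_ids": [[1, 2, 3], [4, 5, 6, 7, 8]]}
print(group_texts(batch, block_size=4))  # {'input_ids': [[1, 2, 3, 4], [5, 6, 7, 8]]}
```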
-
Hyeonseo Yun authored
* docs: ko: init: tasks/sequence_classification.mdx
* docs: ko: revised: change voca in tasks/sequence_classification.mdx
* docs: ko: revised: [RE] change voca in tasks/sequence_classification.mdx
* docs: ko: revised: spell check and sentence naturally in tasks/sequence_classification.mdx
* docs: ko: revised: spell check and consistent vocabulary in tasks/sequence_classification.mdx
* docs: ko: revised: Add full stop and change voca in tasks/sequence_classification.mdx
* docs: ko: revised: sync first section templates in tasks/sequence_classification.mdx
  Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
* fix: revert use of full-stops to colons; colons are used to emphasize the code block that follows
* @0525hhgus @wonhyeongseo docs: ko: revised: sync second section templates in tasks/sequence_classification.mdx
  Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>
* docs: ko: revised: change 'train', 'finetuning' in tasks/sequence_classification.mdx
---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
-
- 13 Apr, 2023 15 commits
-
-
Yih-Dar authored
* fix
* style
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Ruiyang Sun authored
Bug in LlamaTokenizer (#22742)
-
Stas Bekman authored
* [trainer] update url
* style
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Gabriel Yang authored
translate training doc to Korean
-
Yih-Dar authored
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* Add model to doc tests
* Remove generate and replace by prepare_inputs_for_generation
* More fixes
* Remove print statements
* Update integration tests
* Fix generate
* Remove model from auto mapping
* Use auto processor
* Fix integration tests
* Fix test
* Add inference code snippet
* Remove is_encoder_decoder
* Update docs
* Remove notebook link
-
Rinat authored
* Update modeling_vilt.py: make ViLT compatible with model parallelism
* Update modeling_switch_transformers.py: make SwitchTransformers compatible with model parallelism
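Model-parallelism fixes of this kind typically come down to creating or moving tensors onto the device of the activations they interact with, instead of assuming a single device; an illustrative sketch (not the actual diff):

```python
import torch

def extend_attention_mask(attention_mask, hidden_states):
    # Move the mask to the same device (and dtype) as the hidden states,
    # which may live on a different GPU when the model is sharded
    attention_mask = attention_mask.to(hidden_states.device)
    mask = attention_mask[:, None, None, :].to(hidden_states.dtype)
    return (1.0 - mask) * torch.finfo(hidden_states.dtype).min

hidden = torch.randn(2, 4, 8)
mask = torch.ones(2, 4)
print(extend_attention_mask(mask, hidden).shape)  # torch.Size([2, 1, 1, 4])
```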
-
Joel Lamy-Poirier authored
Fix indexing
-
Elabonga Atuo authored
* added configuration file for mvp model
* added configuration_mvp.py line to file
-
Elabonga Atuo authored
m2m-100-config for doctest
-
Sylvain Gugger authored
-
- 12 Apr, 2023 8 commits
-
-
Matt authored
* Fix docstrings for TFBLIP
* Fix missing line in TF port!
* Use values from torch tests now other bugs fixed
* Fix doctest string
-
NielsRogge authored
* Use different level
* Remove futurewarning
* Use warning_once
* Update copies
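`warning_once` is the `transformers` logging helper that emits a given message only the first time it is seen, which suits warnings raised on every forward pass; a sketch (the deprecated-argument scenario is made up):

```python
from transformers.utils import logging

logger = logging.get_logger(__name__)

def forward(use_old_arg=False):
    if use_old_arg:
        # Logged once per process, no matter how many times forward() runs
        logger.warning_once("`use_old_arg` is deprecated (hypothetical example).")

for _ in range(3):
    forward(use_old_arg=True)  # the warning appears a single time
```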
-
Arthur authored
* add fast support and option
* update based on review
* Apply suggestions from code review
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/models/llama/convert_llama_weights_to_hf.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* nit
* add print
* fixup
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Michael Benayoun authored
`torch.distributed` group initialization for `torch_neuron` disabled when `optimum-neuron` is installed (#22728)
* Make the process group initialization not happen if optimum_neuron is installed
* Add warning
* Remove list and added warning
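A sketch of the guard described here, assuming the check is an import-availability probe (the function name and warning text are illustrative):

```python
import importlib.util

import torch

def is_optimum_neuron_available():
    # Check the parent package first so find_spec never raises
    return (
        importlib.util.find_spec("optimum") is not None
        and importlib.util.find_spec("optimum.neuron") is not None
    )

def init_process_group_if_needed(backend="gloo"):
    # When optimum-neuron is installed it manages the torch_neuron
    # process group itself, so skip our own initialization and warn.
    if is_optimum_neuron_available():
        print("Warning: optimum-neuron detected; skipping process group init.")
        return
    if torch.distributed.is_available() and not torch.distributed.is_initialized():
        torch.distributed.init_process_group(backend=backend)
```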
-
Stas Bekman authored
-
ARKA1112 authored
`generator(model="openai/whisper-large")` always returns an error. As the error says, the generator expects an input, just like the `.flac` file above. The generator object also has no parameter called `model`. There are parameters that can be passed to the generator, such as `batch_size`, but to specify a model, the parameter has to be passed while instantiating the pipeline, not as an argument to the instance. I believe the correct call should be: `generator = pipeline(model="openai/whisper-large", device=0)`
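In other words, the model goes to the `pipeline()` constructor, and the instance is then called on the input; for example (the audio path is a placeholder):

```python
from transformers import pipeline

# The model is chosen when the pipeline is instantiated...
generator = pipeline(model="openai/whisper-large", device=0)

# ...and the instance is called on the actual input, e.g. an audio file
result = generator("audio.flac")  # placeholder path
print(result["text"])
```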
-
Younes Belkada authored
* make serialization of int8 models possible
* make fixup
* add docs
* add ability to push to hub and save pretrained
* fixes
* more addition
* more tests
* fix issues
* change variable
* clearer message
* adapt from suggestions
* few fixes
* remove unused function
* Update src/transformers/utils/quantization_config.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* address last comments
* last warning
* clarify doc
* protect import
* Update src/transformers/modeling_utils.py
* Apply suggestions from code review
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
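With this change, an 8-bit model loaded via `bitsandbytes` can be written back out with the usual `save_pretrained`/`push_to_hub` calls; a sketch (the checkpoint and repo names are placeholders, and a CUDA device plus `bitsandbytes` are assumed):

```python
from transformers import AutoModelForCausalLM

# Load quantized to int8 (requires bitsandbytes and a CUDA device)
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m",  # placeholder checkpoint
    load_in_8bit=True,
    device_map="auto",
)

# Serialization of the int8 weights now works
model.save_pretrained("opt-350m-int8")
model.push_to_hub("my-username/opt-350m-int8")  # placeholder repo id
```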
-
pioliverse authored
* resolve conflicts / rebase and make style (repeated many times across rebases)
* test / tests (repeated)
* rewrite some functions
* fix load_tf_weights_in_cpmant (repeated)
* reformat some unrelated files (repeated)
* upgrade quality (repeated)
* fix some bugs & docstring (repeated)
* add models and tests
* save resolution
* make style
* delete redefinition code
* reformat function
* reformat
* fix bugs and refactor
* modify docstrings and make style
* unify import format in __init__.py
* fix import-altclp bug
* fix copies to update index.md
* fix unused config parameters (x3)
* update README_ja.md
* fix attention mask
* add CPMAntTokenizer and CPMAntTokenizerFast to auto-mapping
* drop redundant changes in README_ko
* fix defaults in docstring
* fix use_cache and some docstring
* add missing args in tokenizer
* modify tester inheritance
* add is_jieba_available (x2)
* fix some bugs
* make style and fix-copies (repeated)
* add doctests
* skip integration tests
* fix bugs in common tests
* adjust docstrings and make style
* add argument docstring
* adjust code to some specifications
* add fast tokenization test
* dummy commit for unit test (x4)
* normalize some comments and names
* Bert->CPMAnt
* camel names and drop redundant codes
* add CpmTokenizerFast _import_structure
* drop cpmanttokenizerfast in model_doc
* fix some problems
* fix CPMAnt tokenization for common test
* make style and fixup
* fix copies and fixup
* fix bugs in tokenization test
* dummy commit for connection failure in unittest (x2)
* fix copies
* drop trailing comma
* fix decorator in tests
---------
Co-authored-by: Gong Baitao <gongbaitao11@gmail.com>
-