- 24 Apr, 2024 14 commits
-
ydshieh authored
-
ydshieh authored
-
ydshieh authored
-
ydshieh authored
-
Marc Sun authored
* fix jamba slow forward for multi-gpu
* remove comm
* oops
* style
-
Anton Vlasjuk authored
* fix clip's/siglip's _init_weights to reflect linear layers in "for image classification"
* trigger slow tests
-
Fanli Lin authored
* make device-agnostic
* clean code
-
Arthur authored
* nit
* nit and fmt skip
* fixup
* Update src/transformers/convert_slow_tokenizer.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* set to true
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Pavel Iakubovskii authored
* Add test for square image that fails
* Fix for square images
* Extend test cases
* Fix resizing in tests
* Style fixup
-
Arthur authored
* nuke
* add co-author
* add co-author
* update card
* fixup and fix copies to please our ci
* nit fixup
* super small nits
* remove tokenizer_path from call to `write_model`
* always safe serialize by default
---------
Co-authored-by: pcuenca <pcuenca@users.noreply.github.com>
Co-authored-by: xenova <xenova@users.noreply.github.com>
-
Yih-Dar authored
* You should not pass Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Lysandre Debut authored
Remove mentions of models in the READMEs and link to the documentation page in which they are featured. (#30420)
* READMEs
* READMEs v2
-
Lysandre Debut authored
* Remove add-new-model in favor of add-new-model-like
* nits
-
Lysandre Debut authored
-
- 23 Apr, 2024 16 commits
-
Arthur authored
* push legacy to fast as well
* super strange
* Update src/transformers/convert_slow_tokenizer.py
* make sure we are BC
* fix Llama test
* nit
* revert
* more test
* style
* update
* small update w.r.t tokenizers
* nit
* don't split
* lol
* add a test for `add_prefix_space=False`
* fix gemma tokenizer as well
* update
* fix gemma
* nicer failures
* fixup
* update
* fix the example for legacy = False
* use `huggyllama/llama-7b` for the PR doctest
* nit
* use from_slow
* fix llama
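For readers of this change, a minimal sketch of the `add_prefix_space=False` behavior it tests, assuming the keyword is accepted at load time as this PR adds (the `from_slow=True` flag forces a fresh conversion from the slow tokenizer and is an assumption here):

```python
from transformers import AutoTokenizer

# Hedged usage sketch: load the fast Llama tokenizer without the
# automatic leading space, forcing a fresh conversion from the slow one.
tokenizer = AutoTokenizer.from_pretrained(
    "huggyllama/llama-7b",
    add_prefix_space=False,
    from_slow=True,
)

# With add_prefix_space=False, "Hello" at the start of a string should
# tokenize the same way as it does mid-sentence.
print(tokenizer.tokenize("Hello world"))
```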
-
Jiewen Tan authored
* Fix use_cache for xla fsdp
* Fix linters
-
Steven Basart authored
`torch.run` does not exist anywhere as far as I can tell.
-
Matt authored
* Remove old TF port guide
* repo-consistency
* Remove some translations as well for consistency
* Remove some translations as well for consistency
-
Yih-Dar authored
* fix
* try suggestion
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
Update Dockerfile
-
Pedro Cuenca authored
-
Wing Lian authored
* fix for itemsize => element_size() for torch backwards compat
* improve handling of element counting
* Update src/transformers/modeling_utils.py
* fixup
* Update src/transformers/modeling_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Younes Belkada <younesbelkada@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
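For context on the first item: `Tensor.itemsize` is an attribute that only exists in recent PyTorch releases, while the `element_size()` method is available on older versions too. A minimal sketch of the portable way to count parameter memory (the toy model is illustrative):

```python
import torch
from torch import nn

model = nn.Linear(16, 4)  # illustrative toy model

# element_size() is a method available on old and new torch releases,
# whereas the Tensor.itemsize attribute was only added recently.
total_bytes = sum(p.numel() * p.element_size() for p in model.parameters())
print(f"parameter memory: {total_bytes} bytes")
```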
-
Raushan Turganbay authored
* clean commit history I hope
* get kv seq length correctly
* PR suggestions
* Update src/transformers/testing_utils.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* add comment
* give gpt bigcode its own overridden method
* remove code
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
-
Joao Gante authored
scipy pin for jax
-
Fanli Lin authored
* add cuda flag
* check for sdpa
* add bitsandbytes
-
Nick Doiron authored
fix: link to HF repo tree when a file is missing
-
Russell Klopfer authored
-
Eduardo Pacheco authored
* Added cross attention support
* Fixed dtypes
* Fixed assumption
* Moved to decoder
-
Raushan Turganbay authored
* Add inputs embeds in generation
* always scale embeds
* fix-copies
* fix failing test
* fix copies once more
* remove embeds for models with scaling
* second try to revert
* codestyle
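A hedged sketch of what generating from `inputs_embeds` looks like, assuming a decoder-only checkpoint (the model name is illustrative):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "gpt2"  # illustrative checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("The capital of France is", return_tensors="pt")
# Build the prompt embeddings by hand, then let generate() continue from
# them; the returned ids contain only the newly generated tokens.
embeds = model.get_input_embeddings()(inputs.input_ids)
out = model.generate(inputs_embeds=embeds, max_new_tokens=10)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```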
-
Arthur authored
-
- 22 Apr, 2024 10 commits
-
Steven Liu authored
* first draft
* feedback
* static cache snippet
* feedback
* feedback
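Since one item mentions a static cache snippet, here is a hedged sketch of enabling it, assuming a checkpoint whose architecture supports the static KV cache (the model name is illustrative and gated):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # illustrative; needs static-cache support
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# The "static" implementation pre-allocates a fixed-size KV cache so tensor
# shapes stay constant across decoding steps (useful with torch.compile).
model.generation_config.cache_implementation = "static"

inputs = tokenizer("Hello", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=8)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```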
-
zhong zhuang authored
* [FEAT]: EETQ quantizer support
* Update quantization.md
* Update docs/source/en/main_classes/quantization.md Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update docs/source/en/quantization.md Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update docs/source/en/quantization.md Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/integrations/__init__.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/integrations/__init__.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/integrations/eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/integrations/eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/integrations/eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/eetq_integration/test_eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/quantizers/auto.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/quantizers/auto.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/quantizers/auto.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/quantizers/quantizer_eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/eetq_integration/test_eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/quantizers/quantizer_eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/eetq_integration/test_eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update tests/quantization/eetq_integration/test_eetq.py Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* [FEAT]: EETQ quantizer support
* [FEAT]: EETQ quantizer support
* remove whitespaces
* update quantization.md
* style
* Update docs/source/en/quantization.md Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
* add copyright
* Update quantization.md
* Update docs/source/en/quantization.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update docs/source/en/quantization.md Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Address the comments by amyeroberts
* style
---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Marc Sun <marc@huggingface.co>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
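For orientation, a hedged sketch of what the new quantizer enables, assuming the `EetqConfig` class this PR adds, the eetq kernel package installed, and a CUDA GPU (the checkpoint is illustrative):

```python
from transformers import AutoModelForCausalLM, EetqConfig

# int8 weight-only quantization through the EETQ kernels.
quantization_config = EetqConfig("int8")

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # illustrative checkpoint
    device_map="auto",
    quantization_config=quantization_config,
)
```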
-
Kamil Akesbi authored
* add sdpa to wav2vec. Co-authored-by: kamilakesbi <kamil@huggingface.co> Co-authored-by: jp1924 <jp42maru@gmail.com>
* add fa2 to wav2vec2
* add tests
* fix attention_mask compatibility with fa2
* minor dtype fix
* replace fa2 slow test
* fix fa2 slow test
* apply code review + add fa2 batch test
* add sdpa and fa2 to hubert
* sdpa and fa2 to data2vec_audio
* sdpa and fa2 to Sew
* sdpa to unispeech + unispeech sat
* small fix
* attention mask in tests Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
* add_speedup_benchmark_to_doc
---------
Co-authored-by: kamil@huggingface.co <kamil.akesbi@gmail.com>
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
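A minimal sketch of opting into the new attention backends for these audio models, using the standard `attn_implementation` keyword (checkpoint illustrative; FA2 additionally needs the flash-attn package and a supported GPU):

```python
import torch
from transformers import Wav2Vec2Model

# SDPA routes attention through torch.nn.functional.scaled_dot_product_attention.
model = Wav2Vec2Model.from_pretrained(
    "facebook/wav2vec2-base-960h",  # illustrative checkpoint
    attn_implementation="sdpa",
    torch_dtype=torch.float16,
)

# Flash Attention 2 variant (requires flash-attn and a recent GPU):
# model = Wav2Vec2Model.from_pretrained(
#     "facebook/wav2vec2-base-960h",
#     attn_implementation="flash_attention_2",
#     torch_dtype=torch.float16,
# )
```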
-
Younes Belkada authored
pass device correctly to peft
-
Pavel Iakubovskii authored
* Add class_embed to tied weights for DETA
* Fix test_tied_weights_keys for DETA model
* Replace error raise with assert statement
-
Joao Gante authored
fix test
-
hoshi-hiyouga authored
Update optimization.py
-
Matt authored
* stash commit (will discard all of this)
* stash commit
* First commit - needs a lot of testing!
* Add a test
* Fix imports and make the tests actually test something
* Tests pass!
* Rearrange test
* Add comments (but it's still a bit confusing)
* Stop storing the tokenizer
* Comment fixup
* Fix for input_ids with a single sequence
* Update tests to test single sequences
* make fixup
* Fix incorrect use of isin()
* Expand tests to catch more cases
* Expand tests to catch more cases
* make fixup
* Fix length calculation and update tests
* Handle Ġ as a space replacement too
* Update src/transformers/generation/stopping_criteria.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Add optimizations from Joao's suggestion
* Remove TODO
* Update src/transformers/generation/stopping_criteria.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* Update tests/generation/test_stopping_criteria.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* make fixup
* Rename some variables and remove some debugging clauses for clarity
* Add tests for the sub-methods
* Clarify one test slightly
* Add stop_strings to GenerationConfig
* generate() supports stop_string arg, asks for tokenizer if not provided
* make fixup
* Cleanup code and rename variables for clarity
* Update tokenizer error
* Update tokenizer passing, handle generation on GPU
* Slightly more explanation cleanup
* More comment cleanup
* Factor out the token cleanup so it's more obvious what we're doing, and we can change it later
* Careful with that cleanup!
* Cleanup + optimizations to _get_matching_positions
* More minor performance tweaks
* Implement caching and eliminate some expensive ops (startup time: 200ms -> 9ms)
* Remove the pin_memory call
* Parallelize across all stop strings!
* Quick fix for tensor devices
* Update embeddings test for the new format
* Fix test imports
* Manual patching for BERT-like tokenizers
* Return a bool vector instead of a single True/False
* Better comment
* Better comment
* Add tests from @zucchini-nlp
* Amy's list creation nit
* tok_list -> token_list
* Push a big expanded docstring (should we put it somewhere else?)
* Expand docstrings
* Docstring fixups
* Rebase
* make fixup
* Make a properly general method for figuring out token strings
* Fix naming throughout the functions
* Move cache, refactor, fix tests
* Add comment
* Remove finished TODO
* Remove finished TODO
* make fixup
* Update src/transformers/generation/stopping_criteria.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* Update and shorten docstring
* Update tests to be shorter/clearer and test specific cases
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
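The user-facing change here is the `stop_strings` generation argument; a short sketch of its use (checkpoint illustrative; the tokenizer must be passed so the stop strings can be matched against token sequences):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "gpt2"  # illustrative checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("The quick brown fox", return_tensors="pt")
# Generation halts as soon as any of the stop strings appears in the text;
# generate() asks for the tokenizer if it is not provided.
out = model.generate(
    **inputs,
    max_new_tokens=50,
    stop_strings=["\n", "."],
    tokenizer=tokenizer,
)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```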
-
Matt authored
* Update docstrings for text generation pipeline
* Fix docstring arg
* Update docstring to explain chat mode
* Fix doctests
* Fix doctests
-
Arthur authored
* nit to make sure cache positions are not sliced
* fix other models
* nit
* style
-