- 21 Sep, 2024 2 commits
-
Avishai Elmakies authored
* add sdpa to dinov2
* fixup
* add dinov2 to sdpa doc
* update doc order
* [run-slow] dinov2
* common to eager
* [run-slow] dinov2
* update attn implementation in common
* update test_modeling_dinov2 to have mask_ratio, num_masks and mask_length similar to vit
* [run-slow] dinov2

---------

Co-authored-by: Avishai Elmakies <avishai.elma@cs.huji.ac.il>
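For context, the SDPA support added in this commit dispatches attention to `torch.nn.functional.scaled_dot_product_attention` instead of the eager softmax path. A minimal sketch (illustrative only, not Dinov2's actual code) of the equivalence the `common to eager` test checks:

```python
import torch
import torch.nn.functional as F

# Toy shapes: batch=1, heads=2, seq_len=4, head_dim=8
q, k, v = (torch.randn(1, 2, 4, 8) for _ in range(3))

# Eager attention: explicit softmax(QK^T / sqrt(d)) @ V
scale = q.shape[-1] ** -0.5
eager_out = torch.softmax(q @ k.transpose(-2, -1) * scale, dim=-1) @ v

# SDPA path: fused kernel, same default 1/sqrt(head_dim) scaling
sdpa_out = F.scaled_dot_product_attention(q, k, v)

print(torch.allclose(eager_out, sdpa_out, atol=1e-5))  # True
```

Models opt into this path via `attn_implementation="sdpa"` in `from_pretrained`, which is what the `[run-slow] dinov2` jobs exercise here.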
-
amyeroberts authored
* Update pixtral example checkpoint
* Fix typo
-
- 20 Sep, 2024 19 commits
-
Mayank Mishra authored
* first commit
* drop tokenizer
* drop tokenizer
* drop tokenizer
* drop convert
* granite
* drop tokenization test
* mup
* fix
* reformat
* reformat
* reformat
* fix docs
* stop checking for checkpoint
* update support
* attention multiplier
* update model
* tiny drop
* saibo drop
* skip test
* fix test
* fix test
* drop
* drop useless imports
* update docs
* drop flash function
* copied from
* drop pretraining tp
* drop pretraining tp
* drop pretraining tp
* drop unused import
* drop code path
* change name
* softmax scale
* head dim
* drop legacy cache
* rename params
* cleanup
* fix copies
* comments
* add back legacy cache
* multipliers
* multipliers
* multipliers
* text fix
* fix copies
* merge
* multipliers
* attention multiplier
* drop unused imports
* add granitemoe
* add decoration
* remove moe from sequenceclassification
* fix test
* fix
* fix
* fix
* move rope?
* merge
* drop bias
* drop bias
* Update src/transformers/models/granite/configuration_granite.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* Update src/transformers/models/granite/modeling_granite.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix
* fix
* fix
* fix
* drop
* drop
* fix
* fix
* cleanup
* cleanup
* fix
* fix granite tests
* fp32 test
* fix
* drop jitter
* fix
* rename
* rename
* fix config
* add gen test

---------

Co-authored-by: Yikang Shen <yikang.shn@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
jiqing-feng authored
* enable low-precision pipeline
* fix parameter for ASR
* reformat
* fix asr bug
* fix bug for zero-shot
* add dtype check
* rm useless comments
* add np.float16 check
* Update src/transformers/pipelines/image_classification.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* Update src/transformers/pipelines/token_classification.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
* fix comments
* fix asr check
* make fixup
* No more need for is_torch_available()

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
Co-authored-by: Matt <rocketknight1@gmail.com>
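The `add np.float16 check` bullet refers to pipeline postprocessing of half-precision model outputs. A minimal sketch (illustrative only, not the actual pipeline code) of the kind of dtype guard this change describes:

```python
import numpy as np

def to_full_precision(scores: np.ndarray) -> np.ndarray:
    """Upcast half-precision outputs before further numpy postprocessing,
    since some numpy ops and JSON serialization misbehave on float16."""
    if scores.dtype == np.float16:
        return scores.astype(np.float32)
    return scores

half = np.array([0.1, 0.7, 0.2], dtype=np.float16)
print(to_full_precision(half).dtype)  # float32
```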
-
litianjian authored
Co-authored-by: litianjian <litianjian@bytedance.com>
-
GeLee authored
* fix qwen2vl float16 inference bug
* [run-slow] qwen2_vl
-
Yih-Dar authored
* update
* re-enable daily CI

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix
* fix
* fix
* fix
* skip
* skip more

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Duc-Viet Hoang authored
* fix: handle padding in contrastive search for decoder-only models
* fix: handle padding in contrastive search for encoder-decoder models
* tests: move padding contrastive test to test_util, add t5 test
* fix: handle if model_kwargs["decoder_attention_mask"] is None
* refactor: improve padding input contrastive search generation tests
* chore: _ranking_fast to use LongTensor for cosine_matrix_mask
-
Yoni Gozlan authored
* add check and prepare args for BC to ProcessorMixin, improve ProcessorTesterMixin
* change size and crop_size in processor kwargs tests to do_rescale and rescale_factor
* remove unnecessary llava processor kwargs test overwrite
* nit
* change data_arg_name to input_name
* Remove unnecessary test override
* Remove unnecessary tests Paligemma
* Move test_prepare_and_validate_optional_call_args to TesterMixin, add docstring
-
Yih-Dar authored
fix missing tests

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Omar Salman authored
* Add sdpa for BioGpt
* Updates
* Add the docs
* [run_slow] biogpt
* Use the copy mechanism to ensure consistency
* [run_slow] biogpt
-
amyeroberts authored
Remove model tests
-
Joao Gante authored
almost zero is not zero
-
Lake Lee authored
* Update modeling_mamba2.py

  Fix pad_size calculation to ensure it's less than self.chunk_size
* [run_slow] mamba2
* [run-slow] mamba2
* [run-slow] Add @require_read_token decorator to failing tests for token propagation
* [run_slow] mamba2
-
Fanli Lin authored
* enable
* fix
* add xpu skip
* add marker
* skip for xpu
* add more
* enable on accelerator
* add more cases
* add more tests
* add more
-
Yih-Dar authored
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
- 19 Sep, 2024 14 commits
-
Pedro Cuenca authored
* Fix Llama 3 TikToken conversion
* No need to add tokens again
-
Fanli Lin authored
enable GemmaIntegrationTest
-
Fanli Lin authored
* enable
* fix
* add xpu skip
* add marker
* skip for xpu
* add more
* add one more
-
Yoni Gozlan authored
* Uniformize paligemma processor
* nit
-
Joao Gante authored
-
Anton Vlasjuk authored
* use kernel for dt calculations
* add small test
* [run-slow] mamba2
-
Vladislav Bronzov authored
* change sequence_bias type of SequenceBiasLogitsProcessor to list, add config tests for all processors
* fix format
* small fix for all_token_bias_pairs_are_valid internal func
* small typo fix in description
* improve test impl, some SequenceBiasLogitsProcessor refactoring
-
Joao Gante authored
check attention mask in generate
-
Pablo Montalvo authored
* add initial design for uniform processors + align model
* add uniform processors for altclip + chinese_clip
* fix mutable default 👀
* add configuration test
* handle structured kwargs w defaults + add test
* protect torch-specific test
* fix style
* fix
* rebase
* update processor to generic kwargs + test
* fix style
* add sensible kwargs merge
* update test
* fix assertEqual
* move kwargs merging to processing common
* rework kwargs for type hinting
* just get Unpack from extensions
* run-slow[align]
* handle kwargs passed as nested dict
* add from_pretrained test for nested kwargs handling
* [run-slow]align
* update documentation + imports
* update audio inputs
* protect audio types, silly
* try removing imports
* make things simpler
* simplerer
* move out kwargs test to common mixin
* [run-slow]align
* skip tests for old processors
* [run-slow]align, clip
* !$#@!! protect imports, darn it
* [run-slow]align, clip
* [run-slow]align, clip
* update common processor testing
* add altclip
* add chinese_clip
* add pad_size
* [run-slow]align, clip, chinese_clip, altclip
* remove duplicated tests
* fix
* update doc
* improve documentation for default values
* add model_max_length testing

  This parameter depends on tokenizers received.
* Raise if kwargs are specified in two places
* fix
* match defaults
* force padding
* fix tokenizer test
* clean defaults
* move tests to common
* remove try/catch block
* deprecate kwarg
* format
* add copyright + remove unused method
* [run-slow]altclip, chinese_clip
* clean imports
* fix version
* clean up deprecation
* fix style
* add corner case test on kwarg overlap
* resume processing - add Unpack as importable
* add tmpdirname
* fix altclip
* fix up
* add back crop_size to specific tests
* generalize tests to possible video_processor
* add back crop_size arg
* fixup overlapping kwargs test for qformer_tokenizer
* remove copied from
* fixup chinese_clip tests values
* fixup tests - qformer tokenizers
* [run-slow] altclip, chinese_clip
* remove prepare_image_inputs
-
Pablo Montalvo authored
* fix tests with main revision and read token
* [run-slow]mamba2
* test previously skipped tests
* [run-slow]mamba2
* skip some tests
* [run-slow]mamba2
* finalize tests
* [run-slow]mamba2
-
Joao Gante authored
-
Joao Gante authored
-
Raushan Turganbay authored
* add tests
* fix whisper
* update
* nit
* add qwen2-vl
* more updates!
* better this way
* fix this one
* fix more tests
* fix final tests, hope so
* fix led
* Update tests/generation/test_utils.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
* pr comments
* not pass pixels and extra for low-mem tests, very flaky because of vision tower

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
-
Raushan Turganbay authored
* load and save from video-processor folder
* Update src/transformers/models/llava_onevision/processing_llava_onevision.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 18 Sep, 2024 5 commits
-
Yoach Lacombe authored
* clean mimi commit
* some nits suggestions from Arthur
* make fixup
* rename repo id + change readme
* Update docs/source/en/model_doc/mimi.md

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* add flaky flag to batching equivalence due to audio_codes failing sometimes

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Marc Sun authored
-
Yoni Gozlan authored
* modify rt detr to improve inference times when compiled
* Remove redundant "to"
* Fix conditional lru_cache and missing shapes_list
* nit unnecessary list creation
* Fix compile error when ninja not available and custom kernel activated
-
Dominik Niedziela authored
* enforce original size to be a list
* formatting
* apply datatype change to unpad_image in llava_next
-
Matt authored
return attention mask in ASR pipeline
-