- 20 Sep, 2024 (16 commits)
-
GeLee authored
* fix qwen2vl float16 inference bug * [run-slow] qwen2_vl
-
Yih-Dar authored
* update
* re-enable daily CI
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix
* fix
* fix
* fix
* skip
* skip more
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Duc-Viet Hoang authored
* fix: handle padding in contrastive search for decoder-only models
* fix: handle padding in contrastive search for encoder-decoder models
* tests: move padding contrastive test to test_util, add t5 test
* fix: handle if model_kwargs["decoder_attention_mask"] is None
* refactor: improve padding input contrastive search generation tests
* chore: _ranking_fast to use LongTensor for cosine_matrix_mask
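For context on the padding fix above: contrastive search penalizes candidate tokens whose hidden states are too similar to the existing context, so with padded batches the padded positions must be excluded from that similarity. The snippet below is a rough standalone sketch of the masking idea, not the library's actual `_ranking_fast` code; all names are illustrative.

```python
import torch

def max_context_similarity(context_hidden, next_hidden, attention_mask):
    """Illustrative only: max cosine similarity of a candidate hidden state
    against the context, ignoring padded positions (mask == 0)."""
    # context_hidden: [batch, seq_len, dim], next_hidden: [batch, 1, dim]
    norm_context = context_hidden / context_hidden.norm(dim=2, keepdim=True)
    norm_next = next_hidden / next_hidden.norm(dim=2, keepdim=True)
    cosine = torch.bmm(norm_context, norm_next.transpose(1, 2)).squeeze(-1)  # [batch, seq_len]
    # Padded positions must never win the max, so push them to -inf.
    cosine = cosine.masked_fill(attention_mask == 0, float("-inf"))
    return cosine.max(dim=-1).values

scores = max_context_similarity(
    torch.randn(2, 5, 8),
    torch.randn(2, 1, 8),
    torch.tensor([[0, 0, 1, 1, 1], [1, 1, 1, 1, 1]]),
)
print(scores.shape)  # torch.Size([2])
```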
-
Yoni Gozlan authored
* add check and prepare args for BC to ProcessorMixin, improve ProcessorTesterMixin
* change size and crop_size in processor kwargs tests to do_rescale and rescale_factor
* remove unnecessary llava processor kwargs test overwrite
* nit
* change data_arg_name to input_name
* Remove unnecessary test override
* Remove unnecessary tests Paligemma
* Move test_prepare_and_validate_optional_call_args to TesterMixin, add docstring
-
Yih-Dar authored
fix missing tests
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Omar Salman authored
* Add sdpa for BioGpt
* Updates
* Add the docs
* [run_slow] biogpt
* Use the copy mechanism to ensure consistency
* [run_slow] biogpt
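As a usage note for the SDPA support added above: once a model has an SDPA attention path, it can be requested through the standard `attn_implementation` argument. A hedged sketch; the `microsoft/biogpt` checkpoint name is an assumption, not taken from the commit.

```python
# Hedged usage sketch for the SDPA support added above; the checkpoint
# name ("microsoft/biogpt") is an assumption, not part of the commit.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/biogpt")
model = AutoModelForCausalLM.from_pretrained(
    "microsoft/biogpt",
    attn_implementation="sdpa",  # use torch.nn.functional.scaled_dot_product_attention
)

inputs = tokenizer("COVID-19 is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```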
-
amyeroberts authored
Remove model tests
-
Joao Gante authored
almost zero is not zero
-
Lake Lee authored
* Update modeling_mamba2.py: fix pad_size calculation to ensure it's less than self.chunk_size
* [run_slow] mamba2
* [run-slow] mamba2
* [run-slow] Add @require_read_token decorator to failing tests for token propagation
* [run_slow] mamba2
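The pad_size fix above is about rounding the sequence length up to a multiple of the chunk size. A minimal sketch of that arithmetic (function and variable names are assumptions, not code from modeling_mamba2.py):

```python
# Minimal sketch (names assumed): padding needed to round seq_len up to a
# multiple of chunk_size; the outer modulo keeps the result strictly below chunk_size.
def compute_pad_size(seq_len: int, chunk_size: int) -> int:
    return (chunk_size - seq_len % chunk_size) % chunk_size

print(compute_pad_size(10, 8))  # 6 -> padded length 16
print(compute_pad_size(16, 8))  # 0 -> already a multiple, no padding
```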
-
Fanli Lin authored
* enable
* fix
* add xpu skip
* add marker
* skip for xpu
* add more
* enable on accelerator
* add more cases
* add more tests
* add more
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
- 19 Sep, 2024 (14 commits)
-
Pedro Cuenca authored
* Fix Llama 3 TikToken conversion * No need to add tokens again
-
Fanli Lin authored
enable GemmaIntegrationTest
-
Fanli Lin authored
* enable
* fix
* add xpu skip
* add marker
* skip for xpu
* add more
* add one more
-
Yoni Gozlan authored
* Uniformize paligemma processor * nit
-
Joao Gante authored
-
Anton Vlasjuk authored
* use kernel for dt calculations
* add small test
* [run-slow] mamba2
-
Vladislav Bronzov authored
* change sequence_bias type of SequenceBiasLogitsProcessor to list, add config tests for all processors
* fix format
* small fix for all_token_bias_pairs_are_valid internal func
* small typo fix in description
* improve test impl, some SequenceBiasLogitsProcessor refactoring
-
Joao Gante authored
check attention mask in generate
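For reference on the check above, `generate()` accepts an explicit attention mask, which matters for padded batches. A hedged sketch with an assumed `gpt2` checkpoint:

```python
# Hedged sketch (checkpoint name assumed): pass the tokenizer's attention
# mask to generate() so padded positions are handled explicitly.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default

inputs = tokenizer(["Hello there", "Hi"], return_tensors="pt", padding=True)
outputs = model.generate(
    inputs.input_ids,
    attention_mask=inputs.attention_mask,
    max_new_tokens=10,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```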
-
Pablo Montalvo authored
* add initial design for uniform processors + align model
* add uniform processors for altclip + chinese_clip
* fix mutable default 👀
* add configuration test
* handle structured kwargs w defaults + add test
* protect torch-specific test
* fix style
* fix
* rebase
* update processor to generic kwargs + test
* fix style
* add sensible kwargs merge
* update test
* fix assertEqual
* move kwargs merging to processing common
* rework kwargs for type hinting
* just get Unpack from extensions
* run-slow[align]
* handle kwargs passed as nested dict
* add from_pretrained test for nested kwargs handling
* [run-slow]align
* update documentation + imports
* update audio inputs
* protect audio types, silly
* try removing imports
* make things simpler
* simplerer
* move out kwargs test to common mixin
* [run-slow]align
* skip tests for old processors
* [run-slow]align, clip
* !$#@!! protect imports, darn it
* [run-slow]align, clip
* [run-slow]align, clip
* update common processor testing
* add altclip
* add chinese_clip
* add pad_size
* [run-slow]align, clip, chinese_clip, altclip
* remove duplicated tests
* fix
* update doc
* improve documentation for default values
* add model_max_length testing (this parameter depends on tokenizers received)
* Raise if kwargs are specified in two places
* fix
* match defaults
* force padding
* fix tokenizer test
* clean defaults
* move tests to common
* remove try/catch block
* deprecate kwarg
* format
* add copyright + remove unused method
* [run-slow]altclip, chinese_clip
* clean imports
* fix version
* clean up deprecation
* fix style
* add corner case test on kwarg overlap
* resume processing - add Unpack as importable
* add tmpdirname
* fix altclip
* fix up
* add back crop_size to specific tests
* generalize tests to possible video_processor
* add back crop_size arg
* fixup overlapping kwargs test for qformer_tokenizer
* remove copied from
* fixup chinese_clip tests values
* fixup tests - qformer tokenizers
* [run-slow] altclip, chinese_clip
* remove prepare_image_inputs
-
Pablo Montalvo authored
* fix tests with main revision and read token
* [run-slow]mamba2
* test previously skipped tests
* [run-slow]mamba2
* skip some tests
* [run-slow]mamba2
* finalize tests
* [run-slow]mamba2
-
Joao Gante authored
-
Joao Gante authored
-
Raushan Turganbay authored
* add tests
* fix whisper
* update
* nit
* add qwen2-vl
* more updates!
* better this way
* fix this one
* fix more tests
* fix final tests, hope so
* fix led
* Update tests/generation/test_utils.py (Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>)
* pr comments
* not pass pixels and extra for low-mem tests, very flaky because of vision tower
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
-
Raushan Turganbay authored
* load and save from video-processor folder
* Update src/transformers/models/llava_onevision/processing_llava_onevision.py (Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>)
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 18 Sep, 2024 (10 commits)
-
Yoach Lacombe authored
* clean mimi commit
* some nits suggestions from Arthur
* make fixup
* rename repo id + change readme
* Update docs/source/en/model_doc/mimi.md (Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>)
* add flaky flag to batching equivalence due to audio_codes failing sometimes
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Marc Sun authored
-
Yoni Gozlan authored
* modify rt detr to improve inference times when compiled
* Remove redundant "to"
* Fix conditional lru_cache and missing shapes_list
* nit unnecessary list creation
* Fix compile error when ninja not available and custom kernel activated
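The changes above target the `torch.compile` inference path. A hedged sketch of what compiling the model looks like; the class and checkpoint names are assumptions, not taken from the commit.

```python
# Hedged sketch (class and checkpoint names assumed): compile RT-DETR and
# run a dummy forward pass through the compiled model.
import torch
from transformers import RTDetrForObjectDetection

model = RTDetrForObjectDetection.from_pretrained("PekingU/rtdetr_r50vd").eval()
compiled_model = torch.compile(model)

pixel_values = torch.randn(1, 3, 640, 640)  # dummy 640x640 RGB image batch
with torch.no_grad():
    outputs = compiled_model(pixel_values=pixel_values)
print(outputs.logits.shape, outputs.pred_boxes.shape)
```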
-
Dominik Niedziela authored
* enforce original size to be a list
* formatting
* apply datatype change to unpad_image in llava_next
-
Matt authored
return attention mask in ASR pipeline
-
Joao Gante authored
-
Umar Butler authored
* Added support for bfloat16 to zero-shot classification pipeline
* Ensure support for TF. (Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>)
* Remove dependency on `torch`. (Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>)
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
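A hedged usage sketch for the bfloat16 support above; the model name is the pipeline's usual default and an assumption here, not part of the commit.

```python
# Hedged sketch: run the zero-shot classification pipeline in bfloat16
# via the standard torch_dtype argument (model name assumed).
import torch
from transformers import pipeline

classifier = pipeline(
    "zero-shot-classification",
    model="facebook/bart-large-mnli",
    torch_dtype=torch.bfloat16,
)
result = classifier(
    "The new GPU drivers doubled our training throughput.",
    candidate_labels=["hardware", "cooking", "politics"],
)
print(result["labels"][0], result["scores"][0])
```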
-
Yoach Lacombe authored
-
Ziyú Ye authored
* fix the wandb logging issue
* handle ConfigError in WandbCallback; move import to local scope
* update integration_utils.py; move import of ConfigError
* Update integration_utils.py: remove trailing whitespace
-
Ikram Ali authored
* Urdu docs added * fixed the misaligned issue.
-