1. 26 Feb, 2025 1 commit
  2. 25 Feb, 2025 17 commits
  3. 24 Feb, 2025 8 commits
  4. 21 Feb, 2025 6 commits
    • Fix exploitable regexes in Nougat and GPTSan/GPTJNeoXJapanese (#36121) · 92c5ca9d
      Matt authored
      
      * Fix potential regex catastrophic backtracking in NougatTokenizerFast
      
      The original regex pattern in tokenization_nougat_fast.py was vulnerable to
      catastrophic backtracking due to greedy quantifiers and nested alternations.
      This commit replaces it with a more efficient pattern that:
      
      1. Uses explicit character classes instead of dot (.)
      2. Handles whitespace more precisely
      3. Avoids unnecessary backtracking
      4. Supports both lowercase and uppercase roman numerals
      5. Maintains the same functionality while being more robust
      
      * Try another regex
      
      * Trying deepseek's answer
      
      * Start with a simplification
      
      * Another simplification
      
      * Just rewrite the whole function myself
      
      * Fix gptneox and gptsan
      
      * Simplify the regex even further
      
      * Tighten up the price regex a little
      
      * Add possessive version of the regex
      
      * Fix regex
      
      * Much cleaner regexes
      
      ---------
      
      Co-authored-by: openhands <openhands@all-hands.dev>
      92c5ca9d
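      A minimal, hypothetical sketch of the kind of hardening described in the commit above; the patterns below are illustrative only and are not the ones shipped in tokenization_nougat_fast.py:

      ```python
      import re

      # Illustrative only: nested quantifiers such as (?:\w+\s*)+ can backtrack
      # exponentially on near-miss inputs (catastrophic backtracking).
      backtracking_prone = re.compile(r"^[ivxIVX]+\.\s*(?:\w+\s*)+$")

      # Safer shape: explicit character classes, no nested quantifiers, and both
      # lowercase and uppercase roman numerals handled in a single class.
      hardened = re.compile(r"^[ivxlcdmIVXLCDM]+\.[ \t]*[^\r\n]*$")

      print(bool(hardened.match("iv. Results and Discussion")))  # True
      print(bool(hardened.match("IX. Conclusion")))              # True
      ```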
    • Uses Collection in transformers.image_transforms.normalize (#36301) · 547911e7
      CalOmnie authored
      * Uses Collection instead of Sequence in transformers.image_transforms.normalize
      
      * Uses collections.abc.Collection in lieu of deprecated typing one
      547911e7
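      A hedged sketch of why the type-hint change matters; the simplified signature below is an assumption, not the actual transformers.image_transforms.normalize definition:

      ```python
      from collections.abc import Collection  # replaces the deprecated typing.Collection
      from typing import Union

      import numpy as np

      # Assumed, simplified signature -- the real function accepts more arguments.
      # Typing mean/std as Collection[float] admits lists, tuples and other
      # collection types, not only Sequence types.
      def normalize(image: np.ndarray, mean: Union[float, Collection[float]], std: Union[float, Collection[float]]) -> np.ndarray:
          num_channels = image.shape[-1]
          mean = np.asarray(mean) if isinstance(mean, Collection) else np.full(num_channels, mean)
          std = np.asarray(std) if isinstance(std, Collection) else np.full(num_channels, std)
          return (image - mean) / std

      out = normalize(np.zeros((2, 2, 3)), mean=(0.5, 0.5, 0.5), std=[0.5, 0.5, 0.5])
      ```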
    • [tests] make quanto tests device-agnostic (#36328) · 7c5bd24f
      Fanli Lin authored
      * make device-agnostic
      
      * name change
      7c5bd24f
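      The usual shape of a device-agnostic test, sketched under the assumption that the accelerator is resolved at runtime; resolve_torch_device below is a hypothetical helper, not the transformers testing utility:

      ```python
      import torch

      # Hypothetical helper (not the transformers testing utility): pick whatever
      # accelerator is present instead of hard-coding "cuda" in the tests.
      def resolve_torch_device() -> str:
          if torch.cuda.is_available():
              return "cuda"
          if hasattr(torch, "xpu") and torch.xpu.is_available():
              return "xpu"
          return "cpu"

      torch_device = resolve_torch_device()
      dummy_input = torch.ones(1, 8, device=torch_device)  # runs on CUDA, XPU or CPU alike
      ```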
    • Joao Gante authored
    • Add SigLIP 2 (#36323) · a957b791
      Pavel Iakubovskii authored
      * Docs
      
      * Inits
      
      * Auto classes
      
      * Add siglip base
      
      * Add base tests
      
      * Fix Siglip V1 for fixed-res version
      
      * Add image processor
      
      * Update conversion
      
      * Experimenting with vectorized embeddings
      
      * Fixup
      
      * Add modular Siglip2Processor
      
      * Add modular configuration
      
      * Rename num patches
      
      * Correct image and text features merging
      
      * Working conversion script
      
      * Refactoring conversion script
      
      * Remove unused code in conversion script
      
      * Shorten dict a bit
      
      * Refactoring conversion
      
      * Done conversion refactoring
      
      * Fixup
      
      * Modular siglip2
      
      * Make model exportable and compilable without graph breaks
      
      * Remove position_ids from image_processor
      
      * Remove position ids from modeling file
      
      * Update modular
      
      * Type hint
      
      * Fixup
      
      * Set defaults to processor
      
      * Add integration test
      
      * Revert spatial shapes back to tensor
      
      * Change order
      
      * Fix most of the tests
      
      * Fix docstring
      
      * Remove interpolate_pos_encoding arg (not needed)
      
      * Update docs
      
      * Standardize processing
      
      * Fix attention_mask in vision head
      
      * Siglip v1: remove double transpose in FA2
      
      * Update modular file
      
      * Update FA2 test
      
      * Update expected logits
      
      * Fix interpolation for siglip2 image processor
      
      * Skip init test
      
      * Skip dispatch on flash test
      
      * Fix modeling tests
      
      * Fixup
      
      * Add dummy objects
      
      * Fix some docstrings
      
      * Add siglip2 in index.md
      
      * Fix consistency
      
      * Add docs
      
      * Remove size and data format
      
      * Add image processor tests
      
      * Fix
      
      * Add fast image processor
      
      * Fix style
      
      * Fix
      
      * Docs
      
      * Set lowercase for tokenizer
      
      * Adjust head size for Siglip v1
      
      * Update siglip2 for consistency with siglip1
      
      * Update siglip2 conversion
      
      * Update pipeline
      
      * Update checkpoints in tests
      
      * Update checkpoint name
      
      * Fix pooling for image classification model
      
      * Fix FA2 test
      
      * Update processor
      
      * Fix check repo
      
      * Update docs
      
      * Fix typos
      
      * Fix docstring for fast image processor
      
      * Add siglip2 to FA2 docs
      
      * Fix fast ip tests
      
      * Fix consistency
      
      * Fix tokenizer class for siglip v1
      
      * Fix missing header
      
      * Refactor scaling for clip, siglip, siglip2
      
      * Remove unused imports
      
      * Make fast IP default for siglip2
      
      * Update docs
      
      * Update checkpoints
      
      * Update modular
      
      * Update paper link
      
      * Fixup
      
      * Fix name in toctree
      
      * Fix test
      v4.49.0-SigLIP-2
      a957b791
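      A hedged usage sketch for the newly added SigLIP 2 model; the checkpoint id is an assumption and may differ from the weights actually published:

      ```python
      import requests
      import torch
      from PIL import Image
      from transformers import AutoModel, AutoProcessor

      ckpt = "google/siglip2-base-patch16-224"  # assumed checkpoint id
      model = AutoModel.from_pretrained(ckpt)
      processor = AutoProcessor.from_pretrained(ckpt)

      image = Image.open(requests.get("http://images.cocodataset.org/val2017/000000039769.jpg", stream=True).raw)
      texts = ["a photo of 2 cats", "a photo of a dog"]

      inputs = processor(text=texts, images=image, padding="max_length", return_tensors="pt")
      with torch.no_grad():
          outputs = model(**inputs)

      # SigLIP-style models are trained with a sigmoid loss, so per-pair
      # probabilities come from a sigmoid over the image-text logits, not a softmax.
      probs = torch.sigmoid(outputs.logits_per_image)
      ```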
    • VLMs: even more clean-up (#36249) · 14552cbd
      Raushan Turganbay authored
      * squash
      
      * style
      14552cbd
  5. 20 Feb, 2025 8 commits
    • Cyan authored
    • [smolvlm] make CI green (#36306) · 27d17075
      Joao Gante authored
      * add smolvlm to toctree
      
      * add requirements
      
      * dev-ci
      
      * no docker changes
      
      * dev-ci
      
      * update torch-light.dockerfile
      
      * derp
      
      * dev-ci
      27d17075
    • fix: prevent second save in the end of training if last step was saved already (#36219) · effaef33
      Nosimus authored
      
      * fix: prevent second save in the end of training
      
      * fix: prevent second save in the end of training
      
      * test: added test for no duplicate save on epoch save strategy
      
      * fix: removed TrainerControl
      
      * chore: style formatting
      
      ---------
      
      Co-authored-by: JaktensTid <jaktenstid1@gmail.com>
      effaef33
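      A minimal sketch of the guard this fix describes, with hypothetical names; it is not the actual Trainer code:

      ```python
      from typing import Optional

      # Hypothetical helper: skip the final save when the last optimization step
      # already produced a checkpoint (e.g. save_strategy="epoch" and training
      # ends exactly on an epoch boundary), so the same state is not written twice.
      def should_save_at_end_of_training(global_step: int, last_checkpoint_step: Optional[int]) -> bool:
          return last_checkpoint_step != global_step

      assert should_save_at_end_of_training(1000, 1000) is False  # already saved at step 1000
      assert should_save_at_end_of_training(1000, 900) is True    # final state not yet saved
      ```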
    • Fix typo in Pixtral example (#36302) · 5412ff1a
      12v authored
      Fix typo
      5412ff1a
    • SmolVLM2 (#36126) · 4397dfcb
      Orr Zohar authored
      
      * smolvlm init
      
      * updates
      
      * fixing bugs
      
      * minimal run, no checks
      
      * minimal run, no checks
      
      * passing first check + adding url support
      
      * updating video dataloading logic
      
      * fixing image logic
      
      * trying modular, but fails
      
      * modular is working, changing processor to match PR comments and general transformers logic
      
      * fixing kwargs
      
      * offloading video loading logic to  image_util
      
      * fixing circleci code formatting errors (repeated 14 times)
      
      * update
      
      * add idefics3-based tests
      
      * add keyword to all
      
      * add PreTrainedModel
      
      * updating video loading logic
      
      * working inference
      
      * updates for PR comments
      
      * updates for PR comments
      
      * moving SmolVLMPretrainedModel higher to fix import error
      
      * CI test pass
      
      * CI test pass
      
      * removing lambda
      
      * CI test pass (repeated 6 times)
      
      * processor tests
      
      * add example in docs
      
      * typo
      
      * fix copies
      
      * skip compile tests - sdpa for VisionTransformer
      
      * fix init
      
      * raise import error for num2words
      
      * update doc for FA2
      
      * more doc fix
      
      * CI
      
      * updates for PR comments
      
      * Update docs/source/en/model_doc/smolvlm.md
      
      Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
      
      * Update docs/source/en/model_doc/smolvlm.md
      
      Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
      
      * Update docs/source/en/model_doc/smolvlm.md
      
      Co-authored-by: Joshua Lochner <admin@xenova.com>
      
      * Update docs/source/en/model_doc/smolvlm.md
      
      Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
      
      * Update docs/source/en/model_doc/smolvlm.md
      
      Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
      
      * fixing processor -- tokenizer was not defined properly (gpt2 tokenizer) and was missing attributes such as the fake image token
      
      * adding smolvlm to VQA models
      
      * removing vqa auto class
      
      * Update src/transformers/models/smolvlm/processing_smolvlm.py
      
      Co-authored-by: Joshua Lochner <admin@xenova.com>
      
      * removing smolvlmvisiontransformer from index.md
      
      * my bad, video processing had typos
      
      * fixing docs
      
      * renaming params in SmolVLMModel.inputs_merger
      
      * removing un-needed dtype/device in model forward
      
      * ruff for CI
      
      * update docs
      
      * Update docs/source/en/model_doc/smolvlm.md
      
      Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
      
      * return cache position
      
      * return cache position
      
      * return cache also in modular
      
      * needed to run modular again
      
      * fix training tests
      
      * push vectorized inputs merger
      
      * format
      
      * format
      
      * reduce number of mappings
      
      * addressing PR comments
      
      * happy CI, happy me :)
      
      * skip non-nested images
      
      * adjust integration test for smaller GPUs
      
      * format
      
      * fix kwargs in chat template apply
      
      * skip this for now
      
      ---------
      
      Co-authored-by: raushan <raushan@huggingface.co>
      Co-authored-by: Pablo <pablo.montalvo.leroux@gmail.com>
      Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
      Co-authored-by: Joshua Lochner <admin@xenova.com>
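      A hedged usage sketch for SmolVLM2; the checkpoint id is an assumption and the exact processor behaviour may differ from the released model:

      ```python
      import torch
      from transformers import AutoModelForImageTextToText, AutoProcessor

      ckpt = "HuggingFaceTB/SmolVLM2-2.2B-Instruct"  # assumed checkpoint id
      processor = AutoProcessor.from_pretrained(ckpt)
      model = AutoModelForImageTextToText.from_pretrained(ckpt)

      messages = [
          {
              "role": "user",
              "content": [
                  {"type": "image", "url": "http://images.cocodataset.org/val2017/000000039769.jpg"},
                  {"type": "text", "text": "Describe this image."},
              ],
          }
      ]

      # Build model inputs from the chat template, then generate a short answer.
      inputs = processor.apply_chat_template(
          messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt"
      )
      with torch.no_grad():
          generated = model.generate(**inputs, max_new_tokens=64)
      print(processor.batch_decode(generated, skip_special_tokens=True)[0])
      ```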
    • Yih-Dar authored
      f2ab182d
    • Fix broken CI on release branch due to missing conversion files (#36275) · e8531a0e
      Yih-Dar authored
      
      * fix
      
      * fix
      
      ---------
      
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
      e8531a0e
    • Make cache traceable (#35873) · 5e2183f3
      Ilyas Moutawwakil authored
      simply make cache traceable
      5e2183f3
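      A generic toy illustration of what "traceable" usually requires, under the assumption that the goal is a cache whose update path is pure tensor ops; this is not the transformers Cache implementation:

      ```python
      import torch

      # Toy cache, NOT the transformers Cache: keeping the update path to tensor
      # ops (index_copy_) with no data-dependent Python branching lets torch.fx
      # capture it in a single graph.
      class ToyStaticCache(torch.nn.Module):
          def __init__(self, max_len: int, head_dim: int):
              super().__init__()
              self.register_buffer("keys", torch.zeros(max_len, head_dim))

          def forward(self, positions: torch.Tensor, new_keys: torch.Tensor) -> torch.Tensor:
              self.keys.index_copy_(0, positions, new_keys)
              return self.keys

      cache = ToyStaticCache(max_len=8, head_dim=4)
      traced = torch.fx.symbolic_trace(cache)
      _ = traced(torch.tensor([0, 1]), torch.randn(2, 4))
      ```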