Commits · fix_batch_test · 某某某 / transformers-new

15 May, 2025 4 commits
- remove unhandled parameter · 4f527ed1
  itazap authored 1 month ago
  
  4f527ed1
- Remove head mask in generative models (#35786) · 955e61b0
  Raushan Turganbay authored 1 month ago
```
* just squash into one commit

* delete print
```
  955e61b0
- enable csm integration cases on xpu, all passed (#38140) · 0173a99e
  Yao Matrix authored 1 month ago
```
* enable csm test cases on XPU, all passed

Signed-off-by: Matrix Yao <matrix.yao@intel.com>

* fix style

Signed-off-by: Matrix Yao <matrix.yao@intel.com>

---------

Signed-off-by: Matrix Yao <matrix.yao@intel.com>
```
  0173a99e
- [Qwen3] Qwen3 MoE add tp plan for expert mlps (#38135) · e5a48785
  Huang, Guangtai authored 1 month ago
```
fix tp plan
```
  e5a48785
14 May, 2025 10 commits

Fix incorrect attention mask truncate in WhisperFlashAttention2 (#36477) · 4005e30c


* Fix incorrect attention mask truncate in whisper flash attention

* also fix incorrect attention mask truncate in qwen2 audio

* Nit attention mask truncate modeling_qwen2_audio.py

* Nit attention mask truncate modeling_whisper.py

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

---------

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
Co-authored-by: eustlb <94853470+eustlb@users.noreply.github.com>

4005e30c

enable d_fine finetuning properly (#37962) · aa27fa75
Sangbum Daniel Choi authored 1 month ago
```
add pre_output in the front

Co-authored-by: Pavel Iakubovskii <qubvel@gmail.com>
```
aa27fa75
Add `manueldeprada` to `run_slow` whitelist (#38126) · e021bf6b
Manuel de Prada Corral authored 1 month ago
```
Add manueldeprada to run_slow allowed users
```
e021bf6b
[docs] add uv installation instructions for source builds (#37968) · ef27b2bc
Arjuna Sky Kok authored 1 month ago

ef27b2bc
Update trainer.md (#38113) · 4a2decd1
guspuffygit authored 1 month ago
```
Fix typo in torch.compile method parameters
```
4a2decd1

Add config validation and style tweaks (#37589) · 935bbbc7

Kirire authored 1 month ago


* Add config validation and style tweaks

* Fix style issues

* Fix style issues

* style

* Small fixes for copy/paste errors

---------

Co-authored-by: Cyrile <cyrile.delestre@arkea.com>

935bbbc7

Fix auto batch size finder test (#38125) · 1b009663
ivarflakstad authored 1 month ago
```
Ensure --auto_find_batch_size is the last test arg so indexing is correct
```
1b009663

Fix temporal padding in Qwen2VLImageProcessor when the number of frames is not divisible by temporal_patch_size (#38076) · fe918d13

Ritwick Chaudhry authored 1 month ago

Qwen2VL: Fix temporal padding in Qwen2VLImageProcessor when frames are not divisible by temporal_patch_size

fe918d13
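
Note on fe918d13: the commit message only says that temporal padding was wrong when the frame count is not divisible by `temporal_patch_size`. A minimal sketch of what correct padding could look like, assuming the padding works by repeating the last frame (an assumption, not confirmed by the message); `pad_frames` is a hypothetical helper, not a function from the library:

```python
def pad_frames(frames, temporal_patch_size):
    """Pad a frame list so its length is a multiple of temporal_patch_size.

    Assumption for illustration: padding repeats the last frame.
    """
    if not frames:
        raise ValueError("need at least one frame")
    if temporal_patch_size < 1:
        raise ValueError("temporal_patch_size must be positive")
    remainder = len(frames) % temporal_patch_size
    if remainder == 0:
        # Already divisible: nothing to add.
        return list(frames)
    # Add exactly as many copies as are missing to reach the next multiple.
    missing = temporal_patch_size - remainder
    return list(frames) + [frames[-1]] * missing
```

The point of the sketch is that the number of added frames depends on the remainder, so it stays correct for any patch size and not only for a size of 2.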

[video processor] fix tests (#38104) · aaf224d5

Raushan Turganbay authored 1 month ago

* fix tests

* delete

* fix one more test

* fix qwen + some tests are failing irrespective of `VideoProcessor`

* delete file

aaf224d5

enable finegrained_fp8 and granite_speech cases on XPU (#38036) · 9b5ce556

Yao Matrix authored 1 month ago


* enable finegrained_fp8 cases on XPU

Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* fix style

Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* change back to auto

Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* rename per comments

Signed-off-by: Matrix Yao <matrix.yao@intel.com>

---------

Signed-off-by: Yao Matrix <matrix.yao@intel.com>
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

9b5ce556

13 May, 2025 14 commits

Fix description and formatting errors in code docs (#38074) · b311a3f5

bilibili12433014 authored 1 month ago

* Update stopping_criteria.py

Fix description and formatting errors.

* Update stopping_criteria.py

Align formatting with existing files for consistency.

b311a3f5

Add style bot (#38102) · b499a14b
Marc Sun authored 1 month ago
```
add style bot
```
b499a14b
[CSM] update test for t4 runners (#38110) · e0f225cb
eustlb authored 1 month ago
```
update test for t4 runners
```
e0f225cb

Add Fast Image Processor for vilt (#37304) · 342961f6

Jinyong Lee authored 1 month ago


* init vilt image processor fast

* Refactor image processor tests to use loop for all processors

* Add ViltImageProcessorFast with PyTorch-based optimized image processing

* Change made automatically by make fixup command

* Change made automatically by make fix-copies command

* Fix type hints in ViltImageProcessorFast for Python compatibility

* Define constants for image resizing based on COCO dataset aspect ratio

* Add missing property initializations to ViltImageProcessorFast

* Extract resize logic into dedicated method in ViltImageProcessorFast

* Extract padding logic into dedicated method

* Implement shape-based image grouping for optimized processing in Vilt

* Update test suite to verify ViltImageProcessorFast attributes

* Move variable declarations to _preprocess method parameters

* Remove unused parameters

* Rename _resize method to resize to override existing function

* Remove whitespace

* Remove unnecessary type check and conversion for stacked_images

* Remove redundant loop and apply padding directly to stacked images

* Refactor pad function to return images and mask as tuple instead of dict

* Add tests comparing padding masks in slow and fast implementations

* Update ViltImageProcessor tests to ensure compatibility between slow and fast implementations

* Replace add_start_docstrings with auto_docstring in ViltImageProcessorFast

* Move docstrings of custom args to ViltFastImageProcessorKwargs

* Use reorder_images function for both masks and images

---------

Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>

342961f6

Fix InternVL interpolate_pos_encoding and add to video_processing_auto (#38092) · 8771766a
Yoni Gozlan authored 1 month ago
```
* fix InternVL interpolate_pos_encoding

* fix modular and auto_video_processor for internvl
```
8771766a
fix `check_bad_commit.py` giving wrong results (#38107) · 582d5e0e
Yih-Dar authored 1 month ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
582d5e0e

[bug] fix llava processor to calculate unpadding size correctly (#37988) · a5cc7a67

youngrok cha authored 1 month ago


* fix llava processor to calculate unpad size correctly

* repo consistency

* Revert "repo consistency" & "setUp in llava family"

This reverts commit 26a50af8db5b15bb6b700db3d53342fe69579d8e.

* add edge case test for padding & unpadding

* compute unpadding size from original size

* make test config explicit

* Revert "compute unpadding size from original size"

This reverts commit 752cd27ad9710ab056c17a9986760c4651975540.

* Revert "add edge case test for padding & unpadding"

This reverts commit ccbd094d69c3f8f6a259159164284f60ba835bce.

* revert unpad logic

* remove irrelevant tests

* model test

* remove processor from model test

---------

Co-authored-by: jaycha <jaycha@ncsoft.com>

a5cc7a67

Fix `past_key_values` type hint in model output types (#37953) · 67b3d45e
Chris authored 1 month ago
```
* F: Fix type hint.

* F: Use Cache type.

* F: Sort import.

* U: Format.

* U: Address reviews.
```
67b3d45e
Fix bug in prefill_chunk_size that ignores disable_compile flag (#38067) · 07feaad8
Eva Koroleva authored 1 month ago
```
Fix bug in prefill_chunk_size implementation that ignores disable_compile flag
```
07feaad8
[smolvlm] skip the test (#38099) · e40f301f
Raushan Turganbay authored 1 month ago
```
skip the test
```
e40f301f

Disable report callbacks for certain training tests (#38088) · e27d230d

ivarflakstad authored 1 month ago

* Disable report callbacks for certain training tests

* Disable report callbacks for test_auto_batch_size_finder

e27d230d

fix: Propagate `lr_scheduler_kwargs` options to create LR Scheduler when LayerWiseDummyOptimizer is used (#34559) · ab65ba47

Bongseok Lee authored 1 month ago

fix: fix get_scheduler

ab65ba47

add timeout for downloading the `librispeech_asr` dataset (#38073) · 8fb60bf6
Fanli Lin authored 1 month ago
```
* add timeout

* change 10 to 60
```
8fb60bf6

update `require_read_token` (#38093) · 3ad35d0b

Yih-Dar authored 1 month ago


* update require_read_token

* new repo

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

3ad35d0b

12 May, 2025 12 commits

Refactor image processor phi4 (#36976) · e3b70b0d

Yoni Gozlan authored 1 month ago

* refactor image processor phi4

* nits fast image proc

* add image tests phi4

* Fix image processing tests

* update integration tests

* remove revision and add comment in integration tests

e3b70b0d

uninstall `kernels` from docker images (#38083) · 4143f94d
Yih-Dar authored 1 month ago
```
uninstall kernels

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
4143f94d
update seed_worker to set seed based on worker_id and rank (#37980) · a63cb757
Shiyu authored 1 month ago
```
* update seed_worker to set seed based on worker_id and rank

* test case

* set output_dir as remove tmp dir
```
a63cb757
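
Note on a63cb757: the message states that `seed_worker` now sets the seed based on `worker_id` and rank. One way such a derivation might look is sketched below; the function name `derive_worker_seed`, its parameters, and the exact formula are assumptions for illustration and are not taken from the commit:

```python
def derive_worker_seed(base_seed, worker_id, num_workers, rank):
    """Return a seed that differs for every (rank, worker_id) pair.

    Hypothetical formula: offset the base seed by a unique index per
    data-loading worker across all processes, wrapped to 32 bits.
    """
    if not 0 <= worker_id < num_workers:
        raise ValueError("worker_id must be in [0, num_workers)")
    unique_index = rank * num_workers + worker_id
    return (base_seed + unique_index) % 2**32
```

With 2 ranks and 4 workers each, this yields 8 distinct seeds for the same base seed, so no two workers would draw the same random augmentation stream.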

Fix tot update in trainer (#37923) · e387821a

efsotr authored 1 month ago

* fix total updates in epoch

* add test; fix max_steps

* replace with multi-gpu decorator

e387821a

fix the inconsistent docstring in apply_chat_template (#38069) · f0e975c6

Weipeng Jiang authored 1 month ago

The commit (https://github.com/huggingface/transformers/commit/5cf11e5ab9591652ee025069658f9af5a98e455e) fixed the type hints for the parameter `tools` in apply_chat_template, but the docstring was not changed.

f0e975c6

chore(qwen2): display warning log only when sliding window attention … (#36316) · 31791b16

Junlin Zhou authored 1 month ago


* chore(qwen2): display warning log only when sliding window attention is enabled

* Align modeling_qwen2.py and modular_qwen2.py

---------

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

31791b16

Fix mt5 test on AMD devices (#38081) · 8ea72d12
ivarflakstad authored 1 month ago

8ea72d12
docs: fix md style (#38057) · 5c850180
谭九鼎 authored 1 month ago

5c850180
Add AMD expectation to test_gpt2_sample (#38079) · 7eaa90b8
ivarflakstad authored 1 month ago

7eaa90b8
Fix OneFormer integration test (#38016) · 4220039b
Pavel Iakubovskii authored 1 month ago
```
* Fix integration tests

* format
```
4220039b

[`chat`] generate parameterization powered by `GenerationConfig` and UX-related changes (#38047) · 8efe3a9d

Joao Gante authored 1 month ago

* accept arbitrary kwargs

* move user commands to a separate fn

* work with generation config files

* rm cmmt

* docs

* base generate flag doc section

* nits

* nits

* nits

* no <br>

* better basic args description

8efe3a9d

[VLM] fix loading issues (#38051) · a5c6172c
Raushan Turganbay authored 1 month ago
```
* fix qwen2-vl loading

* fix a few more models

* delete print

* fix copies
```
a5c6172c