Commits · ci-test-huggingface-hub-v0.29.0.rc2 · 某某某 / transformers-new

18 Feb, 2025 7 commits

Test hfh v0.29.0.rc2 · 45bb453a
Hugging Face Bot (RC Testing) authored 4 months ago

45bb453a

Added Support for Custom Quantization (#35915) · 8eaae6be

Parteek authored 4 months ago


* Added Support for Custom Quantization

* Update code

* code reformatted

* Updated Changes

* Updated Changes

---------

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

8eaae6be

GitModelIntegrationTest - flatten the expected slice tensor (#36260) · 07182b2e
ivarflakstad authored 4 months ago
```
Flatten the expected slice tensor
```
07182b2e

Fix XGLM loss computation (PyTorch and TensorFlow) (#35878) · 4d2de5f6

Damiano Amatruda authored 4 months ago

* Fix XGLM loss computation (PyTorch and TensorFlow)

* Update expected output string in XGLM sample test

This updates the expected output string of test_xglm_sample for torch
2.0 to the correct one and removes the one for torch 1.13.1 + cu116
(transformers moved to torch 2.0 with PR #35358).

* Update expected output IDs in XGLM generation test

4d2de5f6

feat: add support for tensor parallel training workflow with accelerate (#34194) · c3ba5330

Mehant Kammakomati authored 4 months ago


* feat: add support for tensor parallel flow using accelerate

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* fix: add tp degree to env variable

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* fix: add version check for accelerate to allow TP

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* docs: tensor parallelism

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* nit: rename plugin name

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* fix: guard accelerate version before allow tp

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* docs: add more docs and updates related to TP

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

---------

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

c3ba5330

Remove flakiness in VLMs (#36242) · e6cc410d
Raushan Turganbay authored 4 months ago
```
* fix

* nit

* no logits processor needed

* two more tests on assisted decoding
```
e6cc410d

Fix TorchAoConfig not JSON serializable (#36206) · fdcfdbfd

andrewor14 authored 4 months ago

**Summary:** TorchAoConfig optionally contains a
`torchao.dtypes.Layout` object which is a dataclass and not
JSON serializable, and so the following fails:

```
import json
from torchao.dtypes import TensorCoreTiledLayout
from transformers import TorchAoConfig

config = TorchAoConfig("int4_weight_only", layout=TensorCoreTiledLayout())

config.to_json_string()

json.dumps(config.to_dict())
```

This also causes `quantized_model.save_pretrained(...)` to
fail because the first step of this call is to JSON serialize
the config. Fixes https://github.com/pytorch/ao/issues/1704

.

**Test Plan:**
python tests/quantization/torchao_integration/test_torchao.py -k test_json_serializable

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

fdcfdbfd

17 Feb, 2025 12 commits
- Au revoir flaky `test_fast_is_faster_than_slow` (#36240) · 626666c4
  Yih-Dar authored 4 months ago
```
* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  626666c4
- [tests] remove `test_export_to_onnx` (#36241) · 429f1a68
  Joao Gante authored 4 months ago
  
  429f1a68
- Add compressed tensor in quant dockerfile (#36239) · dae8708c
  Marc Sun authored 4 months ago
```
add compressed_tensors in the dockerfile
```
  dae8708c
- Bump transformers from 4.38.0 to 4.48.0 in /examples/research_projects/codeparrot/examples (#36237) · 3e970dbb
  dependabot[bot] authored 4 months ago
```
Bump transformers in /examples/research_projects/codeparrot/examples

Bumps [transformers](https://github.com/huggingface/transformers) from 4.38.0 to 4.48.0.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](https://github.com/huggingface/transformers/compare/v4.38.0...v4.48.0

)

---
updated-dependencies:
- dependency-name: transformers
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
```
  3e970dbb
- [generate] Fix encoder decoder models attention mask (#36018) · 77aa9fc0
  eustlb authored 4 months ago
  
  77aa9fc0
- [tests] remove tf/flax tests in `/generation` (#36235) · 55493f13
  Joao Gante authored 4 months ago
  
  55493f13
- v4.45.0-dev0 · c877c9fa
  Arthur Zucker authored 4 months ago
  
  c877c9fa
- Add missing atol to torch.testing.assert_close where rtol is specified (#36234) · 7ec35bc3
  ivarflakstad authored 4 months ago
  
  7ec35bc3
- [generate] remove cache v4.47 deprecations (#36212) · dad513e0
  Joao Gante authored 4 months ago
  
  dad513e0
- AMD DeepSpeed image additional HIP dependencies (#36195) · 936aeb70
  ivarflakstad authored 4 months ago
```
* Add hipsolver and hipblastlt as dependencies

* Upgrade torch libs with rocm6.2.4 index
```
  936aeb70
- Fix `LlavaForConditionalGenerationModelTest::test_config` after #36077 (#36230) · 23d6095e
  Yih-Dar authored 4 months ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  23d6095e
- [tests] fix `EsmModelIntegrationTest::test_inference_bitsandbytes` (#36225) · fae0f3dd
  Fanli Lin authored 4 months ago
```
fix failed test
```
  fae0f3dd
14 Feb, 2025 15 commits

set `test_torchscript = False` for Blip2 testing (#35972) · dd16acb8

Yih-Dar authored 4 months ago


* just skip

* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

dd16acb8

Use `args.num_workers` in `check_modular_conversion.py` (#36200) · 0a9923a6
Yih-Dar authored 4 months ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
0a9923a6

add shared experts for upcoming Granite 4.0 language models (#35894) · a570e2ba

Mayank Mishra authored 4 months ago


* Modular GraniteMoE with shared Experts.

Signed-off-by: Shawn Tan <shawntan@ibm.com>

* Modified

* Import order.

* Modified for style

* Fix space.

* Test

* Remove extra granitemoe file.

* New converted file and tests

* Modified __init__ files.

* Formatting.

* Dummy PT objects

* register granitemoe shared model

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

* fix linting of a file

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

* fix import in modeling file

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

* update generated modeling file

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

* add documentation

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

* update docstrings

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

* update generated modeling file

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

* fix docstrings in config class

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

* merge main

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

---------

Signed-off-by: Shawn Tan <shawntan@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Co-authored-by: Shawn Tan <shawntan@ibm.com>
Co-authored-by: Shawn Tan <shawn@wtf.sg>
Co-authored-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Co-authored-by: Sukriti Sharma <Ssukriti@users.noreply.github.com>

a570e2ba

Add @require_bitsandbytes to Aria test_batched_generation (#36192) · 7ae7e87a
ivarflakstad authored 4 months ago

7ae7e87a

[Bugfix] Fix reloading of pixtral/llava configs (#36077) · bcfc9d79

Kyle Sayers authored 4 months ago


* add is_composition flag to LlavaConfig

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* WIP: pixtral text config

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* fix style

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* add test

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* use is_composition for pixtral

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

* Revert "use is_composition for pixtral"

This reverts commit a53d5f9fc5149c84419b0e9e03db6d99362add53.

* Revert "Revert "use is_composition for pixtral""

This reverts commit 3ab1c99404e2c2963fba0bcf94b9786d6365db0f.

---------

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

bcfc9d79

VLM: compile compatibility (#35724) · 0c78ef6c

Raushan Turganbay authored 4 months ago


* llavas

* add mroe models

* fix `compile_forward` test for all models

* fix copies

* make style

* also doesn't support cache class

* fix some tests

* not copied from

* ci green?

* fix tests

* fix copies

* fix tests

* check with `numel` and remove `item`

* fix copies

* fix copies

* Update src/transformers/models/cohere2/modeling_cohere2.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* opt remove cross attn

* gemma2

* fixup

* fixup

* fix newly added test

* maybe fixed?

* green please?

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

0c78ef6c

Guard against unset resolved_archive_file (#35628) · b45cf0e9

David LaPalomento authored 4 months ago


* archive_file may not be specified
When loading a pre-trained model from a gguf file, resolved_archive_file may not be set. Guard against that case in the safetensors availability check.

* Remap partial disk offload to cpu for GGUF files
GGUF files don't support disk offload so attempt to remap them to the CPU when device_map is auto. If device_map is anything else but None, raise a NotImplementedError.

* Don't remap auto device_map and raise RuntimeError
If device_map=auto and modules are selected for disk offload, don't attempt to map them to any other device. Raise a runtime error when a GGUF model is configured to map any modules to disk.

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

b45cf0e9

Revert qwen2 breaking changes related to attention refactor (#36162) · 96f01a36
Arthur authored 4 months ago
```
* dito

* add a test

* upsate

* test needs fa2

* update test and configuration

* test requires fa2

* style
```
96f01a36
Add require_read_token to fp8 tests (#36189) · cb586a39
Mohamed Mekkouri authored 4 months ago
```
fix
```
cb586a39

New HIGGS quantization interfaces, JIT kernel compilation support. (#36148) · 5f726f8b

Andrei Panferov authored 4 months ago


* new flute

* new higgs working

* small adjustments

* progress and quallity

* small updates

* style

---------

Co-authored-by: Andrey Panferov <panferov.andrey3@wb.ru>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

5f726f8b

Prepare processors for VideoLLMs (#36149) · 15ec971b

Raushan Turganbay authored 4 months ago


* allow processor to preprocess conversation + video metadata

* allow callable

* add test

* fix test

* nit: fix

* add metadata frames_indices

* Update src/transformers/processing_utils.py

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

* Update src/transformers/processing_utils.py

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

* port updates from Orr and add one more test

* Update src/transformers/processing_utils.py

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

* typo

* as dataclass

* style

* docstring + maek sure tests green

---------

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

15ec971b

Add ImageProcessorFast to Qwen2.5-VL processor (#36164) · 33d1d715

Isotr0py authored 4 months ago


* add qwen2 fast image processor to modular file

Signed-off-by: isotr0py <2037008807@qq.com>

* fix modular

Signed-off-by: isotr0py <2037008807@qq.com>

* fix circle import

Signed-off-by: isotr0py <2037008807@qq.com>

* add docs

Signed-off-by: isotr0py <2037008807@qq.com>

* fix typo

Signed-off-by: isotr0py <2037008807@qq.com>

* add modular generated files

Signed-off-by: isotr0py <2037008807@qq.com>

* revert qwen2vl fast image processor

Signed-off-by: isotr0py <2037008807@qq.com>

* remove qwen2.5-vl image processor from modular

Signed-off-by: isotr0py <2037008807@qq.com>

* re-generate qwen2.5-vl files

Signed-off-by: isotr0py <2037008807@qq.com>

* remove unnecessary test

Signed-off-by: isotr0py <2037008807@qq.com>

* fix auto map

Signed-off-by: isotr0py <2037008807@qq.com>

* cleanup

Signed-off-by: isotr0py <2037008807@qq.com>

* fix model_input_names

Signed-off-by: isotr0py <2037008807@qq.com>

* remove import

Signed-off-by: isotr0py <2037008807@qq.com>

* make fix-copies

Signed-off-by: isotr0py <2037008807@qq.com>

---------

Signed-off-by: isotr0py <2037008807@qq.com>

33d1d715

Chat template docs (#36163) · 1931a351

Raushan Turganbay authored 4 months ago


* decompose chat template docs

* add docs

* update model docs

* qwen2-5

* pixtral

* remove old chat template

* also video as list frames supported

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/chat_template_multimodal.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* remove audio for now

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

1931a351

CI: fix `test-save-trainer` (#36191) · 3bf02cf4
Raushan Turganbay authored 4 months ago
```
* fix

* also the docstring
```
3bf02cf4
Add support for partial rotary embeddings in Phi3 model (#35947) · 0ae93d31
Amit Garg authored 4 months ago
```
* Added support for partial_rotary_factor

* addressed comments

* refactored
```
0ae93d31

13 Feb, 2025 6 commits

Uniformize OwlViT and Owlv2 processors (#35700) · 336dc69d

Yoni Gozlan authored 4 months ago

* uniformize owlvit processor

* uniformize owlv2

* nit

* add positional arg test owlvit

* run-slow: owlvit, owlv2

* run-slow: owlvit, owlv2

* remove one letter variable

336dc69d

Fix make_batched_videos and add tests (#36143) · e6a79817

Yoni Gozlan authored 4 months ago

* add support for initial shift in video processing and other fixes

* revert modifications video loading functions

e6a79817

Fix a mistake in #36175 (#36179) · 8fd4bc7d
Yih-Dar authored 4 months ago
```
fix my bad

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8fd4bc7d
Follow up to SpQR integration (#36176) · b1a2de07
Mohamed Mekkouri authored 4 months ago
```
fix
```
b1a2de07

Fix the key name for _load_rng_state under torch.cuda (#36138) · 12962fe8

Wizyoung authored 4 months ago


fix load key name for _load_rng_state under torch.cuda

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

12962fe8

Make `check_repository_consistency` run faster by MP (#36175) · bfe46c98

Yih-Dar authored 4 months ago


* speeddddd

* speeddddd

* speeddddd

* speeddddd

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

bfe46c98