- 09 Apr, 2024 9 commits
-
-
Marc Sun authored
* revert back to torch 2.1.1
* run test
* switch to torch 2.2.1
* update dockerfile
* fix awq tests
* fix test
* run quanto tests
* update tests
* split quantization tests
* fix
* fix again
* final fix
* fix report artifact
* build docker again
* Revert "build docker again" (reverts commit 399a5f9d9308da071d79034f238c719de0f3532e)
* debug
* revert
* style
* new notification system
* testing notification
* rebuild docker
* fix_prev_ci_results
* typo
* remove warning
* fix typo
* fix artifact name
* debug
* issue fixed
* debug again
* fix
* fix time
* test notif with failing test
* typo
* issues again
* final fix?
* run all quantization tests again
* remove name to clear space
* revert modification done on workflow
* fix
* build docker
* build only quant docker
* fix quantization ci
* fix
* fix report
* better quantization_matrix
* add print
* revert to the basic one
-
Yih-Dar authored
Co-authored-by: Wauplin <lucainp@gmail.com>
-
Yih-Dar authored
* fix mistral and mixtral
* add pdb
* fix mixtral test
* fix
* fix mistral?
* add fix gemma
* fix mistral
* fix
* test
* another test
* fix
* fix
* fix mistral tests
* fix them again
* final fixes for mistral
* fix padding right
* fix whisper fa2
* fix
* fix
* fix gemma
* test
* fix llama
* fix
* fix
* fix llama gemma
* add class attribute
* fix CI
* clarify whisper
* compute_capability
* rename names in some comments
* Add # fmt: skip
* make style
* Update tests/models/mistral/test_modeling_mistral.py
* update
* update

Co-authored-by: Younes Belkada <younesbelkada@gmail.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
NielsRogge authored
* Undo
* Use tokenizer
* Undo data collator
-
NielsRogge authored
* Fix data collator
* Support feature extractors as well
-
Matt authored
* See if we can get tests to pass with the fixed weights
* See if we can get tests to pass with the fixed weights
* Replace the revisions now that we don't need them anymore
-
Raushan Turganbay authored
fix copies
-
Matthew Hoffman authored
* Add datasets.Dataset to Trainer's train_dataset and eval_dataset type hints
* Add is_datasets_available check for importing datasets under TYPE_CHECKING guard (https://github.com/huggingface/transformers/pull/30077/files#r1555939352)
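The pattern this commit describes — making `datasets` a type-checking-only import behind an availability check — can be sketched as follows. This is a minimal, hypothetical stand-in, not the actual Trainer code; the helper mirrors the names used in the PR:

```python
import importlib.util
from typing import TYPE_CHECKING, Optional, Union


def is_datasets_available() -> bool:
    # Detect whether the `datasets` package is installed without importing it
    return importlib.util.find_spec("datasets") is not None


# Imported only while type checking, so `datasets` never becomes a hard
# runtime dependency; string annotations below stay unresolved at runtime.
if TYPE_CHECKING:
    import datasets


class Trainer:
    def __init__(
        self,
        train_dataset: Optional[Union["datasets.Dataset", object]] = None,
        eval_dataset: Optional[Union["datasets.Dataset", object]] = None,
    ) -> None:
        self.train_dataset = train_dataset
        self.eval_dataset = eval_dataset
```

The string forward reference keeps the annotation useful to type checkers while the guard keeps import time and dependencies unchanged for users without `datasets`.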
-
Sourab Mangrulkar authored
* fix sequence length errors
* fix label column name error for vit
* fix the lm_head embedding != linear layer mismatches for Seq2Seq models
-
- 08 Apr, 2024 17 commits
-
-
Jonathan Tow authored
* init: add StableLm 2 support
* add integration test for parallel residual and qk layernorm
* update(modeling): match qk norm naming for consistency with phi/persimmon
* fix(tests): run fwd/bwd on random init test model to jitter norm weights off identity
* `use_parallel_residual`: add copy pointer to `GPTNeoXLayer.forward`
* refactor: rename head states var in `StableLmLayerNormPerHead`
* tests: update test model and add generate check
-
Felix Hirwa Nshuti authored
* adding env variable for mps and is_torch_mps_available for Pipeline
* fix linting errors
* Remove environment override

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
DrAnaximandre authored
fix typo at ImportError
-
fxmarty authored
* remove controlflows
* style
* rename patch_ to padded_ following review comment
* style
-
Younes Belkada authored
* Update trainer.py
* fix copies
-
fxmarty authored
* fix falcon without attention_mask & alibi
* add test
* Update tests/models/falcon/test_modeling_falcon.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Anton Vlasjuk authored
fix learning rate display issue in galore optimizer
-
Nick Doiron authored
* pass token to trainer.push_to_hub
* fmt
* Update src/transformers/trainer.py
* pass token to create_repo, update_folder

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Utkarsha Gupte authored
* Fix "ImportError: Trainer with PyTorch requires accelerate>=0.20.1" by adding the evaluate and accelerate installs at the beginning of the cell
* ImportError fix: Trainer with PyTorch requires accelerate>=0.20.1
* Import error fix
* Update installation.md
* Update quicktour.md
* rollback other lang changes
* Update _config.py
* updates for other languages
* fixing error
* Tutorial update
* Update tokenization_utils_base.py
* Just use an optimizer string to pass the doctest?

Co-authored-by: Matt <rocketknight1@gmail.com>
-
amyeroberts authored
* Patch fix - don't use safetensors for TF models
* Skip test for TF for now
* Update for another test
-
JINO ROHIT authored
-
Fanli Lin authored
* add bnb flag
* move maker
* add accelerator maker
-
Haz Sameen Shahgir authored
updated examples/pytorch/language-modeling scripts and requirements.txt to require datasets>=2.14.0 (#30120)
* updated requirements.txt and require_version() calls in examples/pytorch/language-modeling to require datasets>=2.14.0
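The examples use a `require_version()` helper to enforce the new pin. As a rough illustration of the idea, here is a minimal stdlib-only stand-in (the real helper in transformers handles full requirement specifiers; this sketch only handles `pkg>=x.y.z` with numeric components):

```python
import importlib.metadata


def require_version(requirement: str) -> None:
    # Minimal stand-in: parse a "pkg>=x.y.z" requirement and compare it
    # against the installed distribution's version.
    pkg, minimum = requirement.split(">=")
    installed = importlib.metadata.version(pkg)  # PackageNotFoundError if absent
    as_tuple = lambda v: tuple(int(p) for p in v.split(".")[:3])
    if as_tuple(installed) < as_tuple(minimum):
        raise ImportError(f"{pkg}>={minimum} is required, found {installed}")


# As pinned in the updated examples:
# require_version("datasets>=2.14.0")
```

Failing fast at script start-up gives a clear error instead of an obscure failure later in the data pipeline.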
-
Howard Liberty authored
* Make MLflow version detection more robust and handle mlflow-skinny
* Make function name clearer and refactor the logic
* Further refactor
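The wrinkle motivating this change is that `mlflow-skinny` installs the same importable `mlflow` module under a different distribution name, so metadata-based version detection must try both. A minimal sketch of that idea (a stand-in, not the actual integration code):

```python
import importlib.metadata
from typing import Optional


def get_mlflow_version() -> Optional[str]:
    # `mlflow-skinny` provides the `mlflow` module but registers a different
    # distribution name, so check both before giving up.
    for dist_name in ("mlflow", "mlflow-skinny"):
        try:
            return importlib.metadata.version(dist_name)
        except importlib.metadata.PackageNotFoundError:
            continue
    return None
```

Returning `None` rather than raising lets the caller degrade gracefully when neither distribution is installed.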
-
Xu Song authored
-
vaibhavagg303 authored
* add _torch_extract_fbank_features_batch function in feature_extractor_whisper
* reformat feature_extraction_whisper.py file
* handle batching in single function
* add gpu test & doc
* add batch test & device in each __call__
* add device arg in doc string

Co-authored-by: vaibhav.aggarwal <vaibhav.aggarwal@sprinklr.com>
-
Cylis authored
-
- 05 Apr, 2024 13 commits
-
-
Raushan Turganbay authored
* clean-up whisper kwargs
* failing test
-
Yih-Dar authored
* fix
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Kola authored
* Add docstrings and types for MambaCache
* Update src/transformers/models/mamba/modeling_mamba.py
* Update src/transformers/models/mamba/modeling_mamba.py
* Update src/transformers/models/mamba/modeling_mamba.py
* make fixup
* import copy in generation_whisper
* ruff
* Revert "make fixup" (reverts commit c4fedd6f60e3b0f11974a11433bc130478829a5c)
-
Yih-Dar authored
* separate jobs
* separate jobs
* use channel name directly instead of ID
* use channel name directly instead of ID
* use channel name directly instead of ID

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Michael Benayoun authored
* [WIP] fix fx
* [WIP] fix fx
* [WIP] fix fx
* [WIP] fix fx
* [WIP] fix fx
* Apply changes to other models
-
Yih-Dar authored
* fix
* fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
miRx923 authored
Update quantizer_bnb_4bit.py: the ValueError string should say "...you need to set `llm_int8_enable_fp32_cpu_offload=True`..." instead of "`load_in_8bit_fp32_cpu_offload=True`" (#30013)
* Update quantizer_bnb_4bit.py — there is a mistake in the ValueError on line 86: the BitsAndBytesConfig() arguments were updated, but the ValueError message in quantizer_bnb_4bit.py was not
* Update quantizer_bnb_4bit.py — changed the ValueError string from "...you need to set `load_in_8bit_fp32_cpu_offload=True`..." to "...you need to set `llm_int8_enable_fp32_cpu_offload=True`..."
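For context, the flag the corrected message points at lives on `BitsAndBytesConfig`. A non-executable configuration sketch of its use (assumes `transformers` and `bitsandbytes` are installed; the checkpoint name is purely illustrative):

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# The corrected error message points at this flag: it lets modules that
# must stay in fp32 be offloaded to CPU while the rest of the model is
# quantized on GPU.
quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    llm_int8_enable_fp32_cpu_offload=True,  # not `load_in_8bit_fp32_cpu_offload`
)

model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m",  # illustrative checkpoint
    device_map="auto",
    quantization_config=quantization_config,
)
```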
-
Marc Sun authored
fix bnb test
-
NielsRogge authored
* Add image processor to trainer
* Replace tokenizer=image_processor everywhere
-
Adam Louly authored
* fix mixtral onnx export
* fix qwen model
-
Wang, Yi authored
* if the output is a tuple (as for facebook/hf-seamless-m4t-medium), the waveform is the first element
* add test and fix batch issue
* add dict output support for seamless_m4t

Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
-
Yih-Dar authored
skip test_encode_decode_fast_slow_all_tokens for now

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Add whisper

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 04 Apr, 2024 1 commit
-
-
Saurabh Dash authored
* changes
* addressing comments
* smol fix
-