Commits · fix_autoawq_docker · zhusg / transformers-new

26 Nov, 2024 4 commits
- testing new ci · da8379ce
  MekkCyber authored 6 months ago
  
  da8379ce
- autoawq docker · 8c5d73f4
  MekkCyber authored 6 months ago
  
  8c5d73f4
- Skipping aqlm non working inference tests till fix merged (#34865) · 0e805e6d
  Mohamed Mekkouri authored 6 months ago
  
  0e805e6d
- VideoLLaVA: add default values (#34916) · 73b4ab10
  Raushan Turganbay authored 6 months ago
```
add default values
```
  73b4ab10
25 Nov, 2024 25 commits

Fix import structure for Fast Image processors (#34859) · bdb29ff9
Yoni Gozlan authored 6 months ago
```
* Fix import structure image_processor_fast

* update to new inits
```
bdb29ff9

making gpt2 fx traceable (#34633) · bfc3556b

xuzifei-dmatrix authored 6 months ago

* making gpt2 fx traceable

* running make fix-copies

* Revert "running make fix-copies"

This reverts commit 5a3437cb5b63799243bceae7d21a2aed8d0418c7.

bfc3556b

Updated documentation and added conversion utility (#34319) · 95c10fed

Viktor Scherbakov authored 6 months ago


* Updated documentation and added conversion utility

* Update docs/source/en/tiktoken.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/tiktoken.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Moved util function to integration folder + allow for str

* Update formatting

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Updated formatting

* style changes

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

95c10fed

Fix failing GGML test (#34871) · 890ea7de
Mohamed Mekkouri authored 6 months ago
```
fix_test
```
890ea7de
Upgrade torch version to 2.5 in dockerfile for quantization CI (#34924) · b76a292b
Mohamed Mekkouri authored 6 months ago
```
* Upgrade Torch 2.5

* uncomment
```
b76a292b
Fix `test_auto_backbone_timm_model_from_pretrained` (#34877) · a830df29
Yih-Dar authored 6 months ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a830df29

fix static cache data type mismatch (#34799) · a464afbe

jiqing-feng authored 6 months ago


* fix gptj data type mismatch

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* add low precision static cache tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix low-precision static cache tests

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* avoid config change

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* change data type convert in cache copy

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix comment

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* cast key value after k v out

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

a464afbe

[AWQ, CI] Bump AWQ version used in docker image (#34922) · b13916c0

Benjamin Bossan authored 6 months ago

The old AWQ version is failing with the latest (unreleased)
transformers, giving the error:

> ImportError: cannot import name 'shard_checkpoint' from
'transformers.modeling_utils'

This has been resolved in awq v0.2.7:

https://github.com/casper-hansen/AutoAWQ/pull/644

b13916c0

Fix : BitNet tests (#34895) · 4e6b19cd
Mohamed Mekkouri authored 6 months ago
```
* fix_tests_bitnet

* fix format
```
4e6b19cd
Rename OLMo November to OLMo2 (#34864) · 9121ab8f
Shane A authored 6 months ago
```
* Rename/move OLMo Nov files to OLMo2

* Rename Olmo1124 and its variants to Olmo2
```
9121ab8f

Bump tornado from 6.4.1 to 6.4.2 in /examples/research_projects/lxmert (#34917) · 1de3598d

dependabot[bot] authored 6 months ago

Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.4.1 to 6.4.2.
- [Changelog](https://github.com/tornadoweb/tornado/blob/v6.4.2/docs/releases.rst)
- [Commits](https://github.com/tornadoweb/tornado/compare/v6.4.1...v6.4.2)

---
updated-dependencies:
- dependency-name: tornado
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

1de3598d

Fix Qwen2 failing tests (#34819) · f4c04ba3
Jacky Lee authored 6 months ago
```
* fix: qwen2 model ids

* fix: line

* fix: more format

* update: reformat
```
f4c04ba3

[`peft`] Given that `self.active_adapter` is deprecated, avoid using it (#34804) · 11cc2295

Tom Aarsen authored 6 months ago

* Given that self.active_adapter is deprecated, avoid using it

* Remove misleading comment - `self.active_adapter` is not used (and deprecated)

11cc2295

Fix convert_tokens_to_string when decoder is None (#34569) · 74db22f9

Donald Szeto authored 6 months ago


* Fix convert_tokens_to_string when decoder is None

* revert unrelated changes

---------

Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>

74db22f9

chore: fix some typos (#34891) · 97514a8b
wanxiangchwng authored 6 months ago
```
Signed-off-by: wanxiangchwng <cui.shuang@foxmail.com>
```
97514a8b

Bump tornado from 6.4.1 to 6.4.2 in /examples/research_projects/visual_bert (#34887) · 62ab94de

dependabot[bot] authored 6 months ago

Bump tornado in /examples/research_projects/visual_bert

Bumps [tornado](https://github.com/tornadoweb/tornado) from 6.4.1 to 6.4.2.
- [Changelog](https://github.com/tornadoweb/tornado/blob/v6.4.2/docs/releases.rst)
- [Commits](https://github.com/tornadoweb/tornado/compare/v6.4.1...v6.4.2)

---
updated-dependencies:
- dependency-name: tornado
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

62ab94de

prepare_fa2_from_position_ids function bugfix (#33269) · c50b5675
Meliksah Turker authored 6 months ago
```
contiguous() is called before view() for key and value within prepare_fa2_from_position_ids function
```
c50b5675

allow unused input parameters passthrough when chunking in asr pipelines (#33889) · a0f4f317

VictorAtIfInsurance authored 6 months ago

* allow unused parameter passthrough when chunking in asr pipelines

* format code

* format

* run fixup

* update tests

* update parameters to pipeline in test

* updates parameters in tests

* change spelling in gitignore

* revert .gitignore to main

* add git ignore of devcontainer folder

* assert asr output follows expected inference output type

* run fixup

* Remove .devcontainer from .gitignore

* remove compliance check

a0f4f317

Sum gathered input tokens (#34554) · 4dc1a693

kang sheng authored 6 months ago


* sum gathered input tokens

* ruff line-length is 119, format the code

---------

Co-authored-by: kangsheng <kangsheng@meituan.com>

4dc1a693

Mllama: fix base prefix (#34874) · 1e492afd
Raushan Turganbay authored 6 months ago
```
fix base prefix
```
1e492afd

[`Deberta/Deberta-v2`] Refactor code base to support compile, export, and fix LLM (#22105) · 857d46ca

Arthur authored 6 months ago

* some modification for roadmap

* revert some changes

* yups

* weird

* make it work

* sttling

* fix-copies

* fixup

* renaming

* more fix-copies

* move stuff around

* remove torch script warnings

* ignore copies

* revert bad changes

* woops

* just styling

* nit

* revert

* style fixup

* nits configuration style

* fixup

* nits

* will this fix the tf pt issue?

* style

* ???????

* update

* eval?

* update error message

* updates

* style

* grumble grumble

* update

* style

* nit

* skip torch fx tests that were failing

* style

* skip the failing tests

* skip another test and make style

857d46ca

BLIP: fix generation after hub update (#34876) · 098962da

Raushan Turganbay authored 6 months ago


* fix blip generation

* dont remove it yet

* Update src/transformers/models/blip_2/modeling_blip_2.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* address comments

* modular

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

098962da

Cache: init empty cache when `use_cache` (#34274) · c1a85204

Raushan Turganbay authored 6 months ago

* fix

* fix tests

* fix copies

* add docs

* Revert "add docs"

This reverts commit 32d35634f12ba02781d2ebdee0c8dcfbe992a7b9.

* qwen move deltas

* mllama can potentially fullgraph compile

* enable mllama compile and fix tests

* remove mllama fixes

c1a85204

Add safe_globals to resume training on PyTorch 2.6 (#34632) · 1339a14d

Dmitry Rogozhkin authored 6 months ago

Starting with version 2.4, PyTorch introduces a stricter check on the objects that
can be loaded with torch.load(). Starting with version 2.6, loading with weights_only=True
requires allowlisting such objects.

This commit adds an allowlist of some numpy objects used to load model checkpoints.
Usage is restricted by a context manager. Users can still call
torch.serialization.add_safe_globals() to add other objects to the safe globals list.

The Accelerate library ran into the same problem and addressed it in PR-3036.

Fixes: #34631
See: https://github.com/pytorch/pytorch/pull/137602
See: https://pytorch.org/docs/stable/notes/serialization.html#torch.serialization.add_safe_globals
See: https://github.com/huggingface/accelerate/pull/3036

Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>

1339a14d
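
The allowlisting idea described in commit 1339a14d can be illustrated without PyTorch. What follows is a minimal sketch built only on Python's standard `pickle` module, assuming a simple module-level allowlist. The names `AllowlistUnpickler`, `restricted_loads`, and this `safe_globals` helper are hypothetical and chosen for illustration; this is not the PyTorch or transformers implementation, only an analogue of "reject unknown globals unless a context manager temporarily allows them".

```python
import contextlib
import io
import pickle
from collections import OrderedDict
from fractions import Fraction

# Hypothetical allowlist keyed by (module, qualified name); not PyTorch's own list.
_SAFE_GLOBALS = {("collections", "OrderedDict")}


class AllowlistUnpickler(pickle.Unpickler):
    """Resolve only globals that are on the allowlist and reject everything else."""

    def find_class(self, module, name):
        if (module, name) in _SAFE_GLOBALS:
            return super().find_class(module, name)
        raise pickle.UnpicklingError(f"global '{module}.{name}' is not allowlisted")


@contextlib.contextmanager
def safe_globals(extra):
    """Temporarily allow extra classes, restoring the previous allowlist on exit."""
    global _SAFE_GLOBALS
    previous = set(_SAFE_GLOBALS)
    _SAFE_GLOBALS = previous | {(cls.__module__, cls.__qualname__) for cls in extra}
    try:
        yield
    finally:
        _SAFE_GLOBALS = previous


def restricted_loads(data):
    """Unpickle bytes while enforcing the current allowlist."""
    return AllowlistUnpickler(io.BytesIO(data)).load()
```

In this sketch, `restricted_loads(pickle.dumps(Fraction(1, 3)))` raises `pickle.UnpicklingError`, while the same call inside `with safe_globals([Fraction]):` succeeds. Scoping the extra entries to a context manager keeps the wider allowlist from leaking into unrelated loads.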

Fix: Enable prefill phase key value caching of nemotron/minitron models (#34742) · 318fe25f

jeongin601 authored 6 months ago


* modeling nemotron kv caching bugfix

Signed-off-by: jeongin601 <0200angela@gmail.com>

* test file deleted

Signed-off-by: jeongin601 <0200angela@gmail.com>

* code refinement

Signed-off-by: jeongin601 <0200angela@gmail.com>

* remove unused variables

Signed-off-by: jeongin601 <0200angela@gmail.com>

* import block sorted

* removed deprecation warning

Signed-off-by: jeongin601 <0200angela@gmail.com>

* removed support for tuple shape past_key_values

Signed-off-by: jeongin601 <0200angela@gmail.com>

* Update conditional statement for cache initialization

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------

Signed-off-by: jeongin601 <0200angela@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

318fe25f

22 Nov, 2024 9 commits

Fix support for image processors modifications in modular (#34866) · 3a8eb746
Yoni Gozlan authored 6 months ago
```
* add fix and examples

* fix camel case naming
```
3a8eb746
Bitnet test fix to avoid using gated model (#34863) · 54be2d7a
Mohamed Mekkouri authored 6 months ago
```
small test fix
```
54be2d7a

[CI] Skip EETQ tests while package is broken with latest transformers (#34854) · 286ffaaf

Benjamin Bossan authored 6 months ago

* CI Skip EETQ tests while package is broken

EETQ tries to import the shard_checkpoint function from transformers but
the function has been removed. Therefore, trying to use EETQ currently
results in an import error. This fix results in EETQ tests being skipped
if there is an import error.

The issue has been reported to EETQ:

https://github.com/NetEase-FuXi/EETQ/issues/34

* Raise helpful error when trying to use eetq

* Forgot to raise the error in else clause

286ffaaf
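
The behaviour described in commit 286ffaaf (skip the tests when the optional package fails to import, and raise a helpful error when it is used) might look roughly like the sketch below. It relies only on the standard library. The module name `eetq_backend_placeholder` and the helpers `backend_importable` and `require_backend` are hypothetical stand-ins, not the names used in transformers.

```python
import importlib
import unittest


def backend_importable(module_name):
    """Return True only if the optional backend imports cleanly."""
    try:
        importlib.import_module(module_name)
    except ImportError:
        return False
    return True


# Hypothetical module name used purely for illustration.
BACKEND_OK = backend_importable("eetq_backend_placeholder")


@unittest.skipUnless(BACKEND_OK, "optional backend failed to import")
class QuantizedBackendTest(unittest.TestCase):
    def test_roundtrip(self):
        # Placeholder body: it would only run if the backend were importable.
        self.fail("backend-specific assertions would go here")


def require_backend(module_name):
    """Raise a descriptive error up front instead of failing deep in a call stack."""
    if not backend_importable(module_name):
        raise ImportError(
            f"'{module_name}' could not be imported; "
            "install a compatible version to use this feature."
        )
```

Catching `ImportError` rather than only checking whether the package is installed matters here, because the commit describes a package that is present but broken at import time.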

smol improvements to support more flexible usage (#34857) · 861758e2
Andrés Marafioti authored 6 months ago
```
* smol improvements to support more flexible usage

* ruff
```
861758e2

Speculative decoding: Test the target distribution (to prevent issues like #32867) (#34553) · 42b36d73

Nadav Timor authored 6 months ago

* Update test_utils.py

* formatting

* Update test_utils.py

* formatting

* formatting

* Update test_utils.py

* formatting

* Update test_utils.py

* formatting

* format

* comments at standard positions

42b36d73

Auto compile when static cache (#34247) · 597efd21

Arthur authored 6 months ago


* generate with compile

* nits

* simple

* generate with compile

* nits

* simple

* safe

* style

* Update src/transformers/generation/utils.py

Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>

* remove TOKENIZER forked warning

---------

Co-authored-by: Cyril Vallez <cyril.vallez@huggingface.co>

597efd21

Remove quantization related config from dequantized model (#34856) · d9e6f307
Konrad Kalita authored 6 months ago
```
* Remove quantization related config from dequantized model

* Fix whitespace
```
d9e6f307

Update checks for torch.distributed.tensor to require torch >= 2.5 (#34816) · 1867be66

Logan Adams authored 6 months ago

* Update checks for torch.distributed.tensor

* Update PR with feedback

* Formatting fix for import order

* Remove unused function

1867be66

Watermarking: fix order (#34849) · 6a912ff2
Raushan Turganbay authored 6 months ago
```
fix watermarking order
```
6a912ff2

21 Nov, 2024 2 commits

Refactor StarCoder2 using modular (#34015) · 4e90b99e

Cyril Vallez authored 6 months ago

* Create modular_starcoder2.py

* Update modular_starcoder2.py

* update

* finalize modular

* revert # no-unravel

* Add support

* style

* Update modular_model_converter.py

* update docstring

4e90b99e

Fix heuristic scheduling for UAG (#34805) · 18871599
Jonathan Mamou authored 6 months ago
```
* fix heuristic schedule

* fix style

* fix format
```
18871599