Commits · tp-support · zhusg / transformers-new

23 Jan, 2025 1 commit
- fix copies · 2d480ecc
  Arthur Zucker authored 4 months ago
  
  2d480ecc
17 Jan, 2025 2 commits

add more TP support · ef0b5e27
Arthur Zucker authored 5 months ago

ef0b5e27

An attempt to fix #29554. Include 'LayerNorm.' in gamma/beta rename scope,... · 8c1b5d37

Ross Wightman authored 5 months ago

 An attempt to fix #29554. Include 'LayerNorm.' in gamma/beta rename scope, optimize string search. (#35615)

* An attempt to fix #29554. Include 'LayerNorm.' in gamma/beta rename scope, reduce number of characters searched on every load considerably.

* Fix fix on load issue

* Fix gamma/beta warning test

* A style complaint

* Improve efficiency of weight norm key rename. Add better comments about weight norm and layer norm renaming.

* Habitual elif redunant with the return

8c1b5d37

16 Jan, 2025 15 commits

Added resource class configuration option for `check_circleci_user` job (#32866) · 02a492a8
Sai-Suraj-27 authored 5 months ago
```
Added resource class configuration option for check_circleci_user job.
```
02a492a8
[generate] return Cache object even if passed in a legacy format (#35673) · 94af1c0a
Joao Gante authored 5 months ago
```
* generate returns a Cache object by default

* fix tests

* fix test for encoder-decoder models
```
94af1c0a
[generate] can instantiate `GenerationConfig(cache_implementation="static")` (#35679) · 2818307e
Joao Gante authored 5 months ago
```
fix failing instantiation
```
2818307e
Remove `pt_to_tf` (#35672) · aaa969e9
Joao Gante authored 5 months ago
```
* rm command

* remove exception
```
aaa969e9
🧹 remove `generate`-related objects and methods scheduled for removal in v4.48 (#35677) · 80dbbd10
Joao Gante authored 5 months ago
```
* remove things scheduled for removal

* make fixup
```
80dbbd10

[cache] add a test to confirm we can use cache at train time (#35709) · aeeceb99

Joao Gante authored 5 months ago


* add test

* augment test as suggested

* Update tests/utils/test_modeling_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* rerun tests

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

aeeceb99

Remove batch size argument warning when unjustified (#35519) · 57bf1a12

Quinten Roets authored 5 months ago


* use max batch size

* revert unneccessary change

---------

Co-authored-by: Raushan Turganbay <raushan@huggingface.co>

57bf1a12

Modular: support for importing functions from any file (#35692) · 91be6a5e

Cyril Vallez authored 5 months ago

* fix function imports

* improve comment

* Update modeling_switch_function.py

* make checks more robust

* improvement

* rename

* final test update

91be6a5e

Optimize ForCausalLMLoss by removing unnecessary contiguous() call to reduce... · 8ebe9d71

efsotr authored 5 months ago

Optimize ForCausalLMLoss by removing unnecessary contiguous() call to reduce memory overhead (#35646)

Optimize ForCausalLMLoss by removing unnecessary contiguous() calls to reduce memory overhead

8ebe9d71

Add proper jinja2 error (#35533) · 1302c32a

Matt authored 5 months ago

* Cleanup jinja2 imports

* Raise a proper error if Jinja is missing

* make fixup

1302c32a

[generation] fix type hint (#35725) · 3292e96a
Joao Gante authored 5 months ago
```
fix type hint
```
3292e96a
Fix the bug that `Trainer` cannot correctly call `torch_jit_model_eval` (#35722) · 8b78d9d6
 人民艺术家 authored 5 months ago
```
Fix the bug that the accelerator.autocast does not pass parameters correctly when calling torch_jit_model_eval (#35706)
```
8b78d9d6

Fix condition when GA loss bug fix is not performed (#35651) · 2cbcc587

kang sheng authored 5 months ago

* fix condition when GA loss bug fix is not performed

* max loss diff is 2.29

* fix typo

* add an extra validation that loss should not vary too much

2cbcc587

Fix: Falcon tie_word_embeddings in GGUF (#35715) · fd4f14c9
Mohamed Mekkouri authored 5 months ago
```
* fix falcon tie_word_embeddings

* fix style
```
fd4f14c9

Replace deprecated batch_size with max_batch_size when using HybridCache (#35498) · bef7dded

Mikko Reinikainen authored 5 months ago

* Replace deprecated batch_size with max_batch_size

- Functionality remains the same, because property getter batch_size(self) returned max_batch_size anyways.
- This change just avoids an unnecessary warning about deprecation.

* Use max_batch_size instead of deprecated batch_size with HybridCache

* Use max_batch_size instead of deprecated batch_size with HybridCache

- Change generated code to match original source

bef7dded

15 Jan, 2025 5 commits

Fix typo in /docs/source/ja/model_doc/decision_transformer.md URL (#35705) · 99e0ab6e
hiroaki222 authored 5 months ago
```
doc: Update original code repository URL
```
99e0ab6e
Fix : Nemotron Processor in GGUF conversion (#35708) · 12dfd990
Mohamed Mekkouri authored 5 months ago
```
* fixing nemotron processor

* make style
```
12dfd990

Enable gptqmodel (#35012) · 387663e5

jiqing-feng authored 5 months ago


* gptqmodel

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* update readme

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* gptqmodel need use checkpoint_format (#1)

* gptqmodel need use checkpoint_format

* fix quantize

* Update quantization_config.py

* Update quantization_config.py

* Update quantization_config.py

---------

Co-authored-by: ZX-ModelCloud <zx@modelcloud.ai>
Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>

* Revert quantizer_gptq.py (#2)

* revert quantizer_gptq.py change

* pass **kwargs

* limit gptqmodel and optimum version

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix warning

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix version check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* revert unrelated changes

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* enable gptqmodel tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix requires gptq

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* Fix Transformer compat (#3)

* revert quantizer_gptq.py change

* pass **kwargs

* add meta info

* cleanup

* cleanup

* Update quantization_config.py

* hf_select_quant_linear pass checkpoint_format and meta

* fix GPTQTestCUDA

* Update test_gptq.py

* gptqmodel.hf_select_quant_linear() now does not select ExllamaV2

* cleanup

* add backend

* cleanup

* cleanup

* no need check exllama version

* Update quantization_config.py

* lower checkpoint_format and backend

* check none

* cleanup

* Update quantization_config.py

* fix self.use_exllama == False

* spell

* fix unittest

* fix unittest

---------

Co-authored-by: LRL <lrl@lbx.dev>
Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>

* fix format

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix format again

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* update gptqmodel version (#6)

* update gptqmodel version

* update gptqmodel version

* fix unit test (#5)

* update gptqmodel version

* update gptqmodel version

* "not self.use_exllama" is not equivalent to "self.use_exllama==False"

* fix unittest

* update gptqmodel version

* backend is loading_attibutes (#7)

* fix format and tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix memory check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix device mismatch

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix result check

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/quantizers/quantizer_gptq.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* update tests

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* review: update docs (#10)

* review: update docs (#12)

* review: update docs

* fix typo

* update tests for gptqmodel

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* update document (#9)

* update overview.md

* cleanup

* Update overview.md

* Update overview.md

* Update overview.md

* update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

* Update gptq.md

---------

Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>

* typo

* doc note for asymmetric quant

* typo with apple silicon(e)

* typo for marlin

* column name revert: review

* doc rocm support

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/gptq.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/quantization/overview.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
Co-authored-by: LRL-ModelCloud <165116337+LRL-ModelCloud@users.noreply.github.com>
Co-authored-by: ZX-ModelCloud <zx@modelcloud.ai>
Co-authored-by: Qubitium-ModelCloud <qubitium@modelcloud.ai>
Co-authored-by: ZX-ModelCloud <165115237+ZX-ModelCloud@users.noreply.github.com>
Co-authored-by: LRL <lrl@lbx.dev>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

387663e5

Add future import for Py < 3.10 (#35666) · 615bf9c5

Matt authored 5 months ago

* Add future import for Py < 3.10

* make fixup

* Same issue in convert_olmo2_weights_to_hf.py

615bf9c5

Clean-up composite configs (#34603) · 09d5f762

Raushan Turganbay authored 5 months ago

* remove manual assignment tie-word-embeddings

* remove another unused attribute

* fix tests

* fix tests

* remove unnecessary overwrites

* fix

* decoder=True

* clean pix2struct

* run-all

* forgot `_tied_weights_keys` when adding Emu3

* also Aria + fix-copies

* and clean aria

09d5f762

14 Jan, 2025 7 commits

Enhance DataCollatorForLanguageModeling with Configurable Token Replacement Probabilities (#35251) · c61fcde9

Mahdi Baghbanzadeh authored 5 months ago

* DataCollatorForLanguageModeling class was updated with new parameters that provides more control over the token masking and relacing

* DataCollatorForLanguageModeling class was updated with new parameters that provides more control over the token masking and relacing

* Addressed review comments, modified the docstring and made a test for the DataCollatorForLanguageModeling

c61fcde9

Enhanced Installation Section in README.md (#35094) · b0cdbd91

Ego Joseph Oborakpororo authored 5 months ago

* Update README.md

Enhanced installation section with troubleshooting, GPU setup, and OS-specific details.

* Update README.md

Enhanced installation section with troubleshooting, GPU setup, and OS-specific details.

* Update installation.md

Updated installation.md to include virtual environment and GPU setup instructions.

* Update installation.md

Updated installation.md to include virtual environment and GPU setup instructions.

* Update installation.md

Updated installation.md to include virtual environment, troubleshooting and GPU setup instructions.

* Update installation.md

Updated installation.md to include virtual environment, troubleshooting functions and GPU setup instructions.

* Update installation.md

Updated installation.md to include virtual environment, troubleshooting functions and GPU setup instructions.

* Update installation.md

Updated installation.md to include virtual environment, troubleshooting functions and GPU setup instructions.

* Update README.md

Removed numbering from README.md.

* Update README.md

Removed unnecessary "a)" formatting as per maintainer feedback.

* Update README.md

Added blank lines around code snippets for better readability.

* Update README.md

Removed the line "b) Install a backend framework:" from README.md as per feedback.

* Update README.md

Simplified "For Windows:" to "Windows" in README.md as per feedback as well as "For macOS/Linux:" to "macOS/Linux"

* Update README.md

Removed unnecessary heading and retained valid code snippet.

* Update README.md

Removed unnecessary heading "d) Optional: Install from source for the latest updates" as per feedback.

* Update README.md

Removed "GPU Setup (Optional)" section to align with minimal design feedback.

* Update installation.md

Removed "Create and Activate a Virtual Environment" section from installation.md as per feedback.

* Update installation.md

Adjusted "Troubleshooting" to a second-level heading and added an introductory line as per feedback.

* Update installation.md

Updated troubleshooting section with simplified headings and formatted code blocks as per feedback.

* Update installation.md

Integrated GPU setup instructions into the "Install with pip" section for better content flow.

* Update README.md

Removed Troubleshooting section from README.md for minimalism as per maintainer feedback.

b0cdbd91

Fix : add require_read_token for gemma2 gated model (#35687) · a11041ff
Mohamed Mekkouri authored 5 months ago
```
fix gemma2 gated model test
```
a11041ff
Fix expected output for ggml test (#35686) · df2a812e
Mohamed Mekkouri authored 5 months ago
```
fix expected output
```
df2a812e
Fix : HQQ config when hqq not available (#35655) · 05063651
Mohamed Mekkouri authored 5 months ago
```
* fix

* make style

* adding require_hqq

* make style
```
05063651

Update torchao.md: use auto-compilation (#35490) · 715fdd64

Martin authored 5 months ago


* Update torchao.md: use auto-compilation

* Update torchao.md: indicate updating transformers to the latest

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

715fdd64

Fix : adding einops lib in the CI docker for some bitsandbytes tests (#35652) · 4b8d1f7f
Mohamed Mekkouri authored 5 months ago
```
* fix docker

* fix
```
4b8d1f7f

13 Jan, 2025 10 commits

Fix `zero_shot_image_classification` documentation guide link in SigLIP (#35671) · 34f76bb6
RTrace authored 5 months ago

34f76bb6

Add-helium (#35669) · c23a1c19

Arthur authored 5 months ago


* Add the helium model.

* Add a missing helium.

* And add another missing helium.

* Use float for the rmsnorm mul.

* Add the Helium tokenizer converter.

* Add the pad token as suggested by Arthur.

* Update the RMSNorm + some other tweaks.

* Fix more rebase issues.

* fix copies and style

* fixes and add helium.md

* add missing tests

* udpate the backlink

* oups

* style

* update init, and expected results

* small fixes

* match test outputs

* style fixup, fix doc builder

* add dummies and we should be good to go!z

* update sdpa and fa2 documentation

---------

Co-authored-by: laurent <laurent.mazare@gmail.com>

c23a1c19

[i18n-ar] Translated file : docs/source/ar/tasks/token_classification.md into Arabic (#35193) · a3f82328

Ahmed Almaghz authored 5 months ago


* Create token_classification.md

* Update token_classification.md

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/tasks/token_classification.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update _toctree.yml

---------

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

a3f82328

[tests] make cuda-only tests device-agnostic (#35607) · 2fa876d2
Fanli Lin authored 5 months ago
```
* intial commit

* remove unrelated files

* further remove

* Update test_trainer.py

* fix style
```
2fa876d2
[`Compile`] Only test compiling model forward pass (#35658) · e6f9b034
Arthur authored 5 months ago
```
* rename test to only compile forward!

* style emu
```
e6f9b034

Enable different torch dtype in sub models (#34873) · 84a67891

Raushan Turganbay authored 5 months ago

* fix

* fix test

* add tests

* add more tests

* fix tests

* supposed to be a torch.dtype test

* handle BC and make fp32 default

84a67891

[`Phi`] bias should be True (#35650) · 87089176
Arthur authored 5 months ago
```
bias should be True
```
87089176

Removed some duplicated code (#35637) · 91f14f1f

Sai-Suraj-27 authored 5 months ago


* Removed duplicate class field definition.

* Removed duplicate code in try-except block.

---------

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>

91f14f1f

Fix whisper compile (#35413) · b8c34d97
jiqing-feng authored 5 months ago
```
Fix compile error

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
```
b8c34d97
Fix device in rope module when using dynamic updates (#35608) · cd44bdb4
Cyril Vallez authored 5 months ago
```
fix rope device
```
cd44bdb4