- 30 Oct, 2024 2 commits
- 28 Oct, 2024 4 commits
- ydshieh authored
- Yih-Dar authored
* update
* update
* update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Yih-Dar authored
0.21
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Ilyas Moutawwakil authored
* fix
* fix and test use_cache test
* style
* remove atol
- 25 Oct, 2024 10 commits
- Steven Liu authored
cache
- Rudy Delouya authored
- Yih-Dar authored
* update
* update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Matthew Douglas authored
* Fix bnb training test: compatibility with OPTSdpaAttention
- Joao Gante authored
- Joao Gante authored
* better example
* Update src/transformers/generation/configuration_utils.py
* Update src/transformers/generation/logits_process.py
* nits
- Yih-Dar authored
* no filter
* no filter
* no filter
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Raushan Turganbay authored
* fix right pad llavas
* device mismatch
- Ilyas Moutawwakil authored
* fix onnx non-exportable inplace op
* mistral, qwen2, qwen2_vl, starcoder2
* fixup copies
- Yoni Gozlan authored
* add support for non-nested images and add tests
* add error-scenario tests
* fix style
* added single-image and no-image cases to error tests
- 24 Oct, 2024 18 commits
- Cyril Vallez authored
* Fix duplicated
* fix import
- Yih-Dar authored
* update
* update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Cyril Vallez authored
* Correct the new defaults
* CIs
* add check
* Update utils.py
* Update utils.py
* Add the max_length in generate test checking shape without passing length
* style
* CIs
* fix fx CI issue
- Michael Benayoun authored
* Fix FX
* Unskip tests
- Benjamin Bossan authored
When loading a LoRA adapter, there was previously only a warning when the checkpoint contained unexpected keys. Now there is also a warning when keys are missing. This change is consistent with https://github.com/huggingface/peft/pull/2118 in PEFT and the planned PR https://github.com/huggingface/diffusers/pull/9622 in diffusers. Apart from this change, the error message for unexpected keys was slightly altered for consistency (it should be more readable now). Also, besides adding a test for the missing-keys warning, a test for the unexpected-keys warning was added, as that was missing so far.
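A minimal sketch of the new behavior, assuming a hypothetical adapter repo "user/opt-350m-lora" whose checkpoint is missing some LoRA keys:

```python
# Sketch only: the adapter repo name is hypothetical and the exact warning
# wording may differ.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")

# Before this change, only unexpected keys in the adapter checkpoint triggered
# a warning; now missing keys are reported too, e.g.:
#   "Loading adapter weights from user/opt-350m-lora led to missing keys: ..."
model.load_adapter("user/opt-350m-lora")
```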
- Yoni Gozlan authored
Fix: accept any common kwargs
- Winston H. authored
refactor: remove redundant if-condition and improve type correctness for `convert_tokens_to_ids` (#34030)
* chore: remove redundant if-condition
* fix: import `Iterable`
- Vijay authored
* Add code sample docstrings and checkpoint reference for GLM models
* Update modular_glm.py
* Update modeling_glm.py
- Yoni Gozlan authored
fix pil_torch_interpolation_mapping import
- 김준재 authored
* add: GGUFT5Converter
* add: tensor mapping for t5
* add: test code for t5
* fix: Remove whitespace from blank line
* add: t5 fp16 tests
* fix: whitespace formatting
* fix: minor formatting
* fix: test every weight
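A minimal usage sketch for the converter this commit adds; the repo and file names below are hypothetical placeholders:

```python
# Sketch only: repo_id and gguf_file are hypothetical.
from transformers import AutoTokenizer, T5ForConditionalGeneration

repo_id = "someuser/t5-small-gguf"
gguf_file = "t5-small-f16.gguf"

# GGUF tensors are dequantized and mapped back onto the transformers T5 layout.
tokenizer = AutoTokenizer.from_pretrained(repo_id, gguf_file=gguf_file)
model = T5ForConditionalGeneration.from_pretrained(repo_id, gguf_file=gguf_file)
```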
- Thomas Furtner authored
- Lysandre Debut authored
* Zamba is an LM
* Addition
- Raushan Turganbay authored
fix
- 王一苇 authored
* translated gguf.md into Chinese
* Apply suggestions from code review: I have updated the PR accordingly. Thank you very much for the detailed guidance, and I'll pay more attention to the details next time.
* Apply suggestions from code review
Co-authored-by: Isotr0py <2037008807@qq.com>
- Arthur Zucker authored
- Yih-Dar authored
* drop python 3.8
* update docker files
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Arthur authored
* be nice to our users
* nit
* fixup
* default to -1
* oops
* turbo nit
* auto infer framework
- Abhishek Maurya authored
Remove graph breaks for torch.compile() in flash_attention_forward when Llama model is padding-free tuned (#33932)
* fix: fixes for graph breaks
* fix: formatting
* fix: import error
* fix: Add Fa2Kwargs
* PR changes (several rounds; one round reverted in commit 39d2868e5c93cc5f3f3c7c6ff981b66614c0e0e4 and redone)
* fix: FlashAttentionKwarg
* addition of documentation
* change in _flash_attention_forward
* make fix-copies (later reverted)
* fix copies
* style
* loss kwargs typing
* style and pull latest changes
Signed-off-by: Abhishek <maurya.abhishek@ibm.com>
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
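A minimal sketch of the padding-free call path this commit keeps compile-friendly: packed sequences are described by cumulative sequence lengths instead of an attention mask, so flash attention can skip the pad/unpad round-trip that previously caused torch.compile() graph breaks. Tensor values are illustrative only.

```python
# Sketch only: values are illustrative; `model` would be a padding-free-tuned
# Llama loaded with attn_implementation="flash_attention_2".
import torch

packed = {
    # two sequences of lengths 4 and 3 packed into a single row, no padding
    "input_ids": torch.tensor([[101, 7, 9, 102, 101, 4, 102]]),
    "position_ids": torch.tensor([[0, 1, 2, 3, 0, 1, 2]]),
    # cumulative sequence lengths mark the per-sequence boundaries
    "cu_seq_lens_q": torch.tensor([0, 4, 7], dtype=torch.int32),
    "cu_seq_lens_k": torch.tensor([0, 4, 7], dtype=torch.int32),
    "max_length_q": 4,
    "max_length_k": 4,
}
# These kwargs flow through the model's forward into _flash_attention_forward
# via the FlashAttentionKwargs typed dict added in this PR:
# out = model(**packed)
```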
- 23 Oct, 2024 6 commits
- Joao Gante authored
* Add SynthIDTextWatermarkLogitsProcessor
* Resolving comments
* Improving SynthIDWatermark tests
* switch to PT version
* detector as pretrained model + style
* update training + style
* rebase
* Update logits_process.py
* Shift detector training to wikitext negatives and stabilize with lower learning rate
* Clean up
* in for 7B
* support python 3.8
* README and final cleanup
* HF Hub upload and initialize
* Update requirements for synthid_text
* Adding SynthIDTextWatermarkDetector
* Detector testing
* Documentation changes
* Copyrights fix
* Fix detector api
* ironing out errors
* training checks
* make fixup and make fix-copies
* docstrings and add to docs
* copyright
* BC
* test docstrings
* move import
* protect type hints
* top level imports
* watermarking example
* direct imports
* tpr fpr meaning
* process_kwargs
* SynthIDTextWatermarkingConfig docstring
* assert -> exception
* example updates
* no immutable dict (can't be serialized)
* pack fn
* einsum equivalent
* import order
* fix test on gpu
* add detector example
Co-authored-by: Sumedh Ghaisas <sumedhg@google.com>
Co-authored-by: Marc Sun <marc@huggingface.co>
Co-authored-by: sumedhghaisas2 <138781311+sumedhghaisas2@users.noreply.github.com>
Co-authored-by: raushan <raushan@huggingface.co>
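A minimal usage sketch for the new watermarking API; the model choice and key values are illustrative:

```python
# Sketch only: model and keys are illustrative; keys should stay private.
from transformers import AutoModelForCausalLM, AutoTokenizer, SynthIDTextWatermarkingConfig

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2-2b-it")
model = AutoModelForCausalLM.from_pretrained("google/gemma-2-2b-it")

# The watermark is keyed by a list of integers and an n-gram length.
watermarking_config = SynthIDTextWatermarkingConfig(
    keys=[654, 400, 836, 123, 340, 443, 597, 160, 57, 29],
    ngram_len=5,
)

inputs = tokenizer(["Write a haiku about autumn."], return_tensors="pt")
out = model.generate(**inputs, watermarking_config=watermarking_config,
                     do_sample=True, max_new_tokens=20)
print(tokenizer.batch_decode(out, skip_special_tokens=True))
```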
- Arthur authored
* don't trigger always
* fix
* oops
* update
* ??
* ?
* ouch
- Yih-Dar authored
skip
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Zach Mueller authored
* Enable grad accum fix across all models + trainer fully in forward()
* handle peft case
* Account for DDP: need to run scale tests
* Use accelerator state
* Quality
* Guard
* Experiment w/ only fairseq fix
* Fairseq only
* Revert multiply_grads fix
* Mult by grad accum to fully bring back solution
* Style
* Good to go now
* Skip fx tests for now
* Bookmark
* Working now
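A minimal sketch of the idea behind the fix (illustrative, not the exact Trainer internals): under gradient accumulation the loss is summed and divided by the token count of the whole accumulation window rather than averaged per micro-batch, so gradients match full-batch training.

```python
# Sketch only: num_items_in_batch is the total number of label tokens across
# all micro-batches in one accumulation window, as computed by the trainer.
import torch.nn.functional as F

def causal_lm_loss(logits, labels, num_items_in_batch=None):
    # Averaging per micro-batch would over-weight short batches when
    # gradients are accumulated, so sum instead when the count is known.
    loss = F.cross_entropy(
        logits.view(-1, logits.size(-1)),
        labels.view(-1),
        ignore_index=-100,
        reduction="sum" if num_items_in_batch is not None else "mean",
    )
    if num_items_in_batch is not None:
        loss = loss / num_items_in_batch  # normalize over the whole window
    return loss
```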
- Aymeric Roucher authored
Support boolean tool arguments
- Filippos Ventirozos authored
* Added Deberta model type for 'add_prefix_space' functionality
* housekeeping
Co-authored-by: Filippos Ventirozos <filippos.ventirozos@autotrader.co.uk>