Commits · fix_chinese_clip · 某某某 / transformers-new

05 Aug, 2024 3 commits
- exclude ChinesecliptextTransformer from checks · 09af9ead
  Pablo Montalvo authored 10 months ago
  
  09af9ead
- Keep ChineseCLIPTextTransformer internal · e3b68aac
  Pablo Montalvo authored 10 months ago
  
  e3b68aac
- Update src/transformers/models/chinese_clip/modeling_chinese_clip.py · d19a0cd1
  Pablo Montalvo authored 10 months ago
```
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
```
  d19a0cd1
29 Jul, 2024 7 commits

fix imports · 4a79ffb0
Pablo Montalvo authored 10 months ago

4a79ffb0
fix tests · fe3f5beb
Pablo Montalvo authored 10 months ago

fe3f5beb
rename ChineseTextModel · cd6757fc
Pablo Montalvo authored 10 months ago

cd6757fc
update doc · 0c882f2f
Pablo Montalvo authored 10 months ago

0c882f2f
Optimize t5 tokenize logic to avoid redundant calls (#32270) · 5019aabf
leejet authored 10 months ago
```
* Optimize t5 tokenize logic to avoid redundant calls

* fix and overwrite copies
```
5019aabf
Upload new model failure report to Hub (#32264) · f2122cc6
Yih-Dar authored 10 months ago
```
upload

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f2122cc6

Bloom support for cache class (#31445) · f7396876

Raushan Turganbay authored 10 months ago


* bloom dynamic cache

* bloom follows standard cache format

* no skips for bloom anymore

* use cache position when possible

* clean up

* codestyle

* Update src/transformers/models/bloom/modeling_bloom.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/bloom/modeling_bloom.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update src/transformers/models/bloom/modeling_bloom.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* pr comments

* isinstance fix

* address comments

* make musicgen test happy

* [run-slow] bloom

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

f7396876

27 Jul, 2024 1 commit
- Llama 3.1: replace for loop by tensor ops at inv_freq initialization (#32244) · 44f6fdd7
  Joao Gante authored 11 months ago
```
* replace for loop by tensor ops

* rm assert; readability
```
  44f6fdd7
26 Jul, 2024 10 commits

More flexible trigger condition (#32251) · 8da90687
Yih-Dar authored 11 months ago
```
update

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8da90687
Flash-Attn: fix generation when no attention mask or no pading (#32241) · 81233c06
Raushan Turganbay authored 11 months ago
```
* fix

* fix prev test (half of failures)

* [run-slow] llama, gemma2

* [run-slow] llama, gemma2
```
81233c06

[tests] fix `static` cache implementation is not compatible with... · 27c7f971

Fanli Lin authored 11 months ago

[tests] fix `static` cache implementation is not compatible with `attn_implementation==flash_attention_2` (#32039)

* add flash attention check

* fix

* fix

27c7f971

Add check for `target_sizes is None` in `post_process_image_guided_detection` for owlv2 (#31934) · 5f841c74

Connor Anderson authored 11 months ago

* Add check for target_sizes is None in post_process_image_guided_detection

* Make sure Owlvit and Owlv2 in sync

* Fix incorrect indentation; add check for correct size of target_sizes

5f841c74

Adds: extra_repr for RMSNorm layers in most models (#32204) · f9756d9e

Rohit Dwivedula authored 11 months ago

* adds: extra_repr() to RMSNorm layers in multiple models

* adds: extra_repr for deprecated models as well

* formatting as per style guide

f9756d9e

Refactor: Removed un-necessary `object` base class (#32230) · b8e5cd53
Sai-Suraj-27 authored 11 months ago
```
* Refactored to remove un-necessary object base class.

* small fix.
```
b8e5cd53

don't log base model architecture in wandb if log model is false (#32143) · 1c7ebf1d

João Nadkarni authored 11 months ago


* don't log base model architecture in wandb is log model is false

* Update src/transformers/integrations/integration_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* convert log model setting into an enum

* fix formatting

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

1c7ebf1d

Resize embeds with DeepSpeed (#32214) · c46edfb8
Raushan Turganbay authored 11 months ago
```
* fix resize when deepspeed

* deepsped uses new embeds

* we needed this
```
c46edfb8
Llava: generate without images (#32183) · fad15fba
Raushan Turganbay authored 11 months ago
```
* llava w/o images

* tests
```
fad15fba

Generation: stop at `eos` for assisted decoding (#31301) · 4ab33c2d

Raushan Turganbay authored 11 months ago


* fix

* move changes to prompt lookup

* add test

* set eos in assistant model

* style

* fix flakiness

* changes for new `main`

* Update tests/generation/test_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/generation/test_utils.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add comment to explain

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

4ab33c2d

25 Jul, 2024 9 commits
- Fix code snippet for Grounding DINO (#32229) · 9d6c0641
  Pavel Iakubovskii authored 11 months ago
```
Fix code snippet for grounding-dino
```
  9d6c0641
- Allow a specific microphone to be used by the ffmpeg audio pipeline utility... · 3a83ec48
  jrhe authored 11 months ago
```
Allow a specific microphone to be used by the ffmpeg audio pipeline utility functions. Default to using the currently active microphone on Mac (#31846)

* use currently active microphone on mac for ffmpeg_microphone

* Allow ffmpeg_microphone device to be specified

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
```
  3a83ec48
- translate philosophy.md to chinese (#32177) · 6ed0bf1e
  Huazhong Ji authored 11 months ago
```
* translate philosophy.md to chinese

* add the missing link
```
  6ed0bf1e
- Follow up for #31973 (#32025) · df6eee92
  Yih-Dar authored 11 months ago
```
* fix

* [test_all] trigger full CI

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  df6eee92
- [warnings] fix E721 warnings (#32223) · de231889
  Kashif Rasul authored 11 months ago
```
fix E721 warnings
```
  de231889
- [BigBird Pegasus] set _supports_param_buffer_assignment to False (#32222) · 9b9a54e6
  Kashif Rasul authored 11 months ago
```
set _supports_param_buffer_assignment to False
```
  9b9a54e6
- Update question_answering.py (#32208) · 1ecedf1d
  Austin authored 11 months ago
  
  1ecedf1d
- remove unnecessary guard code related with pytorch versions 1.4.2 ~ 1.7.0 (#32210) · f53a5dec
  Huazhong Ji authored 11 months ago
```
remove unnecessary guard code related with pytorch versions 1.4.2 ~
1.7.0
```
  f53a5dec
- [whisper] fix short-form output type (#32178) · 5658e749
  Sanchit Gandhi authored 11 months ago
```
* [whisper] fix short-form output type

* add test

* make style

* update long-form tests

* fixes

* last fix

* finalise test
```
  5658e749
24 Jul, 2024 10 commits

fix: Replaced deprecated `unittest method` with the correct one (#32198) · 85a1269e
Sai-Suraj-27 authored 11 months ago
```
Replaced deprecated unittest method with the correct one.
```
85a1269e

No more default chat templates (#31733) · edd68f4e

Matt authored 11 months ago

* No more default chat templates

* Add the template to the GPT-SW3 tests since it's not available by default now

* Fix GPT2 test

* Fix Bloom test

* Fix Bloom test

* Remove default templates again

edd68f4e

Support dequantizing GGUF FP16 format (#31783) · 1c122a46
Penut Chen authored 11 months ago
```
* support gguf fp16

* support gguf bf16 with pytorch

* add gguf f16 test

* remove bf16
```
1c122a46
Fix float8_e4m3fn in modeling_utils (#32193) · af0e4b7b
Marc Sun authored 11 months ago
```
* Fix float8_e4m3fn in modeling_utils

* style

* fix

* comment
```
af0e4b7b
Fix resize embedding with Deepspeed (#32192) · 1392a686
Raushan Turganbay authored 11 months ago
```
fix resize when deepspeed
```
1392a686
let's not warn when someone is running a forward (#32176) · 8d2534c4
Arthur authored 11 months ago
```
* let's not warn when someone is running a foward without cache + self.training

* more models

* fixup
```
8d2534c4

RoPE: relaxed rope validation (#32182) · e0182f3b

Joao Gante authored 11 months ago

* relaxed rope check

* lets also accept rope_type=None, defaulting to the original implementation

* type and rope_type can coexist

e0182f3b

Remove conversational pipeline tests (#32099) · 165116bc
amyeroberts authored 11 months ago
```
Remove conversation pipeline tests
```
165116bc

Update qwen2.md (#32108) · 5f4ee98a

Dr. Artificial曾小健 authored 11 months ago

* Update qwen2.md

outdated description

* Update qwen2.md

amended

* Update qwen2.md

Update

* Update qwen2.md

fix wrong version code, now good to go

5f4ee98a

fix: default value reflects the runtime environment variables rather than the... · 8678879f

조준래 authored 11 months ago

fix: default value reflects the runtime environment variables rather than the ones present at import time. (#32153)

* fix: default value reflects the runtime environment variables rather than the ones present at import time.

* Fix: Change `deterministic` to None by default; use env var if None

8678879f