- 05 Aug, 2024 3 commits
-
-
Pablo Montalvo authored
-
Pablo Montalvo authored
-
Pablo Montalvo authored
Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 29 Jul, 2024 7 commits
-
-
Pablo Montalvo authored
-
Pablo Montalvo authored
-
Pablo Montalvo authored
-
Pablo Montalvo authored
-
leejet authored
* Optimize t5 tokenize logic to avoid redundant calls * fix and overwrite copies
-
Yih-Dar authored
upload Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Raushan Turganbay authored
* bloom dynamic cache * bloom follows standard cache format * no skips for bloom anymore * use cache position when possible * clean up * codestyle * Update src/transformers/models/bloom/modeling_bloom.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/bloom/modeling_bloom.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update src/transformers/models/bloom/modeling_bloom.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * pr comments * isinstance fix * address comments * make musicgen test happy * [run-slow] bloom --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 27 Jul, 2024 1 commit
-
-
Joao Gante authored
* replace for loop by tensor ops * rm assert; readability
-
- 26 Jul, 2024 10 commits
-
-
Yih-Dar authored
update Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Raushan Turganbay authored
* fix * fix prev test (half of failures) * [run-slow] llama, gemma2 * [run-slow] llama, gemma2
-
Fanli Lin authored
[tests] fix `static` cache implementation is not compatible with `attn_implementation==flash_attention_2` (#32039) * add flash attention check * fix * fix
-
Connor Anderson authored
* Add check for target_sizes is None in post_process_image_guided_detection * Make sure Owlvit and Owlv2 in sync * Fix incorrect indentation; add check for correct size of target_sizes
-
Rohit Dwivedula authored
* adds: extra_repr() to RMSNorm layers in multiple models * adds: extra_repr for deprecated models as well * formatting as per style guide
-
Sai-Suraj-27 authored
* Refactored to remove un-necessary object base class. * small fix.
-
João Nadkarni authored
* don't log base model architecture in wandb is log model is false * Update src/transformers/integrations/integration_utils.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * convert log model setting into an enum * fix formatting --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Raushan Turganbay authored
* fix resize when deepspeed * deepsped uses new embeds * we needed this
-
Raushan Turganbay authored
* llava w/o images * tests
-
Raushan Turganbay authored
* fix * move changes to prompt lookup * add test * set eos in assistant model * style * fix flakiness * changes for new `main` * Update tests/generation/test_utils.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update tests/generation/test_utils.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * add comment to explain --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 25 Jul, 2024 9 commits
-
-
Pavel Iakubovskii authored
Fix code snippet for grounding-dino
-
jrhe authored
Allow a specific microphone to be used by the ffmpeg audio pipeline utility functions. Default to using the currently active microphone on Mac (#31846) * use currently active microphone on mac for ffmpeg_microphone * Allow ffmpeg_microphone device to be specified Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Huazhong Ji authored
* translate philosophy.md to chinese * add the missing link
-
Yih-Dar authored
* fix * [test_all] trigger full CI --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Kashif Rasul authored
fix E721 warnings
-
Kashif Rasul authored
set _supports_param_buffer_assignment to False
-
Austin authored
-
Huazhong Ji authored
remove unnecessary guard code related with pytorch versions 1.4.2 ~ 1.7.0
-
Sanchit Gandhi authored
* [whisper] fix short-form output type * add test * make style * update long-form tests * fixes * last fix * finalise test
-
- 24 Jul, 2024 10 commits
-
-
Sai-Suraj-27 authored
Replaced deprecated unittest method with the correct one.
-
Matt authored
* No more default chat templates * Add the template to the GPT-SW3 tests since it's not available by default now * Fix GPT2 test * Fix Bloom test * Fix Bloom test * Remove default templates again
-
Penut Chen authored
* support gguf fp16 * support gguf bf16 with pytorch * add gguf f16 test * remove bf16
-
Marc Sun authored
* Fix float8_e4m3fn in modeling_utils * style * fix * comment
-
Raushan Turganbay authored
fix resize when deepspeed
-
Arthur authored
* let's not warn when someone is running a foward without cache + self.training * more models * fixup
-
Joao Gante authored
* relaxed rope check * lets also accept rope_type=None, defaulting to the original implementation * type and rope_type can coexist
-
amyeroberts authored
Remove conversation pipeline tests
-
Dr. Artificial曾小健 authored
* Update qwen2.md outdated description * Update qwen2.md amended * Update qwen2.md Update * Update qwen2.md fix wrong version code, now good to go
-
조준래 authored
fix: default value reflects the runtime environment variables rather than the ones present at import time. (#32153) * fix: default value reflects the runtime environment variables rather than the ones present at import time. * Fix: Change `deterministic` to None by default; use env var if None
-