- 23 Jan, 2025 8 commits
-
-
Yosshi999 authored
Fix contamination and missing paragraph in translation
-
Alex Brooks authored
* Add multimodal granite support Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> Support multiple image feature layres Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> * Remove failing validation for visual encoders with no cls Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> * Update llava based models / configs to support list of feature layers Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> * Add tests for multiple feature layers Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> * Use conditional instead of except for misaligned feature shapes Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> * crop cls from each hidden state Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> * Fix formatting Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> * Support single vision feature int in vipllava Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> * Fix typo in vision feature selection strategy validation Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> * Add tentative integration test for granite vision models Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> * Add granite vision docs Replace multimodal granite refs with granite vision Add granite vision / llava next alias Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> * Use image url in granitevision example Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com> --------- Signed-off-by:
Alex-Brooks <Alex.Brooks@ibm.com> Signed-off-by:
Alex-Brooks <Alex.brooks@ibm.com>
-
Arthur authored
add tooslow for the fat ones
-
Jack Roberts authored
* rename tokenizer to processing_class in WandbCallback.on_train_end * rename tokenizer to processing_class in ClearMLCallback and DVCLiveCallback
-
張庭瑜 authored
* Fix GA loss for Deepspeed * Turn off loss scaling in DeepSpeed engine by scale_wrt_gas * Add comment linking to PR
-
ShuaiBai623 authored
* add qwen2.5vl * fix * pass check table * add modular file * fix style * Update src/transformers/models/qwen2_5_vl/modeling_qwen2_5_vl.py Co-authored-by:
Minho Shim <6764739+minostauros@users.noreply.github.com> * Update src/transformers/models/qwen2_5_vl/modeling_qwen2_5_vl.py Co-authored-by:
Minho Shim <6764739+minostauros@users.noreply.github.com> * Update src/transformers/models/qwen2_5_vl/modeling_qwen2_5_vl.py Co-authored-by:
Minho Shim <6764739+minostauros@users.noreply.github.com> * padd copy check * use modular * fix * fix * fix * update flashatt2&sdpa support_list * Update docs/source/en/_toctree.yml Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/qwen2_5_vl.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/qwen2_5_vl.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/qwen2_5_vl.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/model_doc/qwen2_5_vl.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update src/transformers/models/qwen2_5_vl/modular_qwen2_5_vl.py Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * update config * update * fix hf path * rename Qwen2_5_VLVideosKwargs * fix * fix * update * excuted modular * rollback init * fix * formated * simpler init * fix * fix * fix * fix * fix * update docs * fix * fix * update Qwen2VLRotaryEmbedding for yarn * fix --------- Co-authored-by:
Minho Shim <6764739+minostauros@users.noreply.github.com> Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by:
gewenbin0992 <gewenbin292@163.com> Co-authored-by:
gewenbin0992 <67409248+gewenbin0992@users.noreply.github.com>
-
Cyril Vallez authored
* support * Update modeling_utils.py * style * most models * Other models * fix-copies * tests + generation utils
-
Arthur authored
remove class from tests
-
- 22 Jan, 2025 12 commits
-
-
Marc Sun authored
fix type
-
Mohit Sharma authored
Disable FA backend for SDPA on AMD GPUs (PyTorch < 2.4.1)
-
LRL-ModelCloud authored
convert_model method of optimum only accepts a single nn.Module type model parameter for versions less than 1.23.99.
-
Joao Gante authored
docs fix
-
Isotr0py authored
fix gemma2 head dim Signed-off-by:
Isotr0py <2037008807@qq.com> Co-authored-by:
Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
-
Joao Gante authored
* tmp commit * add working chat * add docts * docs 2 * use auto dtype by default
-
Mohamed Mekkouri authored
fix nemotron gguf
-
Joao Gante authored
missing import
-
Joao Gante authored
fix gpt2 generation tests
-
Yih-Dar authored
fix Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Raushan Turganbay authored
* init cache on meta device * offloaded static + enable tests * tests weren't running before :( * update * fix mamba * fix copies * update * address comments and fix tests * fix copies * Update src/transformers/cache_utils.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * update * mamba fix --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Yih-Dar authored
fix Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
- 21 Jan, 2025 20 commits
-
-
CalOmnie authored
* Remove pyav pin to allow python 3.11 to be used * Run make fixup --------- Co-authored-by:
Louis Groux <louis.cal.groux@gmail.com>
-
Joao Gante authored
* remove traces of the old deprecated benchmarks * also remove old tf benchmark example, which uses deleted code * run doc builder
-
eustlb authored
update values for t4
-
Cyril Vallez authored
* start a nice doc * keep improving the doc * Finalize doc * Update modular_transformers.md * apply suggestion
-
Yoni Gozlan authored
* add qwen2_vl image processor fast * add device to ImagesKwargs * remove automatic fix copies * fix fast_is_faster_than_slow * remove unnecessary import
-
eustlb authored
-
Ahmed Almaghz authored
* إضافة الترجمة العربية: masked_language_modeling.md * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/masked_language_modeling.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update _toctree.yml * Update _toctree.yml * Add language_modeling.md * Add Sequence_classifiation.md * Update _toctree.yml --------- Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com>
-
v2ray authored
-
Lucain authored
* Remove deprecated get_cached_models * imports
-
InfroLab authored
Fixed typo in version number for IPEX backend required minimal autoawq version
-
Mohamed Mekkouri authored
* fix bloom ggml * fix falcon output * make style
-
Pedro Cuenca authored
Works for fine-tuned or exported models: ```py from transformers import AutoModelForImageClassification checkpoint = "timm/vit_base_patch16_224.augreg2_in21k_ft_in1k" model = AutoModelForImageClassification.from_pretrained(checkpoint) model.push_to_hub("pcuenq/tw1") ``` The uploaded model will now show snippets for both the timm and the transformers libraries.
-
fzyzcjy authored
* var * more * test
-
Louie Tsai authored
* add a new example for flax inference cases * Update examples/flax/language-modeling/README.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update examples/flax/language-modeling/README.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * fix for "make fixup" --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Aritra Roy Gosthipaty authored
* adding blog post to model doc * Update docs/source/en/model_doc/timm_wrapper.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * review suggestions * review suggestions --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Yih-Dar authored
* fix * fix * skip * better error message --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* First draft * Add equivalence test * Update docstrings * Add tests * Use numpy * Fix tests * Improve variable names * Improve docstring * Add link * Remove script * Add copied from * Address comment * Add note in docs * Add docstring, data format * Improve test * Add test * update * Update src/transformers/models/llava/image_processing_llava.py Co-authored-by:
Pavel Iakubovskii <qubvel@gmail.com> * Update src/transformers/models/llava/image_processing_llava.py Co-authored-by:
Pavel Iakubovskii <qubvel@gmail.com> * loop once only --------- Co-authored-by:
raushan <raushan@huggingface.co> Co-authored-by:
Raushan Turganbay <raushan.turganbay@alumni.nu.edu.kz> Co-authored-by:
Pavel Iakubovskii <qubvel@gmail.com>
-
ivarflakstad authored
-
Raushan Turganbay authored
* fix "test_chat_template_dict" in llava_onevision * Update src/transformers/models/llava_next_video/processing_llava_next_video.py Co-authored-by:
Pavel Iakubovskii <qubvel@gmail.com> * get one video calles once --------- Co-authored-by:
Pavel Iakubovskii <qubvel@gmail.com>
-
Cyril Vallez authored
deterministic sort
-