- 18 Feb, 2025 7 commits
-
-
Hugging Face Bot (RC Testing) authored
-
Parteek authored
* Added Support for Custom Quantization
* Update code
* code reformatted
* Updated Changes
* Updated Changes

---------

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
-
ivarflakstad authored
Flatten the expected slice tensor
-
Damiano Amatruda authored
* Fix XGLM loss computation (PyTorch and TensorFlow)
* Update expected output string in XGLM sample test

  This updates the expected output string of test_xglm_sample for torch 2.0 to the correct one and removes the one for torch 1.13.1 + cu116 (transformers moved to torch 2.0 with PR #35358).
* Update expected output IDs in XGLM generation test
-
Mehant Kammakomati authored
* feat: add support for tensor parallel flow using accelerate
* fix: add tp degree to env variable
* fix: add version check for accelerate to allow TP
* docs: tensor parallelism
* nit: rename plugin name
* fix: guard accelerate version before allow tp
* docs: add more docs and updates related to TP

---------

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
-
Raushan Turganbay authored
* fix * nit * no logits processor needed * two more tests on assisted decoding
-
andrewor14 authored
**Summary:** TorchAoConfig optionally contains a `torchao.dtypes.Layout` object which is a dataclass and not JSON serializable, and so the following fails:

```
import json
from torchao.dtypes import TensorCoreTiledLayout
from transformers import TorchAoConfig

config = TorchAoConfig("int4_weight_only", layout=TensorCoreTiledLayout())
config.to_json_string()
json.dumps(config.to_dict())
```

This also causes `quantized_model.save_pretrained(...)` to fail because the first step of this call is to JSON serialize the config.

Fixes https://github.com/pytorch/ao/issues/1704.

**Test Plan:**
python tests/quantization/torchao_integration/test_torchao.py -k test_json_serializable

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
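
The failure in this commit message is the general behaviour of `json.dumps` when it meets a dataclass instance. The following is a minimal sketch of one way around it, using only the standard library. `TiledLayout`, `layout_to_dict`, and `layout_from_dict` are hypothetical names chosen for illustration; this is not necessarily how the fix in this commit is implemented.

```python
import dataclasses
import json


@dataclasses.dataclass
class TiledLayout:
    """Hypothetical stand-in for a layout dataclass (not the torchao class)."""

    inner_k_tiles: int = 8


def layout_to_dict(layout):
    """Encode a dataclass instance as a plain dict tagged with its class name."""
    return {
        "layout_type": type(layout).__name__,
        "layout_kwargs": dataclasses.asdict(layout),
    }


def layout_from_dict(data, registry):
    """Rebuild the dataclass from the tagged dict, looking the class up by name."""
    cls = registry[data["layout_type"]]
    return cls(**data["layout_kwargs"])


config = {"quant_type": "int4_weight_only", "layout": TiledLayout()}

# Dumping the raw config fails because json does not know the dataclass.
try:
    json.dumps(config)
    raw_dump_failed = False
except TypeError:
    raw_dump_failed = True

# Converting the layout first makes the whole config serializable.
serializable = dict(config, layout=layout_to_dict(config["layout"]))
encoded = json.dumps(serializable, sort_keys=True)

# Round trip: decode the JSON and rebuild the original dataclass.
restored = layout_from_dict(
    json.loads(encoded)["layout"], {"TiledLayout": TiledLayout}
)
```

Tagging the dict with the class name is what lets the loader pick the right dataclass again; without it, only the field values would survive the round trip.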
-
- 17 Feb, 2025 12 commits
-
-
Yih-Dar authored
* fix
* fix
* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Marc Sun authored
add compressed_tensors in the dockerfile
-
dependabot[bot] authored
Bump transformers in /examples/research_projects/codeparrot/examples

Bumps [transformers](https://github.com/huggingface/transformers) from 4.38.0 to 4.48.0.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](https://github.com/huggingface/transformers/compare/v4.38.0...v4.48.0)

---
updated-dependencies:
- dependency-name: transformers
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
eustlb authored
-
Joao Gante authored
-
Arthur Zucker authored
-
ivarflakstad authored
-
Joao Gante authored
-
ivarflakstad authored
* Add hipsolver and hipblaslt as dependencies
* Upgrade torch libs with rocm6.2.4 index
-
Yih-Dar authored
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Fanli Lin authored
fix failed test
-
- 14 Feb, 2025 15 commits
-
-
Yih-Dar authored
* just skip
* fix
* fix
* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Mayank Mishra authored
* Modular GraniteMoE with shared Experts.
* Modified
* Import order.
* Modified for style
* Fix space.
* Test
* Remove extra granitemoe file.
* New converted file and tests
* Modified `__init__` files.
* Formatting.
* Dummy PT objects
* register granitemoe shared model
* fix linting of a file
* fix import in modeling file
* update generated modeling file
* add documentation
* update docstrings
* update generated modeling file
* fix docstrings in config class
* merge main

---------

Signed-off-by: Shawn Tan <shawntan@ibm.com>
Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Co-authored-by: Shawn Tan <shawntan@ibm.com>
Co-authored-by: Shawn Tan <shawn@wtf.sg>
Co-authored-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>
Co-authored-by: Sukriti Sharma <Ssukriti@users.noreply.github.com>
-
ivarflakstad authored
-
Kyle Sayers authored
* add is_composition flag to LlavaConfig
* WIP: pixtral text config
* fix style
* add test
* use is_composition for pixtral
* Revert "use is_composition for pixtral"

  This reverts commit a53d5f9fc5149c84419b0e9e03db6d99362add53.
* Revert "Revert "use is_composition for pixtral""

  This reverts commit 3ab1c99404e2c2963fba0bcf94b9786d6365db0f.

---------

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
-
Raushan Turganbay authored
* llavas
* add more models
* fix `compile_forward` test for all models
* fix copies
* make style
* also doesn't support cache class
* fix some tests
* not copied from
* ci green?
* fix tests
* fix copies
* fix tests
* check with `numel` and remove `item`
* fix copies
* fix copies
* Update src/transformers/models/cohere2/modeling_cohere2.py
* opt remove cross attn
* gemma2
* fixup
* fixup
* fix newly added test
* maybe fixed?
* green please?

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
David LaPalomento authored
* archive_file may not be specified

  When loading a pre-trained model from a gguf file, resolved_archive_file may not be set. Guard against that case in the safetensors availability check.
* Remap partial disk offload to cpu for GGUF files

  GGUF files don't support disk offload so attempt to remap them to the CPU when device_map is auto. If device_map is anything else but None, raise a NotImplementedError.
* Don't remap auto device_map and raise RuntimeError

  If device_map=auto and modules are selected for disk offload, don't attempt to map them to any other device. Raise a runtime error when a GGUF model is configured to map any modules to disk.

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
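
The final behaviour described in this commit (raise a runtime error when any module of a GGUF model is mapped to disk) can be sketched as below. This is only an illustration under assumptions: `disk_offloaded_modules` and `check_gguf_device_map` are hypothetical helper names, and the sketch assumes the device map has already been resolved to a dict of module name to device, where the string `"disk"` marks disk offload.

```python
def disk_offloaded_modules(device_map):
    """Return the module names that a device_map dict assigns to "disk"."""
    if not isinstance(device_map, dict):
        # None or an unresolved string such as "auto": no explicit disk entries.
        return []
    return sorted(name for name, device in device_map.items() if device == "disk")


def check_gguf_device_map(device_map):
    """Raise RuntimeError if any module would be offloaded to disk."""
    offloaded = disk_offloaded_modules(device_map)
    if offloaded:
        raise RuntimeError(
            "GGUF checkpoints cannot be offloaded to disk; "
            f"modules mapped to disk: {offloaded}"
        )


# A map that only uses an accelerator index and the CPU passes silently.
check_gguf_device_map({"model.embed_tokens": 0, "lm_head": "cpu"})
```

Failing loudly rather than remapping matches the reasoning in the last bullet: silently moving modules to another device could exceed the memory budget the user asked for.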
-
Arthur authored
* dito
* add a test
* update
* test needs fa2
* update test and configuration
* test requires fa2
* style
-
Mohamed Mekkouri authored
fix
-
Andrei Panferov authored
* new flute
* new higgs working
* small adjustments
* progress and quality
* small updates
* style

---------

Co-authored-by: Andrey Panferov <panferov.andrey3@wb.ru>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>
-
Raushan Turganbay authored
* allow processor to preprocess conversation + video metadata
* allow callable
* add test
* fix test
* nit: fix
* add metadata frames_indices
* Update src/transformers/processing_utils.py
* Update src/transformers/processing_utils.py
* port updates from Orr and add one more test
* Update src/transformers/processing_utils.py
* typo
* as dataclass
* style
* docstring + make sure tests green

---------

Co-authored-by: Pablo Montalvo <39954772+molbap@users.noreply.github.com>
-
Isotr0py authored
* add qwen2 fast image processor to modular file
* fix modular
* fix circle import
* add docs
* fix typo
* add modular generated files
* revert qwen2vl fast image processor
* remove qwen2.5-vl image processor from modular
* re-generate qwen2.5-vl files
* remove unnecessary test
* fix auto map
* cleanup
* fix model_input_names
* remove import
* make fix-copies

---------

Signed-off-by: isotr0py <2037008807@qq.com>
-
Raushan Turganbay authored
* decompose chat template docs
* add docs
* update model docs
* qwen2-5
* pixtral
* remove old chat template
* also video as list frames supported
* Update docs/source/en/chat_template_multimodal.md (13 times)
* remove audio for now

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Raushan Turganbay authored
* fix * also the docstring
-
Amit Garg authored
* Added support for partial_rotary_factor * addressed comments * refactored
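
For readers unfamiliar with the parameter: in models that use rotary position embeddings, a partial rotary factor usually means that only a leading fraction of each attention head's dimensions is rotated and the rest passes through unchanged. The sketch below shows that idea in plain Python. It is an assumption about the general technique, not the code from this commit; `rotary_dim` and `apply_partial_rotary` are hypothetical names, and the rotated width is assumed to be even.

```python
import math


def rotary_dim(head_dim, partial_rotary_factor):
    """Number of leading head dimensions that receive the rotation."""
    return int(head_dim * partial_rotary_factor)


def apply_partial_rotary(vec, position, partial_rotary_factor, base=10000.0):
    """Rotate the first rotary_dim entries of vec; leave the rest unchanged."""
    rot = rotary_dim(len(vec), partial_rotary_factor)
    half = rot // 2
    out = list(vec)
    for i in range(half):
        # Each pair (i, i + half) shares one rotation frequency.
        angle = position * base ** (-2.0 * i / rot)
        cos, sin = math.cos(angle), math.sin(angle)
        x1, x2 = vec[i], vec[i + half]
        out[i] = x1 * cos - x2 * sin
        out[i + half] = x2 * cos + x1 * sin
    return out


query = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
# With a factor of 0.5, only the first 4 of the 8 dimensions are rotated.
rotated = apply_partial_rotary(query, position=3, partial_rotary_factor=0.5)
```

Because each pair is rotated by a plain 2D rotation, the norm of the rotated slice is preserved, and a factor of 1.0 reduces to ordinary full rotary embeddings.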
-
- 13 Feb, 2025 6 commits
-
-
Yoni Gozlan authored
* uniformize owlvit processor
* uniformize owlv2
* nit
* add positional arg test owlvit
* run-slow: owlvit, owlv2
* run-slow: owlvit, owlv2
* remove one letter variable
-
Yoni Gozlan authored
* add support for initial shift in video processing and other fixes * revert modifications video loading functions
-
Yih-Dar authored
fix my bad

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Mohamed Mekkouri authored
fix
-
Wizyoung authored
fix load key name for _load_rng_state under torch.cuda

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
-
Yih-Dar authored
* speeddddd
* speeddddd
* speeddddd
* speeddddd

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-