- 07 Nov, 2024 1 commit
-
-
Nicolas Patry authored
-
- 05 Nov, 2024 12 commits
-
-
Ahnjj_DEV authored
* docs: ko: convbert.md * Update _toctree.yml * feat: nmt draft
-
Isotr0py authored
* fix stablelm qkv_bias * fix stablelm qkv_bias and use_parallel_residual * remove original_model.config for stablelm gguf test
-
Yoni Gozlan authored
fix-torch-interpolation-ci
-
Mohamed Mekkouri authored
* Changing __repr__ in torchao * small update * make style * small update * add LinearActivationQuantizedTensor * remove some cases * update imports & handle return None * update
-
Yih-Dar authored
* update * update * update * update * update * update * update * update * update * update * update --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Yoni Gottesman authored
* Fix assistant tokens when truncated * fix test * fix test * step
-
Eon Kim authored
-
Guang Yang authored
* DistillBERT is ExecuTorch compatible * [run_slow] distilbert * [run_slow] distilbert --------- Co-authored-by:
Guang Yang <guangyang@fb.com>
-
Raushan Turganbay authored
* save/load sub-configs * nit forgot these * fix copies * move test to common * use dict for sub-configs * add load-save-laod test * clean up modeling check * oops this are correct keys * fix some tests, missed some composite configs * this model was missed
-
Benjamin Bossan authored
FIX Broken repr of TorchAoConfig The __repr__ method references a non-existent self.kwargs. This is now fixed. There does not appear to be a uniform way of defining __repr__ for quantization configs. I copied the method as implemented for HQQ: https://github.com/huggingface/transformers/blob/e2ac16b28a0b8b900e136750309ca40c49d975c5/src/transformers/utils/quantization_config.py#L285-L287
-
AbdelKarim ELJANDOUBI authored
* Skip DeepSpeed ZeRO Stage 3 model initialization when it is intended to be quantized. * Propagate the quantization state using a context manager * make fixup
-
- 04 Nov, 2024 9 commits
-
-
Yih-Dar authored
update Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
kang sheng authored
fix TrainerState doc because num_input_tokens_seen is unused by default config Co-authored-by:
kangsheng <kangsheng@meituan.com>
-
Ju Hoon Park authored
* Update README_ko.md Delete the blank paragraph in the language selection button and Edit to synchronize with the English version of README.md * [i18n-KO] Update README_ko.md * Additional edit for keep consistency with main [documentation](https://huggingface.co/docs/transformers/v4.44.2/ko/index). (메인 문서와 일관성 유지를 위한 수정) * Update README_ko.md Additional update. * Change docs link to Korean translated page if it exists. * Change doc link to korean translated if it exists. Change the link of doc and delete a row 'migration' of the table Learn more[더 알아보기], since it does not exist in the main version of doc. * modify a link of the main README.md from `https://huggingface.co/docs/transformers/index#supported-frameworks` to `https://huggingface.co/docs/transformers/index#supported-models-and-frameworks` since the title of 'supported table' changed. * [i18n-ko] edit links and sync with main `README.md` * docs/change comment to Korean1 Change English comment to Korean Co-authored-by:
Jihun Lim <31366038+heuristicwave@users.noreply.github.com> * docs/change comment to Korean2 Change English comment to Korean Co-authored-by:
Jihun Lim <31366038+heuristicwave@users.noreply.github.com> * revise to original to seperate `edit_README_ko_md` and `README.md` * Synchronization with English documentation. Synchronization with English documentation, and translated a line of comment from English to Korean. --------- Co-authored-by:
Jihun Lim <31366038+heuristicwave@users.noreply.github.com>
-
MaCAT authored
* Translated to Ko, 1st version * updated _toctree.yml
-
Karthik Vallamsetla authored
* [i18n-HI] Translated TFLite page to Hindi * [i18n-HI] Translated TFLite page to Hindi * Update docs/source/hi/tflite.md Co-authored-by:
K.B.Dharun Krishna <kbdharunkrishna@gmail.com> --------- Co-authored-by:
K.B.Dharun Krishna <kbdharunkrishna@gmail.com>
-
JacobLinCool authored
* feat: add text support to TensorBoardCallback * feat: ignore long strings in trainer progress * docs: add docstring for max_str_len * style: remove trailing whitespace --------- Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com>
-
Joao Gante authored
* apply fix * tested * make fixup
-
Raushan Turganbay authored
* kinda works * update * add tests * update * use special tokens in processors * typo * fix copies * fix * fix moshi after rebase * update * fix tests * update * Update docs/source/en/main_classes/tokenizer.md Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * update docs * test for load time adding tokens * fix some more tests which are now fetched better * one more fix --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Zach Mueller authored
* Update trainer for easier handling of accumulate + proper reporting * test * Fixup tests * Full fix * Fix style * rm comment * Fix tests * Minimize test + remove py 311 check * Unused import * Forward contrib credits from discussions * Fix reported metrics * Refactor, good as it's going to get * rm pad tok id check * object detection and audio are being annoying * Fin * Fin x2 --------- Co-authored-by:
Gyanateet Dutta <Ryukijano@users.noreply.github.com>
-
- 01 Nov, 2024 5 commits
-
-
Karthik Vallamsetla authored
* [i18n-HI] Translated accelerate page to Hindi * Update docs/source/hi/accelerate.md Co-authored-by:
K.B.Dharun Krishna <kbdharunkrishna@gmail.com> * Update docs/source/hi/accelerate.md Co-authored-by:
K.B.Dharun Krishna <kbdharunkrishna@gmail.com> * Update docs/source/hi/accelerate.md Co-authored-by:
K.B.Dharun Krishna <kbdharunkrishna@gmail.com> * Update docs/source/hi/accelerate.md Co-authored-by:
K.B.Dharun Krishna <kbdharunkrishna@gmail.com> --------- Co-authored-by:
Kay <kay@Kays-MacBook-Pro.local> Co-authored-by:
K.B.Dharun Krishna <kbdharunkrishna@gmail.com>
-
Cyril Vallez authored
* rework converter * Update modular_model_converter.py * Update modular_model_converter.py * Update modular_model_converter.py * Update modular_model_converter.py * cleaning * cleaning * finalize imports * imports * Update modular_model_converter.py * Better renaming to avoid visiting same file multiple times * start converting files * style * address most comments * style * remove unused stuff in get_needed_imports * style * move class dependency functions outside class * Move main functions outside class * style * Update modular_model_converter.py * rename func * add augmented dependencies * Update modular_model_converter.py * Add types_to_file_type + tweak annotation handling * Allow assignment dependency mapping + fix regex * style + update modular examples * fix modular_roberta example (wrong redefinition of __init__) * slightly correct order in which dependencies will appear * style * review comments * Performance + better handling of dependencies when they are imported * style * Add advanced new classes capabilities * style * add forgotten check * Update modeling_llava_next_video.py * Add prority list ordering in check_conversion as well * Update check_modular_conversion.py * Update configuration_gemma.py
-
Pablo Montalvo authored
🔴 🔴 fix `query_pre_attn_scalar` different of `num_heads` in default gemma2 config (#34540) * fix query_pre_attn_scalar different of num_heads in default config * propagate modular changes * fix copies * fix modular copies * fix copies? * correct copies fix -
Raushan Turganbay authored
* blip2 tests * instructblips * copies * fix slow tests * fix * uncomment this * clean up after rebase * should be model main input * fix overwritten tests * oops len should be multiple of frame number * style * fix some tests
-
Raushan Turganbay authored
* set-get embeds * add tests * fix tests * remove * return dict True * fix tests * why did i remove this * enabel torchscript tests
-
- 31 Oct, 2024 13 commits
-
-
Ahmed Almaghz authored
* Add docs/source/ar/multilingual.md to Add_docs_source_ar_multilingual.md * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/multilingual.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update _toctree.yml * Update _toctree.yml * Add Translated files to branch for merg * Update _toctree.yml * Update _toctree.yml * Update custom_models.md * Update chat_templating.md * Update docs/source/ar/create_a_model.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update create_a_model.md * Update gguf.md * Update gguf.md * Update gguf.md * Update gguf.md --------- Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
jiqing-feng authored
* update doc * Update docs/source/en/perf_train_cpu.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * delete closing tip --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
NielsRogge authored
* Remove interpolate_pos_encoding * Make fixup * Make interpolate_pos_encoding default to True * Reuse existing interpolation * Add integration test
-
Yoni Gozlan authored
* Standardize image-text-to-text-models-output add post_process_image_text_to_text to chameleon and cleanup Fix legacy kwarg behavior and deprecation warning add post_process_image_text_to_text to qwen2_vl and llava_onevision Add post_process_image_text_to_text to idefics3, mllama, pixtral processor * nit var name post_process_image_text_to_text udop * nit fix deprecation warnings * Add image-text-to-text pipeline * add support for image url in chat template for pipeline * Reformat to be fully compatible with chat templates * Add tests chat template * Fix imports and tests * Add pipeline tag * change logic handling of single prompt ans multiple images * add pipeline mapping to models * fix batched inference * fix tests * Add manual batching for preprocessing * Fix outputs with nested images * Add support for all common processing kwargs * Add default padding when multiple text inputs (batch size>1) * nit change version deprecation warning * Add support for text only inference * add chat_template warnings * Add pipeline tests and add copied from post process function * Fix batched pipeline tests * nit * Fix pipeline tests blip2 * remove unnecessary max_new_tokens * revert processing kosmos2 and remove unnecessary max_new_tokens * fix pipeline tests idefics * Force try loading processor if pipeline supports it * revert load_processor change * hardcode loading only processor * remove unnecessary try except * skip imagetexttotext tests for kosmos2 as tiny model causes problems * Make code clearer * Address review comments * remove preprocessing logic from pipeline * fix fuyu * add BC resize fuyu * Move post_process_image_text_to_text to ProcessorMixin * add guard in post_process * fix zero shot object detection pipeline * add support for generator input in pipeline * nit * change default image-text-to-text model to llava onevision * fix owlv2 size dict * Change legacy deprecation warning to only show when True
-
fpgaminer authored
Update SiglipVisionEmbeddings.forward to cast input to correct dtype before embedding it.
-
Yih-Dar authored
* try * try * try * try * try * try * update * update * update * update * update * update * update --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Luc Georges authored
* feat: add benchmarks pg indexes * refactor: remove debug `df -h`
-
Phillip Kuznetsov authored
* fix(DPT,Depth-Anything) Address expected_slice errors inside inference tests Signed-off-by:
Phillip Kuznetsov <philkuz@gimletlabs.ai> * [run_slow] dpt, depth_anything --------- Signed-off-by:
Phillip Kuznetsov <philkuz@gimletlabs.ai>
-
Joao Gante authored
it has complex inputs_embeds computation
-
Yih-Dar authored
* update * update * update * update * update --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
kibitzing authored
* replace total_batched_samples with step while counting grad accum step * remove unused variable * simplify condition for update step * fix format by ruff * simplify update step condition using accelerator.sync_gradients * simplify update condition using do_sync_step * remove print for test --------- Co-authored-by:
Zach Mueller <muellerzr@gmail.com>
-
jp authored
Fix: unpadding img mismatch
-
jiqing-feng authored
* enable QA bf16 pipeline * add tests
-