- 23 Jan, 2025 1 commit
-
-
Arthur Zucker authored
-
- 17 Jan, 2025 2 commits
-
-
Arthur Zucker authored
-
Ross Wightman authored
An attempt to fix #29554. Include 'LayerNorm.' in gamma/beta rename scope, optimize string search. (#35615) * An attempt to fix #29554. Include 'LayerNorm.' in gamma/beta rename scope, reduce number of characters searched on every load considerably. * Fix fix on load issue * Fix gamma/beta warning test * A style complaint * Improve efficiency of weight norm key rename. Add better comments about weight norm and layer norm renaming. * Habitual elif redundant with the return
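A minimal sketch of what scoping the rename to 'LayerNorm.' could look like. It assumes the legacy convention is LayerNorm parameters saved as `gamma`/`beta` instead of `weight`/`bias`. The function name `rename_legacy_layernorm_key` and the sample keys are hypothetical, and this is not the code from #35615.

```python
def rename_legacy_layernorm_key(key: str) -> str:
    """Map legacy LayerNorm parameter names to weight/bias (illustrative only)."""
    # Only touch keys whose final component belongs to a LayerNorm module, so
    # unrelated parameters that merely contain "gamma" or "beta" are left alone.
    if key.endswith("LayerNorm.gamma"):
        return key[: -len("gamma")] + "weight"
    if key.endswith("LayerNorm.beta"):
        return key[: -len("beta")] + "bias"
    return key


sample_keys = [
    "encoder.layer.0.output.LayerNorm.gamma",
    "encoder.layer.0.output.LayerNorm.beta",
    "blocks.0.gamma_1",  # hypothetical non-LayerNorm key; stays unchanged
]
renamed = [rename_legacy_layernorm_key(k) for k in sample_keys]
```

Checking a fixed suffix with `str.endswith` avoids scanning the whole key for a substring, which is one plausible reading of "reduce number of characters searched on every load".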
-
- 16 Jan, 2025 15 commits
-
-
Sai-Suraj-27 authored
Added resource class configuration option for check_circleci_user job.
-
Joao Gante authored
* generate returns a Cache object by default * fix tests * fix test for encoder-decoder models
-
Joao Gante authored
fix failing instantiation
-
Joao Gante authored
* rm command * remove exception
-
Joao Gante authored
* remove things scheduled for removal * make fixup
-
Joao Gante authored
* add test * augment test as suggested * Update tests/utils/test_modeling_utils.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * rerun tests --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Quinten Roets authored
* use max batch size * revert unnecessary change --------- Co-authored-by:
Raushan Turganbay <raushan@huggingface.co>
-
Cyril Vallez authored
* fix function imports * improve comment * Update modeling_switch_function.py * make checks more robust * improvement * rename * final test update
-
efsotr authored
Optimize ForCausalLMLoss by removing unnecessary contiguous() calls to reduce memory overhead (#35646)
-
Matt authored
* Cleanup jinja2 imports * Raise a proper error if Jinja is missing * make fixup
-
Joao Gante authored
fix type hint
-
人民艺术家 authored
Fix the bug that the accelerator.autocast does not pass parameters correctly when calling torch_jit_model_eval (#35706)
-
kang sheng authored
* fix condition when GA loss bug fix is not performed * max loss diff is 2.29 * fix typo * add an extra validation that loss should not vary too much
-
Mohamed Mekkouri authored
* fix falcon tie_word_embeddings * fix style
-
Mikko Reinikainen authored
* Replace deprecated batch_size with max_batch_size - Functionality remains the same, because property getter batch_size(self) returned max_batch_size anyways. - This change just avoids an unnecessary warning about deprecation. * Use max_batch_size instead of deprecated batch_size with HybridCache * Use max_batch_size instead of deprecated batch_size with HybridCache - Change generated code to match original source
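The note above says the `batch_size` getter returned `max_batch_size` anyway, so the switch only avoids a warning. A minimal sketch of that kind of deprecated alias follows. `CacheSketch` is a hypothetical stand-in, not the transformers cache class, and the warning category and message are assumptions.

```python
import warnings


class CacheSketch:
    """Hypothetical stand-in for a cache object with a deprecated alias."""

    def __init__(self, max_batch_size: int):
        self.max_batch_size = max_batch_size

    @property
    def batch_size(self) -> int:
        # Deprecated alias: warn, then return the same value as max_batch_size.
        warnings.warn(
            "`batch_size` is deprecated; use `max_batch_size` instead.",
            FutureWarning,
            stacklevel=2,
        )
        return self.max_batch_size


cache = CacheSketch(max_batch_size=4)
new_value = cache.max_batch_size  # preferred attribute, no warning

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    old_value = cache.batch_size  # same value, but a warning is recorded
```

Both reads give the same number, which is why replacing the old name with the new one leaves behaviour unchanged.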
-
- 15 Jan, 2025 5 commits
-
-
hiroaki222 authored
doc: Update original code repository URL
-
Mohamed Mekkouri authored
* fixing nemotron processor * make style
-
jiqing-feng authored
* gptqmodel Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update readme Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * gptqmodel need use checkpoint_format (#1) * gptqmodel need use checkpoint_format * fix quantize * Update quantization_config.py * Update quantization_config.py * Update quantization_config.py --------- Co-authored-by:
ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by:
Qubitium-ModelCloud <qubitium@modelcloud.ai> * Revert quantizer_gptq.py (#2) * revert quantizer_gptq.py change * pass **kwargs * limit gptqmodel and optimum version Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix format Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix warning Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix version check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * revert unrelated changes Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * enable gptqmodel tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix requires gptq Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * Fix Transformer compat (#3) * revert quantizer_gptq.py change * pass **kwargs * add meta info * cleanup * cleanup * Update quantization_config.py * hf_select_quant_linear pass checkpoint_format and meta * fix GPTQTestCUDA * Update test_gptq.py * gptqmodel.hf_select_quant_linear() now does not select ExllamaV2 * cleanup * add backend * cleanup * cleanup * no need check exllama version * Update quantization_config.py * lower checkpoint_format and backend * check none * cleanup * Update quantization_config.py * fix self.use_exllama == False * spell * fix unittest * fix unittest --------- Co-authored-by:
LRL <lrl@lbx.dev> Co-authored-by:
Qubitium-ModelCloud <qubitium@modelcloud.ai> * fix format Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix format again Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update gptqmodel version (#6) * update gptqmodel version * update gptqmodel version * fix unit test (#5) * update gptqmodel version * update gptqmodel version * "not self.use_exllama" is not equivalent to "self.use_exllama==False" * fix unittest * update gptqmodel version * backend is loading_attibutes (#7) * fix format and tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix memory check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix device mismatch Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * fix result check Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * Update src/transformers/quantizers/quantizer_gptq.py Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update src/transformers/quantizers/quantizer_gptq.py Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com> * Update src/transformers/quantizers/quantizer_gptq.py Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com> * update tests Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * review: update docs (#10) * review: update docs (#12) * review: update docs * fix typo * update tests for gptqmodel Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> * update document (#9) * update overview.md * cleanup * Update overview.md * Update overview.md * Update overview.md * update gptq.md * Update gptq.md * Update gptq.md * Update gptq.md * Update gptq.md * Update gptq.md * Update gptq.md --------- Co-authored-by:
Qubitium-ModelCloud <qubitium@modelcloud.ai> * typo * doc note for asymmetric quant * typo with apple silicon(e) * typo for marlin * column name revert: review * doc rocm support * Update docs/source/en/quantization/gptq.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/gptq.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/gptq.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/gptq.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/overview.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/quantization/overview.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Signed-off-by:
jiqing-feng <jiqing.feng@intel.com> Co-authored-by:
LRL-ModelCloud <165116337+LRL-ModelCloud@users.noreply.github.com> Co-authored-by:
ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by:
Qubitium-ModelCloud <qubitium@modelcloud.ai> Co-authored-by:
ZX-ModelCloud <165115237+ZX-ModelCloud@users.noreply.github.com> Co-authored-by:
LRL <lrl@lbx.dev> Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by:
Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
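One bullet in the commit above points out that "not self.use_exllama" is not equivalent to "self.use_exllama==False". In plain Python the two checks differ when the value is None. Whether the flag can actually be None in the quantization config is an assumption here, and `compare_checks` is a hypothetical helper used only for illustration.

```python
def compare_checks(flag):
    """Return (not flag, flag == False) for a flag that may be True, False, or None."""
    truthiness_check = not flag
    equality_check = flag == False  # noqa: E712 (the explicit comparison is the point)
    return truthiness_check, equality_check


# None is falsy, so `not None` is True, but `None == False` is False.
results = {value: compare_checks(value) for value in (True, False, None)}
```

For True and False the two checks agree. They diverge only for None, which is the case where an "unset" flag would be treated as an explicit "disabled".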
-
Matt authored
* Add future import for Py < 3.10 * make fixup * Same issue in convert_olmo2_weights_to_hf.py
-
Raushan Turganbay authored
* remove manual assignment tie-word-embeddings * remove another unused attribute * fix tests * fix tests * remove unnecessary overwrites * fix * decoder=True * clean pix2struct * run-all * forgot `_tied_weights_keys` when adding Emu3 * also Aria + fix-copies * and clean aria
-
- 14 Jan, 2025 7 commits
-
-
Mahdi Baghbanzadeh authored
* DataCollatorForLanguageModeling class was updated with new parameters that provide more control over token masking and replacing * Addressed review comments, modified the docstring and made a test for the DataCollatorForLanguageModeling
-
Ego Joseph Oborakpororo authored
* Update README.md * Update README.md Enhanced installation section with troubleshooting, GPU setup, and OS-specific details. * Update installation.md Updated installation.md to include virtual environment and GPU setup instructions. * Update installation.md Updated installation.md to include virtual environment, troubleshooting and GPU setup instructions. * Update installation.md * Update installation.md Updated installation.md to include virtual environment, troubleshooting functions and GPU setup instructions. * Update README.md Removed numbering from README.md. * Update README.md Removed unnecessary "a)" formatting as per maintainer feedback. * Update README.md Added blank lines around code snippets for better readability. * Update README.md Removed the line "b) Install a backend framework:" from README.md as per feedback. * Update README.md Simplified "For Windows:" to "Windows" in README.md as per feedback as well as "For macOS/Linux:" to "macOS/Linux" * Update README.md Removed unnecessary heading and retained valid code snippet. * Update README.md Removed unnecessary heading "d) Optional: Install from source for the latest updates" as per feedback.
* Update README.md Removed "GPU Setup (Optional)" section to align with minimal design feedback. * Update installation.md Removed "Create and Activate a Virtual Environment" section from installation.md as per feedback. * Update installation.md Adjusted "Troubleshooting" to a second-level heading and added an introductory line as per feedback. * Update installation.md Updated troubleshooting section with simplified headings and formatted code blocks as per feedback. * Update installation.md Integrated GPU setup instructions into the "Install with pip" section for better content flow. * Update README.md Removed Troubleshooting section from README.md for minimalism as per maintainer feedback.
-
Mohamed Mekkouri authored
fix gemma2 gated model test
-
Mohamed Mekkouri authored
fix expected output
-
Mohamed Mekkouri authored
* fix * make style * adding require_hqq * make style
-
Martin authored
* Update torchao.md: use auto-compilation * Update torchao.md: indicate updating transformers to the latest --------- Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com>
-
Mohamed Mekkouri authored
* fix docker * fix
-
- 13 Jan, 2025 10 commits
-
-
RTrace authored
-
Arthur authored
* Add the helium model. * Add a missing helium. * And add another missing helium. * Use float for the rmsnorm mul. * Add the Helium tokenizer converter. * Add the pad token as suggested by Arthur. * Update the RMSNorm + some other tweaks. * Fix more rebase issues. * fix copies and style * fixes and add helium.md * add missing tests * update the backlink * oops * style * update init, and expected results * small fixes * match test outputs * style fixup, fix doc builder * add dummies and we should be good to go! * update sdpa and fa2 documentation --------- Co-authored-by:
laurent <laurent.mazare@gmail.com>
-
Ahmed Almaghz authored
* Create token_classification.md * Update token_classification.md * Update docs/source/ar/tasks/token_classification.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update docs/source/ar/tasks/token_classification.md Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com> * Update _toctree.yml --------- Co-authored-by:
Abdullah Mohammed <554032+abodacs@users.noreply.github.com>
-
Fanli Lin authored
* initial commit * remove unrelated files * further remove * Update test_trainer.py * fix style
-
Arthur authored
* rename test to only compile forward! * style emu
-
Raushan Turganbay authored
* fix * fix test * add tests * add more tests * fix tests * supposed to be a torch.dtype test * handle BC and make fp32 default
-
Arthur authored
bias should be True
-
Sai-Suraj-27 authored
* Removed duplicate class field definition. * Removed duplicate code in try-except block. --------- Co-authored-by:
Pablo Montalvo <39954772+molbap@users.noreply.github.com>
-
jiqing-feng authored
Fix compile error Signed-off-by:
jiqing-feng <jiqing.feng@intel.com>
-
Cyril Vallez authored
fix rope device
-