- 31 Jan, 2025 1 commit
-
-
eustlb authored
compute head_dim_padding at init
-
- 30 Jan, 2025 10 commits
-
-
Yoni Gozlan authored
* move make_flat_list_of_images and make_batched_videos to image_utils * remove unnecessary is_vision_available * move make_nested_list_of_images to image_utils * fix fast pixtral image processor * fix import mllama * fix make_nested_list_of_images * add tests * convert 4d arrays/tensors to list * add test_make_batched_videos * add support nested batch of videos * fix image processing qwen2vl
-
Marcel authored
* Handle empty change indices in RLE conversion for masks * [test] Add unit tests for RLE encoding of masks in SamProcessor * [test] Update RLE conversion tests to use TensorFlow implementation * [test] Fix formatting in SamProcessorTest according to check_code_quality action * [test] Fix formatting in SamProcessorTest according to check_code_quality * [test] Refactored rle test cases into one test and used tf tensors in tf test cases * [test] Fix: removed self parameter from refactored methods * [test] Removed nested methods in run-length encoding tests for PyTorch and TensorFlow * [test] Added description to individual to run-length encoding tests for PyTorch and TensorFlow.
-
Yih-Dar authored
fix Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Nat Jeffries authored
* Add support for attention masking in moonshine. Tested against Open ASR Leaderboard with batch size 256. * Update comments and ensure attention masks are passed everywhere. Perform attention mask downsampling inside of moonshine forward call. * Hide padding behind conditional. Fix encoder/decoder masking. - Correctly pipe encoder attention mask into decoder - Add correct scaling factor if one is not already provided. - Fix formatting with ruff * Add auto generated modeling_moonshine file. * Update formatting in generated model file. * Address review comments. * Fix typo. * Add `pad_head_dim_to_multiple_of` to moonshine config. * Correct args order for MooonshineConfig. * Update configuration moonshine too. * Update src/transformers/models/moonshine/modular_moonshine.py * Update src/transformers/models/moonshine/configuration_moonshine.py --------- Co-authored-by:
eustlb <94853470+eustlb@users.noreply.github.com>
-
Yih-Dar authored
* fix * remove is_flaky * fix --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Matt authored
* p_mask back to being a list * Remove breakpoint
-
Raushan Turganbay authored
* fix * remove overriden method * small change
-
Raushan Turganbay authored
* initial POC * - batch mix feature * fix tests * fix tests * make style * do not skip and instead fix tests * update * return back the test * correct text with the correct ckpt
-
Joao Gante authored
fix tests
-
Ilyas Moutawwakil authored
* fix is_causal being a tensor * convert in sdpa attention only when jit tracing
-
- 29 Jan, 2025 11 commits
-
-
Wing Lian authored
-
Joao Gante authored
* move max time tests to their right place * move test to the right place
-
Boris Malashenko authored
There should be a dot after pip install .
-
Fanli Lin authored
* bug fix * update with more cases * more entries * Fix --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
update docker file for deepspeed Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Zach Mueller authored
* start * So far: 30% * Small fix * Continuing update * Continuing * Forgot to check if not None * Continuing refactor * Fix if else * Fix ref * Should make tests pass * Keep grad norm same * Document * Apply suggestions from code review Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com> * Err instead of info for logging RNG state error * Seperate out to func --------- Co-authored-by:
Marc Sun <57196510+SunMarc@users.noreply.github.com>
-
Jonas Rohw authored
* Support for generate_argument: return_dict_in_generate=True, instead of returning a error * fix: call test with return_dict_in_generate=True * fix: Only import torch if it is present * update: Encapsulate output_dict changes * fix: added back original comments --------- Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com>
-
Yih-Dar authored
fix Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* Fix * Fix * Fix --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix * Fix --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Nadav Timor authored
fix test_generated_length_assisted_generation
-
- 28 Jan, 2025 11 commits
-
-
Mohamed Abu El-Nasr authored
* use torch constraints to check for psd * small nit * Small change * Small change for the ci * nit
-
Ella Charlaix authored
remove INC notebook in documentation
-
NanoCode012 authored
fix(FA): QKV not being casted to target_dtype due to dtype check
-
Joao Gante authored
-
Cyril Vallez authored
* fix tp * Update modeling_utils.py * style * style * Update test_tp.py * Update test_tp.py * style * Update test_tp.py * Update test_tp.py * Update test_tp.py * Update test_tp.py
-
Raushan Turganbay authored
fix
-
Cyril Vallez authored
* correctly slice * check mask * Update modular_gemma2.py * fix * add tests * fix typo * finally fix mask slicing * Finally correctly slice in all cases!! * add test for all attention functions * small fix in tests * trick around dynamo tracing issue * last update * more robust * kwargs propagation * make it explicit for checkpointing * apply modular
-
Raushan Turganbay authored
* fix dtype as dict for some models + add test * add comment in tests
-
Cyril Vallez authored
* Add some tp plans! * More tp plans! * Add it in the comment * style * Update configuration_mixtral.py * Update configuration_phi.py * update the layout according to special archs * fix mixtral * style * trigger CIs * trigger CIs * CIs * olmo2 --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
ivarflakstad authored
* Use rocm6.2 as rocm6.3 only has nightly pytorch wheels atm * Use stable wheel index for torch libs
-
Yih-Dar authored
* use mask_fill * remove comment --------- Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
- 27 Jan, 2025 7 commits
-
-
Steven Liu authored
fix code block
-
Matt authored
* close zamba2 code block * Add Zamba2 to toctree
-
Matt authored
* Fix the config class comparison when repeatedly saving and loading remote code models * once again you have committed your debug breakpoint
-
Steven Liu authored
uv install
-
CalOmnie authored
* Fix typing in audio_utils.chroma_filter_bank * Apply make style --------- Co-authored-by:
Louis Groux <louis.cal.groux@gmail.com>
-
Isotr0py authored
* clean up ggml test Signed-off-by:
Isotr0py <2037008807@qq.com> * port remaining tests Signed-off-by:
Isotr0py <2037008807@qq.com> * further cleanup Signed-off-by:
Isotr0py <2037008807@qq.com> * format Signed-off-by:
Isotr0py <2037008807@qq.com> * fix broken tests Signed-off-by:
Isotr0py <2037008807@qq.com> * update comment Signed-off-by:
Isotr0py <2037008807@qq.com> * fix Signed-off-by:
Isotr0py <2037008807@qq.com> * reorganize tests Signed-off-by:
Isotr0py <2037008807@qq.com> * k-quants use qwen2.5-0.5B Signed-off-by:
Isotr0py <2037008807@qq.com> * move ggml tokenization test Signed-off-by:
Isotr0py <2037008807@qq.com> * remove dead code Signed-off-by:
Isotr0py <2037008807@qq.com> * add assert for serilization test Signed-off-by:
Isotr0py <2037008807@qq.com> * use str for parameterize Signed-off-by:
Isotr0py <2037008807@qq.com> --------- Signed-off-by:
Isotr0py <2037008807@qq.com>
-
Ross Wightman authored
🚨 🚨 🚨 image-classification pipeline single-label and multi-label prob type squashing fns (sigmoid vs softmax) are backwards (#35848) single-label and multi-label prob type squashing fns (sigmoid vs softmax) were backwards for image-classification pipeline
-