1. 11 Mar, 2025 4 commits
  2. 10 Mar, 2025 4 commits
  3. 07 Mar, 2025 8 commits
  4. 06 Mar, 2025 10 commits
  5. 05 Mar, 2025 2 commits
  6. 04 Mar, 2025 5 commits
    • Afanti's avatar
      chore: enhance messages in docstrings (#36525) · 66f29aaa
      Afanti authored
      chore: enhance the message in docstrings
      66f29aaa
    • Mohamed Mekkouri's avatar
      Fix links in quantization doc (#36528) · 89d27fa6
      Mohamed Mekkouri authored
      fix quantization doc
      89d27fa6
    • ivarflakstad's avatar
      Fix bamba tests amd (#36535) · c0c5acff
      ivarflakstad authored
      c0c5acff
    • co63oc's avatar
      chore: Fix typos in docs and examples (#36524) · 37508816
      co63oc authored
      
      Fix typos in docs and examples
      
      Signed-off-by: default avatarco63oc <co63oc@users.noreply.github.com>
      37508816
    • Arthur's avatar
      Add aya (#36521) · 84f0186e
      Arthur authored
      
      * initial commit
      
      * small fix
      
      * move stuff to image processing file
      
      * remove stuff in validate turn and fix return tensor
      
      * remove liquid stuff
      
      * in the process of addressing comments
      
      * changes to get the right tokenization
      
      * new __init__ works
      
      * fixing defulat std and mean
      
      * works
      
      * small testing scipt -- to be deleted before merge
      
      * remove redundant code
      
      * addressing comments
      
      * fix inits, add docs templates
      
      * refactor processor, switch to gotocr image processor
      
      * remove image proc from init
      
      * refactor to working llava-style architecture
      
      * Change AyaVisionModel to AyaVisionForConditionalGeneration
      
      * add tests
      
      * fixups
      
      * update doc
      
      * Adding logits_to_keep explicitly in ayavision forward to enable compatibility with cohere model
      
      * better variable names + remove code paths
      
      * Updates to aya_vision.md
      
      * address comments
      
      * adding copied from
      
      * make style and remove unused projector_hidden_act from config
      
      * sort init
      
      * include usage of fast image proc and proc on cuda in doc
      
      * update checkpoint iin test processor
      
      * update checkpoint in test processor 2
      
      * remove test_model and update docstring
      
      * skip failing tests
      
      ---------
      
      Co-authored-by: default avatarSaurabh Dash <saurabh@cohere.com>
      Co-authored-by: default avataryonigozlan <yoni.gozlan@huggingface.co>
  7. 03 Mar, 2025 7 commits
    • Steven Liu's avatar
      [docs] Redesign (#31757) · c0f8d055
      Steven Liu authored
      
      * toctree
      
      * not-doctested.txt
      
      * collapse sections
      
      * feedback
      
      * update
      
      * rewrite get started sections
      
      * fixes
      
      * fix
      
      * loading models
      
      * fix
      
      * customize models
      
      * share
      
      * fix link
      
      * contribute part 1
      
      * contribute pt 2
      
      * fix toctree
      
      * tokenization pt 1
      
      * Add new model (#32615)
      
      * v1 - working version
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * rename to correct name
      
      * fix title
      
      * fixup
      
      * rename files
      
      * fix
      
      * add copied from on tests
      
      * rename to `FalconMamba` everywhere and fix bugs
      
      * fix quantization + accelerate
      
      * fix copies
      
      * add `torch.compile` support
      
      * fix tests
      
      * fix tests and add slow tests
      
      * copies on config
      
      * merge the latest changes
      
      * fix tests
      
      * add few lines about instruct
      
      * Apply suggestions from code review
      
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * fix
      
      * fix tests
      
      ---------
      
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * "to be not" -> "not to be" (#32636)
      
      * "to be not" -> "not to be"
      
      * Update sam.md
      
      * Update trainer.py
      
      * Update modeling_utils.py
      
      * Update test_modeling_utils.py
      
      * Update test_modeling_utils.py
      
      * fix hfoption tag
      
      * tokenization pt. 2
      
      * image processor
      
      * fix toctree
      
      * backbones
      
      * feature extractor
      
      * fix file name
      
      * processor
      
      * update not-doctested
      
      * update
      
      * make style
      
      * fix toctree
      
      * revision
      
      * make fixup
      
      * fix toctree
      
      * fix
      
      * make style
      
      * fix hfoption tag
      
      * pipeline
      
      * pipeline gradio
      
      * pipeline web server
      
      * add pipeline
      
      * fix toctree
      
      * not-doctested
      
      * prompting
      
      * llm optims
      
      * fix toctree
      
      * fixes
      
      * cache
      
      * text generation
      
      * fix
      
      * chat pipeline
      
      * chat stuff
      
      * xla
      
      * torch.compile
      
      * cpu inference
      
      * toctree
      
      * gpu inference
      
      * agents and tools
      
      * gguf/tiktoken
      
      * finetune
      
      * toctree
      
      * trainer
      
      * trainer pt 2
      
      * optims
      
      * optimizers
      
      * accelerate
      
      * parallelism
      
      * fsdp
      
      * update
      
      * distributed cpu
      
      * hardware training
      
      * gpu training
      
      * gpu training 2
      
      * peft
      
      * distrib debug
      
      * deepspeed 1
      
      * deepspeed 2
      
      * chat toctree
      
      * quant pt 1
      
      * quant pt 2
      
      * fix toctree
      
      * fix
      
      * fix
      
      * quant pt 3
      
      * quant pt 4
      
      * serialization
      
      * torchscript
      
      * scripts
      
      * tpu
      
      * review
      
      * model addition timeline
      
      * modular
      
      * more reviews
      
      * reviews
      
      * fix toctree
      
      * reviews reviews
      
      * continue reviews
      
      * more reviews
      
      * modular transformers
      
      * more review
      
      * zamba2
      
      * fix
      
      * all frameworks
      
      * pytorch
      
      * supported model frameworks
      
      * flashattention
      
      * rm check_table
      
      * not-doctested.txt
      
      * rm check_support_list.py
      
      * feedback
      
      * updates/feedback
      
      * review
      
      * feedback
      
      * fix
      
      * update
      
      * feedback
      
      * updates
      
      * update
      
      ---------
      
      Co-authored-by: default avatarYounes Belkada <49240599+younesbelkada@users.noreply.github.com>
      Co-authored-by: default avatarArthur <48595927+ArthurZucker@users.noreply.github.com>
      Co-authored-by: default avatarQuentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
      c0f8d055
    • Matt's avatar
      Remove unused code (#36459) · 6aa98884
      Matt authored
      6aa98884
    • Kashif Rasul's avatar
      [Style] fix E721 warnings (#36474) · 9fe82793
      Kashif Rasul authored
      * fix E721 warnings
      
      * config.hidden_size is not a tuple
      
      * fix copies
      
      * fix-copies
      
      * not a tuple
      
      * undo
      
      * undo
      9fe82793
    • Matt's avatar
      Fix edge case for continue_final_message (#36404) · 1975be4d
      Matt authored
      * Fix edge case for continue_final_message
      
      * lstrip() correctly
      
      * Add regression test
      
      * Add a clearer error message when the final message is not present
      
      * Add a clearer error message when the final message is not present
      
      * Fix massive bug!
      1975be4d
    • Matt's avatar
      Fix pipeline+peft interaction (#36480) · 2aff9389
      Matt authored
      * Fix pipeline-peft interaction
      
      * once again you have committed a debug breakpoint
      
      * Remove extra testing line
      
      * Add a test to check adapter loading
      
      * Correct adapter path
      
      * make fixup
      
      * Remove unnecessary check
      
      * Make check a little more stringent
      2aff9389
    • Afanti's avatar
      chore: fix message descriptions in arguments and comments (#36504) · 28159aee
      Afanti authored
      chore: fix messagedescriptions in arguments and comments
      28159aee
    • co63oc's avatar
      Fix some typos in docs (#36502) · acb8586d
      co63oc authored
      
      Co-authored-by: default avatarMatt <Rocketknight1@users.noreply.github.com>
      acb8586d