1. 23 Jan, 2025 1 commit
  2. 17 Jan, 2025 2 commits
    • add more TP support · ef0b5e27
      Arthur Zucker authored
    • :rotating_light::rotating_light::rotating_light: An attempt to fix #29554. Include 'LayerNorm.' in gamma/beta rename scope,... · 8c1b5d37
      Ross Wightman authored
      :rotating_light::rotating_light::rotating_light: An attempt to fix #29554. Include 'LayerNorm.' in gamma/beta rename scope, optimize string search. (#35615)
      
      * An attempt to fix #29554. Include 'LayerNorm.' in gamma/beta rename scope, reduce number of characters searched on every load considerably.
      
      * Fix fix on load issue
      
      * Fix gamma/beta warning test
      
      * A style complaint
      
      * Improve efficiency of weight norm key rename. Add better comments about weight norm and layer norm renaming.
      
      * Habitual elif redundant with the return
  3. 16 Jan, 2025 15 commits
  4. 15 Jan, 2025 5 commits
  5. 14 Jan, 2025 7 commits
    • Enhance DataCollatorForLanguageModeling with Configurable Token Replacement Probabilities (#35251) · c61fcde9
      Mahdi Baghbanzadeh authored
      * DataCollatorForLanguageModeling class was updated with new parameters that provide more control over token masking and replacing
      
      * Addressed review comments, modified the docstring and made a test for the DataCollatorForLanguageModeling
    • Enhanced Installation Section in README.md (#35094) · b0cdbd91
      Ego Joseph Oborakpororo authored
      * Update README.md
      
      Enhanced installation section with troubleshooting, GPU setup, and OS-specific details.
      
      * Update installation.md
      
      Updated installation.md to include virtual environment and GPU setup instructions.
      
      * Update installation.md
      
      Updated installation.md to include virtual environment, troubleshooting and GPU setup instructions.
      
      * Update installation.md
      
      * Update installation.md
      
      Updated installation.md to include virtual environment, troubleshooting functions and GPU setup instructions.
      
      * Update README.md
      
      Removed numbering from README.md.
      
      * Update README.md
      
      Removed unnecessary "a)" formatting as per maintainer feedback.
      
      * Update README.md
      
      Added blank lines around code snippets for better readability.
      
      * Update README.md
      
      Removed the line "b) Install a backend framework:" from README.md as per feedback.
      
      * Update README.md
      
      Simplified "For Windows:" to "Windows" and "For macOS/Linux:" to "macOS/Linux" in README.md as per feedback.
      
      * Update README.md
      
      Removed unnecessary heading and retained valid code snippet.
      
      * Update README.md
      
      Removed unnecessary heading "d) Optional: Install from source for the latest updates" as per feedback.
      
      * Update README.md
      
      Removed "GPU Setup (Optional)" section to align with minimal design feedback.
      
      * Update installation.md
      
      Removed "Create and Activate a Virtual Environment" section from installation.md as per feedback.
      
      * Update installation.md
      
      Adjusted "Troubleshooting" to a second-level heading and added an introductory line as per feedback.
      
      * Update installation.md
      
      Updated troubleshooting section with simplified headings and formatted code blocks as per feedback.
      
      * Update installation.md
      
      Integrated GPU setup instructions into the "Install with pip" section for better content flow.
      
      * Update README.md
      
      Removed Troubleshooting section from README.md for minimalism as per maintainer feedback.
    • Fix : add require_read_token for gemma2 gated model (#35687) · a11041ff
      Mohamed Mekkouri authored
      fix gemma2 gated model test
    • Fix expected output for ggml test (#35686) · df2a812e
      Mohamed Mekkouri authored
      fix expected output
    • Fix : HQQ config when hqq not available (#35655) · 05063651
      Mohamed Mekkouri authored
      * fix
      
      * make style
      
      * adding require_hqq
      
      * make style
    • Update torchao.md: use auto-compilation (#35490) · 715fdd64
      Martin authored
      
      * Update torchao.md: use auto-compilation
      
      * Update torchao.md: indicate updating transformers to the latest
      
      ---------
      
      Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
    • Mohamed Mekkouri's avatar
      4b8d1f7f
  6. 13 Jan, 2025 10 commits