1. 19 Apr, 2024 6 commits
    • ydshieh's avatar
      run · 04dd2e6c
      ydshieh authored
      04dd2e6c
    • Yoach Lacombe's avatar
      restore casting of masked_spec_embed · dcbf4404
      Yoach Lacombe authored
      dcbf4404
    • Yoach Lacombe's avatar
    • Yoach Lacombe's avatar
      restore casting of masked_spec_embed · a5520be6
      Yoach Lacombe authored
      a5520be6
    • Jacky Lee's avatar
      Enable multi-device for some models (#30207) · 30b45320
      Jacky Lee authored
      
      * feat: multidevice for resnet
      
      * feat: yes! resnet
      
      * fix: compare all elements in tuple
      
      * feat: support for regnet
      
      * feat: support for convnextv2
      
      * feat: support for bit
      
      * feat: support for cvt
      
      * feat: add support for focalnet
      
      * feat: support for yolos
      
      * feat: support for glpn
      
      * feat: support for imagegpt
      
      * feat: support for levit
      
      * feat: support for mgp_str
      
      * feat: support for mobilenet_v1

      * feat: support for mobilenet_v2
      
      * feat: support for mobilevit
      
      * feat: support for mobilevitv2
      
      * feat: support for poolformer
      
      * fix: copies
      
      * fix: code quality check
      
      * update: upstream changes from main
      
      * fix: consistency check
      
      * feat: support for sam
      
      * feat: support for switchformer
      
      * feat: support for swin
      
      * feat: support for swinv2
      
      * feat: support for timesformer
      
      * feat: support for trocr
      
      * feat: support for upernet
      
      * fix: check copies
      
      * update: rerun CI
      
      * update: rerun again, maybe
      
      * update: one more rerun
      
      ---------
      
      Co-authored-by: Jacky Lee <jackylee328@gmail.com>
      30b45320
    • NielsRogge's avatar
      [UDOP] Add special tokens to tokenizer (#29594) · ecfe9be7
      NielsRogge authored
      * Add special tokens
      
      * Add special tokens
      
      * Use fmt
      
      * Uncomment code
      
      * Add test
      
      * Remove scripts
      
      * Address comments
      
      * Improve tests
      
      * Address comment
      
      * Remove flag
      ecfe9be7
  2. 18 Apr, 2024 20 commits
  3. 17 Apr, 2024 14 commits
    • fxmarty's avatar
      Fix quality Olmo + SDPA (#30302) · ec92f983
      fxmarty authored
      fix olmo
      ec92f983
    • fxmarty's avatar
      Re-enable SDPA's FA2 path (#30070) · 05bdef16
      fxmarty authored
      
      * tentatively re-enable FA2 + SDPA
      
      * better comment
      
      * _ignore_causal_mask_sdpa as staticmethod
      
      * type hints
      
      * use past_seen_tokens instead
      
      * enable copied from for sdpa
      
      * ruff
      
      * llama simplifications on review
      
      * remove unnecessary self.is_causal check
      
      * fix copies
      
      * cleaning
      
      * precise message
      
      * better doc
      
      * add test
      
      * simplify
      
      * Update src/transformers/models/llama/modeling_llama.py
      
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/llama/modeling_llama.py
      
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * Update src/transformers/models/llama/modeling_llama.py
      
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      
      * style
      
      ---------
      
      Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
      05bdef16
    • Shane A's avatar
      Add OLMo model family (#29890) · e4ea19b9
      Shane A authored
      * Add OLMo using add-new-model-like with Llama
      
      * Fix incorrect tokenizer for OLMo
      
      * Copy-paste relevant OLMo methods and their imports
      
      * Add OLMo config
      
      * Modify OLMo config to follow HF conventions
      
      * Remove unneeded Llama code from OLMo model
      
      * Add ability for OLMo model to output attentions
      
      * Add OLMoPreTrainedModel and OLMoModel
      
      * Add OLMoForCausalLM
      
      * Minor fixes to OLMo model for style and missing functions
      
      * Implement OLMo tokenizer
      
      * Implement OLMo to HF conversion script
      
      * Add tests for OLMo model
      
      * Add tests for OLMo fast tokenizer
      
      * Add auto-generated dummy objects
      
      * Remove unimplemented OLMo classes from auto and init classes and re-format
      
      * Add README and associated auto-generated files
      
      * Use OLMo names for common properties
      
      * Run make fixup
      
      * Remove `|` from OLMo typing
      
      * Remove unneeded tokenization_olmo.py
      
      * Revert model, config and converter to add-new-model-like Llama
      
      * Move logic for adding bos/eos token into GPTNeoxTokenizerFast
      
      * Change OLMoConfig defaults to match OLMo-7B
      
      * Use GPTNeoXTokenizerFast in OLMo tokenizer tests
      
      * Modify auto-generated OLMoModelTests to work for OLMo
      
      * Add non-parametric layer norm OLMoLayerNorm
      
      * Update weight conversion script for OLMo
      
      * Fix __init__ and auto structure for OLMo
      
      * Fix errors from make fixup
      
      * Remove OLMoTokenizerFast from documentation
      
      * Add missing 'Copied from' for OLMoModel._update_causal_mask
      
      * Run make fix-copies
      
      * Rearrange string replacements in OLMoForCausalLM Copied from
      
      * Move OLMo and Llama CausalLM.forward example into global constants
      
      * Fix OLMO_GENERATION_EXAMPLE doc string typo
      
      * Add option for qkv clipping to OLMo
      
      * Rearrange OLMoConfig kwargs in convert_olmo_weights_to_hf
      
      * Add clip_qkv to OLMoConfig in convert_olmo_weights_to_hf
      
      * Fix OLMo tokenization bug using conversion script
      
      * Keep model in full precision after conversion
      
      * Do not add eos token automatically
      
      * Update references to OLMo model in HF Hub
      
      * Do not add eos token during encoding by default
      
      * Fix Llama generation example
      
      * Run make fixup
      
      * OLMo 7B integration test fix
      
      * Remove unneeded special case for OLMoConfig
      
      * OLMo 7B Twin 2T integration test fix
      
      * Fix test_model_7b_greedy_generation
      
      * Remove test_compile_static_cache
      
      * Fix OLMo and Llama generation example
      
      * Run make fixup
      
      * Revert "OLMo 7B integration test fix"
      
      This reverts commit 4df56a4b150681bfa559846f40e9b7b7f97d7908.
      
      * Revert "OLMo 7B Twin 2T integration test fix"
      
      This reverts commit 9ff65a4a294ace89ab047b793ca55e623a9ceefc.
      
      * Ungate 7B integration tests and fix greedy generation test
      
      * Add retries for flaky test_eager_matches_sdpa_generate
      
      * Fix output of doc example for OLMoForCausalLM.forward
      
      * Downsize OLMo doc test for OLMoForCausalLM.forward to 1B model
      
      * Try fix incorrect characters in OLMoForCausalLM.forward doc test
      
      * Try fix incorrect characters in OLMoForCausalLM.forward doc test using end quotes
      
      * Remove pretraining_tp from OLMo config and model
      
      * Add missing 'Copied from' instances
      
      * Remove unneeded causal_mask from OLMoModel
      
      * Revert Llama changes
      
      * Ignore copy for OLMoForCausalLM.forward
      
      * Change 'OLMo' to 'Olmo' in classes
      
      * Move minimal OLMo tokenization tests to model tests
      
      * Add missed 'Copied from' for repeat_kv
      e4ea19b9
    • Nicolas Patry's avatar
      Upgrading to tokenizers 0.19.0 (#30289) · 8e5f76f5
      Nicolas Patry authored
      * [DO NOT MERGE] Testing tokenizers 0.19.0rc0
      
      * Accounting for the breaking change.
      
      * Ruff.
      
      * Upgrading to tokenizers `0.19` (new release with prepend_scheme fixed
      and new surface for BPE tiktoken bug).
      8e5f76f5
    • Pavel Iakubovskii's avatar
      Add strategy to store results in evaluation loop (#30267) · c15aad09
      Pavel Iakubovskii authored
      * Add evaluation loop container for interm. results
      
      * Add tests for EvalLoopContainer
      
      * Formatting
      
      * Fix padding_index in test and typo
      
      * Move EvalLoopContainer to pr_utils to avoid additional imports
      
      * Fix `eval_do_concat_batches` arg description
      
      * Fix EvalLoopContainer import
      c15aad09
    • st81's avatar
      Add token type ids to CodeGenTokenizer (#29265) · 8d6b5096
      st81 authored
      * Add create token type ids to CodeGenTokenizer
      
      * Fix inconsistent length of token type ids
      
      * Format source codes
      
      * Fix inconsistent order of methods
      
      * Update docstring
      
      * add test_tokenizer_integration test
      
      * Format source codes
      
      * Add `copied from` comment to CodeGenTokenizerFast
      
      * Add doc of create_token_type_ids_from_sequences
      
      * Make return_token_type_ids False by default
      
      * Make test_tokenizer_integration as slow test
      
      * Add return_token_type_ids to tokenizer init arg
      
      * Add test for tokenizer's init return_token_type_ids
      
      * Format source codes
      8d6b5096
    • Younes Belkada's avatar
      FIX: Fix push important models CI (#30291) · 812a5de2
      Younes Belkada authored
      Update push-important-models.yml
      812a5de2
    • Yih-Dar's avatar
    • Yih-Dar's avatar
      05dab4e5
    • Raushan Turganbay's avatar
      Enable fx tracing for Mistral (#30209) · 304c6a1e
      Raushan Turganbay authored
      * tracing for mistral
      
      * typo
      
      * fix copies
      304c6a1e
    • Utkarsha Gupte's avatar
      Configuring Translation Pipelines documents update #27753 (#29986) · 98717cb3
      Utkarsha Gupte authored
      * Configuring Translation Pipelines documents update #27753
      
      Configuring Translation Pipelines documents update
      
      * Language Format Addition
      
      * adding supported list of languages list
      98717cb3
    • Younes Belkada's avatar
      FIX / AWQ: Fix failing exllama test (#30288) · 080b7008
      Younes Belkada authored
      fix failing exllama test
      080b7008
    • Yoach Lacombe's avatar
      41145247
    • fxmarty's avatar
      Fix SDPA sliding window compatibility (#30127) · 40eb6d6c
      fxmarty authored
      
      * fix sdpa + sliding window
      
      * give credit
      
      Co-authored-by: ehuaa <ehuamail@163.com>
      
      * remove unnecessary warning
      
      * fix typo
      
      * add test
      
      ---------
      
      Co-authored-by: ehuaa <ehuamail@163.com>
      40eb6d6c