1. 21 Jan, 2022 13 commits
  2. 20 Jan, 2022 9 commits
  3. 19 Jan, 2022 18 commits
    • jsnfly's avatar
      Fix usage of additional kwargs in `from_encoder_decoder_pretrained` in... · baf1ebe9
      jsnfly authored
      Fix usage of additional kwargs in `from_encoder_decoder_pretrained` in encoder-decoder models (#15056)
      
      * [EncoderDecoder] Add test for usage of extra kwargs
      
      * [EncoderDecoder] Fix usage of extra kwargs in from pretrained
      
      * [EncoderDecoder] apply suggested changes (passing **kwargs_encoder)
      
      * [EncoderDecoder] create new test function and make sure it passes
      
      Co-authored-by: default avatarjonas <jsnfly@gmx.de>
      baf1ebe9
    • Nicolas Patry's avatar
      Make chuking smartly (long files) work on asr ctc_with_lm. (#15219) · 3fefee99
      Nicolas Patry authored
      
      * [WIP] Make chuking smartly (long files) work on asr ctc_with_lm.
      
      * Slow test with functionality.
      
      * Fixing regular test.
      
      * fix for batch size 1
      
      * Handling batch outside `rescale_Stride`.
      
      - Renamed to `rescale_stride`.
      
      * Disable equality in the test.
      
      * Remove print.
      
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      3fefee99
    • NielsRogge's avatar
      Update Trainer code example (#15070) · 80f72960
      NielsRogge authored
      * Update code example
      
      * Fix code quality
      
      * Add comment
      80f72960
    • NielsRogge's avatar
      Add ViLT (#14895) · ac227093
      NielsRogge authored
      
      * First commit
      
      * Add conversion script
      
      * Make conversion script work for base model
      
      * More improvements
      
      * Update conversion script, works for vqa
      
      * Add indexing argument to meshgrid
      
      * Make conversion script work for ViltForPreTraining
      
      * Add ViltForPreTraining to docs
      
      * Fix device issue
      
      * Add processor
      
      * Add MinMaxResize to feature extractor
      
      * Implement call method of ViltProcessor
      
      * Fix tests
      
      * Add integration test
      
      * Add loss calculation for VQA
      
      * Improve tests
      
      * Improve some more tests
      
      * Debug tests
      
      * Small improvements
      
      * Add support for attention_mask
      
      * Remove mask_it
      
      * Add pixel_mask
      
      * Add tests for ViltFeatureExtractor
      
      * Improve tests
      
      * Add ViltForNaturalLanguageVisualReasoning
      
      * Add ViltForNaturalLanguageVisualReasoning to conversion script
      
      * Minor fixes
      
      * Add support for image_embeds, update docstrings to markdown
      
      * Update docs to markdown
      
      * Improve conversion script
      
      * Rename ViltForPreTraining to ViltForMaskedLM
      
      * Improve conversion script
      
      * Convert docstrings to markdown
      
      * Fix code example of retrieval model
      
      * Properly convert masked language model
      
      * Add integration test for nlvr
      
      * Fix code quality
      
      * Apply suggestions from code review
      
      * Add copied from statements
      
      * Fix pretrained_config_archive_map
      
      * Fix docs
      
      * Add model to README
      
      * Apply suggestions from code review
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply more suggestions from code review
      
      * Make code more readable
      
      * Add ViltForNaturalLanguageVisualReasoning to the tests
      
      * Rename ViltForVisualQuestionAnswering to ViltForQuestionAnswering
      
      * Replace pixel_values_2 by single tensor
      
      * Add hidden_states and attentions
      
      * Fix one more test
      
      * Fix all tests
      
      * Update year
      
      * Fix rebase issues
      
      * Fix another rebase issue
      
      * Remove ViltForPreTraining from auto mapping
      
      * Rename ViltForImageRetrievalTextRetrieval to ViltForImageAndTextRetrieval
      
      * Make it possible to use BertTokenizerFast in the processor
      
      * Use BertTokenizerFast by default
      
      * Rename ViltForNaturalLanguageVisualReasoning, define custom model output
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      ac227093
    • Patrick von Platen's avatar
      Update README.md (#15233) · 691878ee
      Patrick von Platen authored
      691878ee
    • Sylvain Gugger's avatar
      Fix checkpoint for ViT Config · f4b7420d
      Sylvain Gugger authored
      f4b7420d
    • Lysandre Debut's avatar
      Fix PR number (#15231) · 6a3c883c
      Lysandre Debut authored
      * Fix PR number
      
      * Fix PR number
      6a3c883c
    • Li-Huai (Allan) Lin's avatar
      Fix typo in BERT tokenization file (#15228) · f778edb7
      Li-Huai (Allan) Lin authored
      * Fix typo
      
      * Fix copies
      f778edb7
    • Suraj Patil's avatar
      fix speech event readme (#15227) · 2a5a3849
      Suraj Patil authored
      2a5a3849
    • NielsRogge's avatar
      [ViTMAE] Various fixes (#15221) · 842298f8
      NielsRogge authored
      * Add MAE to AutoFeatureExtractor
      
      * Add link to notebook
      
      * Fix relative paths
      842298f8
    • Patrick von Platen's avatar
      Update README.md (#15226) · 6d92c429
      Patrick von Platen authored
      6d92c429
    • Patrick von Platen's avatar
      Update README.md · 19c217b4
      Patrick von Platen authored
      19c217b4
    • Patrick von Platen's avatar
      Update README.md · 5439cda7
      Patrick von Platen authored
      5439cda7
    • Li-Huai (Allan) Lin's avatar
      Add FastTokenizer to REALM (#15211) · 841d9791
      Li-Huai (Allan) Lin authored
      * Remove BertTokenizer abstraction
      
      * Add FastTokenizer to REALM
      
      * Fix config archive map
      
      * Fix copies
      
      * Update realm.mdx
      
      * Apply suggestions from code review
      841d9791
    • Yih-Dar's avatar
    • Lysandre Debut's avatar
      Build dev documentation (#15210) · 653379c0
      Lysandre Debut authored
      * Wrap up
      
      * Remove secret
      
      * Fix path
      
      * Typo
      
      Revert image switch
      
      * Specific token for comments
      
      * Cleaner comments
      
      * Correct PR number
      
      * Explicit master install
      
      * Force uninstall
      653379c0
    • Matt's avatar
      Rename compute_loss in TF models (#15207) · 2708bfa1
      Matt authored
      * Rename compute_loss to hf_compute_loss to avoid conflicts with the new Keras method
      
      * make style
      
      * Adding deprecation warning to `compute_loss`
      
      * Fix sneaky reference to compute_loss
      
      * Replace logger.warning with warnings.warn
      
      * Clarifying warning and deprecation timeline
      2708bfa1
    • Kamal Raj's avatar
      [FLAX] glue training example refactor (#13815) · d1f5ca1a
      Kamal Raj authored
      * refactor run_flax_glue.py
      
      * updated readme
      
      * rm unused import and args typo fix
      
      * refactor
      
      * make consistent arg name across task
      
      * has_tensorboard check
      
      * argparse -> argument dataclasses
      
      * refactor according to review
      
      * fix
      d1f5ca1a