1. 30 May, 2022 1 commit
    • younesbelkada's avatar
      fex fixes · 20b95cb2
      younesbelkada authored
      
      fix tokenizer autodoc
      
      fix minor CI issues
      
      fix minor CI issues
      
      fix minor CI issues
      
      fix style issue
      
      fix minor import issues
      
      fix few issues
      
      remove def main on the test
      
      add require torch
      
      replace decorator with 'with'
      
      fix style
      
      change to bloom
      
      add quick fix tokenizer
      
      fix tokenizer file
      
      fix tokenizer
      
      - merge tests
      - small fixes
      
      fix import issue
      
      add bloom to readme
      
      fix consistency
      
      Update docs/source/en/model_doc/bloom.mdx
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      Apply suggestions from code review
      
      fix comment issues on file headers
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      fix doc issue
      
      small fix - modeling test
      
      some changes
      
      - refactor some code
      - taking into account reviews
      - more tests should pass
      - removed pruning tests
      
      remove useless division
      
      more tests should pass
      
      more tests should pass
      
      more tests should pass
      
      let's try this one
      
      -add alibi offset
      - remove all permutes to make the grad operations work
      - finger crossed
      
      Update data2vec.mdx to include a Colab Notebook link (that shows fine-tuning) (#17194)
      
      * Update data2vec.mdx
      
      * Update data2vec.mdx
      
      * Update docs/source/en/model_doc/data2vec.mdx
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      Dev version
      
      Add test to ensure models can take int64 inputs (#17210)
      
      * Add test to ensure models can take int64 inputs
      
      * is_integer is an attribute, not a method
      
      * Fix test when some inputs aren't tensors
      
      * Add casts to blenderbot and blenderbot-small
      
      * Add casts to the other failing models
      
      Fix dependency table
      
      update BART docs (#17212)
      
      Black preview (#17217)
      
      * Black preview
      
      * Fixup too!
      
      * Fix check copies
      
      * Use the same version as the CI
      
      * Bump black
      
      Fix typo in bug report template (#17178)
      
      * Fix typo
      
      * Force rerun workflows
      
      Co-authored-by: default avatarFelix Marty <felix@huggingface.co>
      
      Added translation of installation.mdx to Portuguese Issue #16824 (#16979)
      
      * Added translation of installation.mdx to Portuguese, as well
      as default templates of _toctree.yml and _config.py
      
      * [ build_documentation.yml ] - Updated doc_builder to build
      documentation in Portuguese.
      [ pipeline_tutorial.mdx ] - Created translation for the pipeline_tutorial.mdx.
      
      * [ build_pr_documentation.yml ] - Added pt language to pr_documentation builder.
      
      [ pipeline_tutorial.mdx ] - Grammar changes.
      
      * [ accelerate.mdx ] - Translated to Portuguese the acceleration tutorial.
      
      * [ multilingual.mdx ] - Added portuguese translation for multilingual tutorial.
      
      [ training.mdx ] - Added portuguese translation for training tutorial.
      
      * [ preprocessing.mdx ] - WIP
      
      * Update _toctree.yml
      
      * Adding Pré-processamento to _toctree.yml
      
      * Update accelerate.mdx
      
      * Nits and eliminate preprocessing file while it is ready
      
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      OPT-fix (#17229)
      
      * try fixes
      
      * Revert "try fixes"
      
      This reverts commit a8ad75ef69d4fc03a402ef61bd034b018aa8555e.
      
      * add correct shape
      
      * add correct path
      
      OPT - fix docstring and improve tests slighly (#17228)
      
      * correct some stuff
      
      * fix doc tests
      
      * make style
      
      Update self-push workflow (#17177)
      
      * update push ci
      
      * install git-python
      
      * update comment
      
      * update deepspeed jobs
      
      * fix report
      
      * skip 2 more tests that require fairscale
      
      * Fix changes in test_fetcher.py (to deal with `setup.py` is changed)
      
      * set RUN_PT_TF_CROSS_TESTS=1 and final clean-up
      
      * remove SIGOPT_API_TOKEN
      
      * remove echo "$matrix_folders"
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      fix --gpus option for docker (#17235)
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      Handle copyright in add-new-model-like (#17218)
      
      Fix Trainer for Datasets that don't have dict items (#17239)
      
      install dev. version of accelerate (#17243)
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      Fix push CI channel (#17242)
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      Add PR title to push CI report (#17246)
      
      * add PR title to push CI report
      
      * add link
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      [ fast_tokenizers.mdx ] - Added translation to portuguese to tutorial (#17076)
      
      * [ fast_tokenizers.mdx ] - Added translation to portuguese to tutorial
      
      * Delete docs/source/pt-br directory
      
      * [ fast_tokenizers.mdx ] - Continuing work on file
      
      * [ fast_tokenizers.mdx ] - Continuing work on file
      
      * Add fast tokenizers to _toctree.yml
      
      * Eliminated config and toctree.yml
      
      * Nits in fast_tokenizers.mdx
      
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      Translated version of model_sharing.mdx doc to spanish (#16184)
      
      * Translated version of model_sharing to spanish
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Update docs/source_es/model_sharing.mdx
      
      * Addind model sharing to _toctree.yml
      
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      Guide to create custom models in Spanish (#17158)
      
      * file copied and toctree updated
      
      * Intro and configuration translated
      
      * model section translated
      
      * enter hotfix
      
      * Translation over, correction pending
      
      * Typos and corrections
      
      * Update docs/source/es/create_a_model.mdx
      
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      * Update docs/source/es/create_a_model.mdx
      
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      * Update docs/source/es/create_a_model.mdx
      
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      * Update docs/source/es/create_a_model.mdx
      
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      Co-authored-by: default avatarOmar U. Espejel <espejelomar@gmail.com>
      
      Fix obvious typos in flax decoder impl (#17279)
      
      Change config.encoder_ffn_dim -> config.decoder_ffn_dim for decoder.
      
      TF - Fix convnext classification example (#17261)
      
      [WIP] [doc] performance/scalability revamp (#15723)
      
      * [doc] performance/scalability revamp
      
      * link the new docs
      
      * no :
      
      * mixed precision
      
      * work on the first doc
      
      * expand the main doc
      
      * Trigger CI
      
      * style
      
      * revamp single GPU training section
      
      * work on training performance
      
      * remove files not used anymore or will be added later
      
      * final touches
      
      * fix rebase
      
      * Add hardware section to toctree
      
      * fix toctree again
      
      * Apply suggestions from code review
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * remove `fast_tokenizers` entry that was copied in rebase
      
      * add warning about DP vs DDP
      
      * remove todo
      
      * Apply suggestions from code review
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * fix missing closure of codeblock
      
      * Update docs/source/en/perf_train_gpu_many.mdx
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * sync with #16860
      
      * update toc
      
      Co-authored-by: default avatarleandro <leandro.vonwerra@spoud.io>
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      fixed bug in run_mlm_flax_stream.py (#17203)
      
      * fixed bug run_mlm_flax_stream.py
      
      Fixed bug caused by an update to tokenizer keys introduced in recent transformers versions (between `4.6.2` and `4.18.0`) where additional keys were introduced to the tokenizer output.
      
      * Update run_mlm_flax_stream.py
      
      * adding missing paranthesis
      
      * formatted to black
      
      * remove cols from dataset instead
      
      * reformat to black
      
      * moved rem. columns to map
      
      * formatted to black
      
      Co-authored-by: default avatarKennethEnevoldsen <kennethcenevolsen@gmail.com>
      
       Updated checkpoint support for Sagemaker Model Parallel (#17219)
      
      * adding partial checkpoint support for optimizer state
      
      * formatted trainer.py
      
      * Refactoring based on comments
      
      * reformatting
      
      * Update src/transformers/trainer.py
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/trainer.py
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply suggestions from code review
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Update src/transformers/trainer.py
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      Co-authored-by: default avatarCavdar <dcavdar@a07817b12d7e.ant.amazon.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      Update codeparrot data preprocessing (#16944)
      
      * add new preprocessing arguments
      
      * add new filters
      
      * add new filters to readme
      
      * fix config and test count, update function names and docstrings
      
      * reformat code
      
      * update readme
      
      * Update readme
      
      * rename config_test filter
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * rename few_assignments filter
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * rename tokenizer in arguments
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * rename functions and add limit_line argument for config_test filter
      
      * update threshold for config_test filter
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      Co-authored-by: default avatarLoubna ben allal <loubnabenallal@gmail.com>
      
      CodeParrot data pretokenization (#16932)
      
      * add pretokenization arguments
      
      * add pretokenization script
      
      * add support for pretokenized data
      
      * reformat code
      
      * fix run command for training
      
      * fix model call from config
      
      * remove a package
      
      * add comments on pretokenization in the readme
      
      * remove explicit parallelization
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * update readme
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * update readme -remove username
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * update readme -remove username
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      * keep data parallelization
      
      * reformat code
      
      * reformat code
      
      * update readme
      
      * reformat code
      
      * Update examples/research_projects/codeparrot/README.md
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      
      Co-authored-by: default avatarLeandro von Werra <lvwerra@users.noreply.github.com>
      Co-authored-by: default avatarLoubna ben allal <loubnabenallal@gmail.com>
      
      Remove next sentence prediction from supported ONNX tasks (#17276)
      
      Align logits and labels in OPT (#17237)
      
      Mlflowcallback fix nonetype error (#17171)
      
      * Fix edge cases TypeError: 'NoneType' object is not callable
      
      * fix style
      
      Automatically sort auto mappings (#17250)
      
      * Automatically sort auto mappings
      
      * Better class extraction
      
      * Some auto class magic
      
      * Adapt test and underlying behavior
      
      * Remove re-used config
      
      * Quality
      
      Make TrainerHyperParameterSigOptIntegrationTest slow test (#17288)
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      Better error in the Auto API when a dep is missing (#17289)
      
      Fix FlavaForPreTrainingIntegrationTest CI test (#17232)
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      Use the PR URL in CI report (#17269)
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      logging documentation update (#17174)
      
      * logging documentation
      
      * style
      
      Co-authored-by: default avatarSander Land <sander@chatdesk.com>
      
      docs(transformers): fix typo (#17263)
      
      Add Tensorflow Swin model (#16988)
      
      Co-authored-by: default avatarMatt <Rocketknight1@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      [Tests] Fix slow opt tests (#17282)
      
      * fix opt tests
      
      * remove unused tok
      
      * make style
      
      * make flake8 happy
      
      * Update tests/models/opt/test_modeling_opt.py
      
      Fix test_model_parallelization (#17249)
      
      * Fix test_model_parallelization
      
      * Modify
      
      Add Wav2Vec2Conformer (#16812)
      
      * save intermediate
      
      * add wav2vec2 conformer
      
      * add more code
      
      * more
      
      * first test passes
      
      * make all checkpoints work
      
      * update
      
      * up
      
      * more clean ups
      
      * save clean-up
      
      * save clean-up
      
      * save more
      
      * remove bogus
      
      * finalize design conformer
      
      * remove vision
      
      * finish all tests
      
      * more changes
      
      * finish code
      
      * add doc tests
      
      * add slow tests
      
      * fix autoconfig test
      
      * up
      
      * correct docstring
      
      * up
      
      * update
      
      * fix
      
      * Apply suggestions from code review
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarAnton Lozhkov <aglozhkov@gmail.com>
      
      * Update docs/source/en/model_doc/wav2vec2-conformer.mdx
      
      * upload
      
      * save copied from
      
      * correct configs
      
      * fix model outputs
      
      * add to docs
      
      * fix imports
      
      * finish
      
      * finish code
      
      * correct copied from
      
      * correct again
      
      * correct make fix
      
      * improve make fix copies
      
      * save
      
      * correct fix copy from
      
      * correct init structure
      
      * correct
      
      * fix import
      
      * apply suggestions
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarAnton Lozhkov <aglozhkov@gmail.com>
      
      Fix missing job action button in CI report  (#17270)
      
      * use matrix.machine_type
      
      * fix job names used in job_link
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      Fix wrong PT/TF categories in CI report (#17272)
      
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      
      [ConvNeXT] Fix drop_path_rate (#17280)
      
      * Fix drop_path_rate
      
      * Fix TF's drop path rate
      
      fix retribert's `test_torch_encode_plus_sent_to_model` (#17231)
      
      Fix tests of mixed precision now that experimental is deprecated (#17300)
      
      * Fix tests of mixed precision now that experimental is deprecated
      
      * Fix mixed precision in training_args_tf.py too
      
      Rewrite TensorFlow train_step and test_step (#17057)
      
      * Initial commit
      
      * Better label renaming
      
      * Remove breakpoint before pushing (this is your job)
      
      * Test a lot more in the Keras fit() test
      
      * make fixup
      
      * Clarify the case where we flatten y dicts into tensors
      
      * Clarify the case where we flatten y dicts into tensors
      
      * Extract label name remapping to a method
      
      correct opt (#17301)
      
      refactor
      
      - refactor code
      - style changes
      - add new threshold for test
      
      major changes
      
      - change BLOOM to Bloom
      - add quick doc on bloom.mdx
      - move embeddings test on modeling test
      
      modify readme
      
      small fixes
      
      small fix
      
      - better threshold for a test
      
      remove old test file from fetcher
      
      fix small typo
      
      major change
      
      - change BloomLMHead to BloomForCausalLM
      
      remove onnx config
      
      major changes
      
      - refactor the code
      - remove asserts
      - change tol for test
      
      make style
      
      small change
      
      adding a slow test + commenting old ones for now
      
      make style
      
      Apply suggestions from code review
      
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      make style
      
      fix duplicates
      
      cleaning comments on config
      
      clean a bit conversion file
      
      refacor a bit modeling file
      
      refactor tokenizer file
      
      fix tokenization test issue
      
      fix tokenization issue second try
      
      fix tokenization issue #2
      
      fix test issue
      
      make style + add suggestions
      
      change test fetcher
      
      try this one
      
      - slow tests should pass
      - finger crossed
      
      possible final changes
      
      make style
      
      try fix padding side issue
      
      fix side
      
      fix padding issue
      
      fix ko-readme
      
      fix config auto
      
      cleaning modeling file
      
      keep bloom in caps in ko
      
      update config docs
      
      remove pretraining_pp
      
      remove model parallel
      
      update config
      
      - add correct config files
      
      fix duplicates
      
      fix fetcher
      
      fix refactor issue
      
      - remove divide function
      
      try to remove alibi
      
      small fixes
      
      - fix alibi
      - remove seq length
      - refactor a bit the code
      
      put correct values
      
      - fix bos and eos token ids
      
      fix attention mask loop
      
      Co-authored-by: default avatarthomasw21 <24695242+thomasw21@users.noreply.github.com>
      
      small fixes:
      
      - remove skip bias add
      
      small fixes
      
      - fix typo in readme
      - fix typos in config
      
      small changes
      
      - remove a test
      - add reconstruction test
      - change config
      
      small changes
      
      - change Scaled Softmax to BloomScaledSoftmax
      
      small fixes
      
      - fix alibi dtype
      
      major changes
      
      - removing explicit dtype when loading modules
      - fixing test args (torch_dtype=auto)
      - add dosctring
      
      fix readmes
      
      major changes
      
      - now bloom supports alibi shifting
      - refactor a bit the code
      - better test tolerance now
      
      refactor a bit
      
      refactor a bit
      
      put correct name on test
      
      change docstring
      
      small changes
      
      - fix docstring modeling
      - fix test tolerance
      
      fix small nit
      
      - take dtype from tensors in the conversion script
      
      minor fix
      
      - fix mdx issue
      
      minor fix
      
      - change config docstring
      
      forward contrib credits from PR14084
      
      Apply suggestions from code review
      
      Co-authored-by: default avatarStas Bekman <stas00@users.noreply.github.com>
      
      apply modifications
      
      Co-authored-by: default avatarStas Bekman <stas00@users.noreply.github.com>
      
      resolve softmax upcast
      
      Apply suggestions from code review
      
      Co-authored-by: default avatarStas Bekman <stas00@users.noreply.github.com>
      
      Update src/transformers/models/bloom/modeling_bloom.py
      
      Co-authored-by: default avatarNiklas Muennighoff <n.muennighoff@gmail.com>
      
      final changes modeling
      
      Co-authored-by: default avatarStas Bekman <stas00@users.noreply.github.com>
      
      Merge commit 'd156898f
      
      '
      
      merge commit
      
      Apply suggestions from code review
      
      Co-authored-by: default avatarStas Bekman <stas00@users.noreply.github.com>
      
      apply suggestions
      
      Apply suggestions from Stas comments
      Co-authored-by: default avatarStas Bekman <stas00@users.noreply.github.com>
      20b95cb2
  2. 12 May, 2022 19 commits
  3. 11 May, 2022 13 commits
  4. 10 May, 2022 7 commits