Commits · robust_config_ckpt_check · zhusg / transformers-new

10 Nov, 2023 9 commits

fix · ecded403
ydshieh authored 1 year ago

ecded403

Normalize floating point cast (#27249) · ed115b34

amyeroberts authored 1 year ago

* Normalize image - cast input images to float32.

This is done if the input image isn't of floating type. Issues can occur when do_rescale=False is set in an image processor. When this happens, the image passed to the call is of type uint8 because of the type casting that happens in resize via the PIL image library. As the mean and std values are cast to match the image dtype, this can cause NaNs and infs to appear in the normalized image, as the floating-point values used to divide the image are now set to 0.

The reason the mean and std values are cast is because previously they were set as float32 by default. However, if the input image was of type float16, the normalization would result in the image being upcast to float32 too.

* Add tests

* Remove float32 cast
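
The failure mode described in this commit message can be reproduced in a few lines. The following is a minimal NumPy sketch, not the library's actual code: the `normalize` helper, its `cast_stats_to_image_dtype` flag, and the sample mean/std values are assumptions made for illustration.

```python
import numpy as np


def normalize(image, mean, std, cast_stats_to_image_dtype):
    """Channel-wise (image - mean) / std (hypothetical helper, for illustration only)."""
    mean = np.asarray(mean, dtype=np.float32)
    std = np.asarray(std, dtype=np.float32)
    if cast_stats_to_image_dtype:
        # Casting fractional stats to uint8 truncates them to 0.
        mean = mean.astype(image.dtype)
        std = std.astype(image.dtype)
    # Silence the divide-by-zero warnings so the effect can be inspected.
    with np.errstate(divide="ignore", invalid="ignore"):
        return (image - mean) / std


# A tiny 2x2 RGB image left as uint8, as when rescaling is skipped.
image = np.array(
    [[[0, 128, 255], [10, 20, 30]], [[255, 255, 255], [0, 0, 0]]],
    dtype=np.uint8,
)
mean = [0.485, 0.456, 0.406]  # assumed example statistics
std = [0.229, 0.224, 0.225]

# uint8 image: std becomes 0, so the division yields inf and NaN.
broken = normalize(image, mean, std, cast_stats_to_image_dtype=True)

# Casting the image to float32 first keeps the stats fractional.
fixed = normalize(image.astype(np.float32), mean, std, cast_stats_to_image_dtype=True)
```

Casting the image rather than the statistics keeps the behaviour the message mentions for float16 inputs: stats still follow the image dtype, so a half-precision image is not silently upcast.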

ed115b34

Add Phi-1 and Phi-1_5 (#26170) · e1c3ac25

Susnato Dhar authored 1 year ago

* only dir not even init

* init

* tokenizer removed and reference of codegen added

* modeling file updated a lot remaining app_rotary_emb

* conversion script done

* conversion script fixed, a lot of factoring done and most tests pass

* added token_clf and extractive_QA_head

* integration tests pass

* flash attn tests pass!

* config done

* more docs in modeling file

* some style fix

* style and others

* doc test error fix

* more doc fix

* some attention fixes

* most fixes

* style and other fixes

* docs fix and config

* doc fix

* some comments

* conversion script updated

* conversion script updated

* Revert "conversion script updated"

This reverts commit e92378c54084ec0747041b113083d1746ecb6c7f.

* final comments

* add Phi to language_modeling.md

* edit phi.md file

* rebase and fix

* removed phi-1.5 example

* changed model_type from 'phi'->'mixformer-sequential'

* small change

* small change

* revert small change

* changed mixformer-sequential->phi

* small change

* added phi-1.5 example instead of phi-1

* doc test might pass now

* rebase and small change

* added the dropout layer

* more fixes

* modified .md file

* very very small doc change

e1c3ac25

At most 2 GPUs for CI (#27435) · 00dc8562

Yih-Dar authored 1 year ago


At most 2 GPUs

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

00dc8562

[`AttentionMaskConverter`] Fix-mask-inf (#27114) · 68afca3e

Arthur authored 1 year ago

* fix?

* actual fix

* fixups

* add dataclass to the attention mask converter

* refine testing suite

* make sure there are no overflows

* update the test

68afca3e

Add CLVP (#24745) · 7e9f10ac

Susnato Dhar authored 1 year ago

* init commit

* attention arch done except rotary emb

* rotary emb done

* text encoder working

* outputs matching

* arch first pass done

* make commands done, tests and docs remaining

* all tests passed, only docs remaining

* docs done

* doc-builder fix

* convert script removed(not relevant)

* minor comments done

* added ckpt conversion script

* tokenizer done

* very minor fix of index.md 2

* mostly make fixup related

* all done except fe and rotary emb

* very small change

* removed unidecode dependency

* style changes

* tokenizer removed require_backends

* added require_inflect to tokenizer tests

* removed VOCAB_FILES in tokenizer test

* inflect dependency removed

* added rotary pos emb cache and simplified the apply method

* style

* little doc change

* more comments

* feature extractor added

* added processor

* auto-regressive config added

* added CLVPConditioningEncoder

* comments done except the test one

* weights added successfully (NOT tested)

* tokenizer fix with numbers

* generate outputs matching

* almost tests passing Integ tests not written

* Integ tests added

* major CUDA error fixed

* docs done

* rebase and multiple fixes

* fixed rebase overwrites

* generate code simplified and tests for AutoRegressive model added

* minor changes

* refactored gpt2 code in clvp file

* weights done and all code refactored

* mostly done except the fast_tokenizer

* doc test fix

* config file's doc fixes

* more config fix

* more comments

* tokenizer comments mostly done

* modeling file mostly refactored and can load modules

* ClvpEncoder tested

* ClvpDecoder, ClvpModel and ClvpForCausalLM tested

* integration and all tests passed

* more fixes

* docs almost done

* ckpt conversion refactored

* style and some failing tests fix

* comments

* temporary output fix but test_assisted_decoding_matches_greedy_search test fails

* majority changes done

* use_cache outputs same now! Along with the asisted_greedy_decoding test fix

* more comments

* more comments

* prepare_inputs_for_generation fixed and _prepare_model_inputs added

* style fix

* clvp.md change

* moved clvpconditionalencoder norms

* add model to new index

* added tokenizer input_ids_with_special_tokens

* small fix

* config mostly done

* added config-tester and changed conversion script

* more comments

* comments

* style fix

* some comments

* tokenizer changed back to prev state

* small comments

* added output hidden states for the main model

* style fix

* comments

* small change

* revert small change

* .

* Update clvp.md

* Update test_modeling_clvp.py

* :)

* some minor change

* new fixes

* remove to_dict from FE

7e9f10ac

update Bark FA2 docs (#27400) · 9dd58c53

Yoach Lacombe authored 1 year ago


* update Bark FA2 docs

* update benchmark section

* Update bark.md

* Apply suggestions from code review

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* rephrase

---------

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

9dd58c53

[`Quantization`] Add str to enum conversion for AWQ (#27320) · fd685cfd

Younes Belkada authored 1 year ago


* add str to enum conversion

* fixup

* Apply suggestions from code review

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
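
A string-to-enum conversion of the kind this commit title describes might look like the sketch below. It uses only the standard library; `PackingVersion`, its members, and `to_packing_version` are hypothetical names chosen for illustration and are not taken from the library's AWQ configuration code.

```python
from enum import Enum


class PackingVersion(str, Enum):
    """Hypothetical enum standing in for a quantization option."""

    GEMM = "gemm"
    GEMV = "gemv"


def to_packing_version(value):
    """Accept either an enum member or its string name, case-insensitively."""
    if isinstance(value, PackingVersion):
        return value
    try:
        return PackingVersion(str(value).lower())
    except ValueError:
        valid = ", ".join(member.value for member in PackingVersion)
        raise ValueError(
            f"Unknown packing version {value!r}; expected one of: {valid}"
        ) from None


# Strings from a serialized config and enum members are both accepted.
from_string = to_packing_version("GEMM")
from_member = to_packing_version(PackingVersion.GEMV)
```

Normalizing at the boundary means the rest of the code can compare against enum members only, while configs loaded from JSON can still carry plain strings.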

fd685cfd

add attention_mask and position_ids in assisted model (#26892) · 184f60dc

jiqing-feng authored 1 year ago

* add attention_mask and position_ids in assisted model

* fix bug

* fix attention mask

* fix attention_mask

* check assist inputs

* check assist input ids length

* fix assist model type

* set assist attention mask device

184f60dc

09 Nov, 2023 15 commits
- Run all tests if `circleci/create_circleci_config.py` is modified (#27413) · cf32c941
  Yih-Dar authored 1 year ago
```
* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  cf32c941
- Fix `Owlv2` checkpoint name and a default value in `Owlv2VisionConfig` (#27402) · 740cd935
  Yih-Dar authored 1 year ago
```
* fix

* fix

* fix

* fix

* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  740cd935
- remove failing tests and clean FE files (#27414) · 51a98c40
  Yoach Lacombe authored 1 year ago
```
* remove failing tests and clean FE files

* remove same similar text from tvlt
```
  51a98c40
- Fix RequestCounter to make it more future-proof (#27406) · e38348ae
  Lucain authored 1 year ago
```
* Fix RequestCounter to make it more future-proof

* code quality
```
  e38348ae
- Final fix of the accelerate installation issue (#27408) · c8b6052f
  Yih-Dar authored 1 year ago
```
* fix

* [test-all] commit

* fix

* [test-all] commit

* [test-all] commit

* fix

* fix

* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  c8b6052f
- Use editable install for git deps (#27404) · c5037b45
  Zach Mueller authored 1 year ago
```
* Use editable install

* Full command
```
  c5037b45
- Fix fuyu checkpoint repo in `FuyuConfig` (#27399) · cf2a3f37
  Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  cf2a3f37
- use `pytest.mark` directly (#27390) · 3258ff93
  Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  3258ff93
- Adds dvclive callback (#27352) · 791ec370
  Dave Berenbaum authored 1 year ago
```
* dvclive trainer callback

* style fixes

* dvclive link fixes
```
  791ec370
- device-agnostic deepspeed testing (#27342) · c5d7754b
  Hz, Ji authored 1 year ago
  
  c5d7754b
- Skip failing cache call tests (#27393) · 9999b739
  amyeroberts authored 1 year ago
```
* Skip failing cache call tests

* Fixup
```
  9999b739
- Put doctest options back to `pyproject.toml` (#27366) · bc086a25
  Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  bc086a25
- Change thresh in test (#27378) · e9adb0c9
  Zach Mueller authored 1 year ago
```
Change thresh
```
  e9adb0c9
- [`CodeLlamaTokenizer`] Nit, update __init__ to make sure the AddedTokens are... · 085ea7e5
  Arthur authored 1 year ago
```
[`CodeLlamaTokenizer`] Nit, update __init__ to make sure the AddedTokens are not normalized because they are special (#27359)

* make sure tokens are properly initialized for codellama slow

* add more pretrained models

* style

* test more tokenizers checkpoints
```
  085ea7e5
- Smangrul/fix failing ds ci tests (#27358) · 7ecd229b
  Sourab Mangrulkar authored 1 year ago
```
* fix failing DeepSpeed CI tests due to `safetensors` being default

* debug

* remove debug statements

* resolve comments

* Update test_deepspeed.py
```
  7ecd229b
08 Nov, 2023 13 commits

translate debugging.md to chinese (#27374) · ced9fd86
jiaqiw09 authored 1 year ago
```
* update

* update
```
ced9fd86
Update deprecated `torch.range` in `test_modeling_ibert.py` (#27355) · 0e402e14
Sergii Dymchenko authored 1 year ago
```
* Update deprecated torch.range

* Remove comment
```
0e402e14

Add Flash Attention 2 support to Bark (#27364) · a5bee89c

Yoach Lacombe authored 1 year ago


* change handmade attention mask to _prepare_4d_attention_mask

* add flashattention2 support in Bark

* add flashattention2 tests on BarkSemanticModel

* make style

* fix flashattention and tests + make style

* fix memory leak and allow Bark to pass flash attention to sub-models

* make style

* Apply suggestions from code review

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove unnecessary code from tests + justify overriding

* Update tests/models/bark/test_modeling_bark.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* make style

---------

Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

a5bee89c

translate big_models.md and performance.md to chinese (#27334) · ef716736

jiaqiw09 authored 1 year ago

* translate performance.md

* translate performance.md and big_models.md

* update translation

* update review

ef716736

Fix tiny model script: not using `from_pt=True` (#27372) · bd8f45b1
Yih-Dar authored 1 year ago
```
fix

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
bd8f45b1
[Flax Whisper] large-v3 compatibility (#27360) · 7b175cfa
Sanchit Gandhi authored 1 year ago

7b175cfa
Remove unused param from example script tests (#27354) · 845aa832
Zach Mueller authored 1 year ago
```
Unused param
```
845aa832

Translate index.md to Turkish (#27093) · eb30a49b

Mert Yanık authored 1 year ago


* Add index.md for Turkish language

* Fix index.md (huggingface/transformers#27088)

* Add 'tr' to additional files

* Update docs/source/tr/_toctree.yml

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update index.md

---------

Co-authored-by: Mert Yanık <mert.yanik@lcwaikiki.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

eb30a49b

MusicGen Update (#27084) · f16ff0f0

Sanchit Gandhi authored 1 year ago

* [MusicGen] Add stereo model

* safe serialization

* Update src/transformers/models/musicgen/modeling_musicgen.py

* split over 2 lines

* fix slow tests on cuda

f16ff0f0

Fix `Kosmos-2` device issue (#27346) · 5ef650b0

Yih-Dar authored 1 year ago


* fix

* fix

* fix

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5ef650b0

Fix example tests from failing (#27353) · efa57cb2
Zach Mueller authored 1 year ago
```
* Fix example tests from failing

* Change thresh
```
efa57cb2
moving example of benchmarking to legacy dir (#27337) · b6dbfee0
Hz, Ji authored 1 year ago
```
move example of benchmarking to legacy
```
b6dbfee0

Add numpy alternative to FE using torchaudio (#26339) · be74b2ea

Yoach Lacombe authored 1 year ago

* add audio_utils usage in the FE of SpeechToText

* clean unnecessary parameters of AudioSpectrogramTransformer FE

* add audio_utils usage in AST

* add serialization tests and function to FEs

* make style

* remove use_torchaudio and move to_dict to FE

* test audio_utils usage

* make style and fix import (remove torchaudio dependency import)

* fix torch dependency for jax and tensor tests

* fix typo

* clean tests with suggestions

* add lines to test if is_speech_available is False

be74b2ea

07 Nov, 2023 3 commits

translate model_sharing.md and llm_tutorial.md to chinese (#27283) · e2647450

jiaqiw09 authored 1 year ago

* translate model_sharing.md

* translate llm_tutorial.md to Chinese

* update wrong translation

* update _toctree.yml

* update typos

* update

e2647450

translate the en tokenizer_summary.md to Chinese (#27291) · f213d5dd
九是否随意的称呼 authored 1 year ago
```
* translate the en tokenizer_summary.md to Chinese

* revise WordPiece

* add to source/zh/_toctree.yml
```
f213d5dd

Allow scheduler parameters (#26480) · 7e1eff76

Plemeur authored 1 year ago


* Allow for scheduler kwargs

* Formatting

* Arguments checks, passing the tests

* Black failed somehow

---------

Co-authored-by: Pierre <pierre@avatarin.com>

7e1eff76