- 25 Sep, 2024 5 commits
- 24 Sep, 2024 8 commits
- ydshieh authored
- ydshieh authored
- ydshieh authored
- ydshieh authored
- Arthur authored
  * update example
  * update
  * push the converted diff files for testing and ci
  * correct one example
  * fix class attributes and docstring
  * nits
  * oops
  * fixed config!
  * update
  * nits
  * class attributes are not matched against the other, this is missing
  * fixed overwriting self.xxx now onto the attributes I think
  * partial fix, now order with docstring
  * fix docstring order?
  * more fixes
  * update
  * fix missing docstrings!
  * examples don't all work yet
  * fixup
  * nit
  * updated
  * hick
  * update
  * delete
  * update
  * update
  * update
  * fix
  * all default
  * no local import
  * fix more diff
  * some fixes related to "safe imports"
  * push fixes
  * add helper!
  * style
  * add a check
  * all by default
  * add the
  * update
  * FINALLY!
  * nit
  * fix config dependencies
  * man that is it
  * fix fix
  * update diffs
  * fix the last issue
  * re-default to all
  * all the fixes
  * nice
  * fix properties vs setter
  * fixup
  * updates
  * update dependencies
  * make sure to install what needs to be installed
  * fixup
  * quick fix for now
  * fix!
  * fixup
  * update
  * update
  * updates
  * whitespaces
  * nit
  * fix
  * simplify everything, and make it file-agnostic (should work for image processors)
  * style
  * finish fixing all import issues
  * fixup
  * empty modeling should not be written!
  * Add logic to find who depends on what
  * update
  * cleanup
  * update
  * update gemma to support positions
  * some small nits
  * this is the correct docstring for gemma2
  * fix merging of docstrings
  * update
  * fixup
  * update
  * take doc into account
  * styling
  * update
  * fix hidden activation
  * more fixes
  * final fixes!
  * fixup
  * fixup instruct blip video
  * update
  * fix bugs
  * align gemma2 with the rest as well
  * updates
  * revert
  * update
  * more reversion
  * grind
  * more
  * arf
  * update
  * order will matter
  * finish del stuff
  * update
  * rename to modular
  * fixup
  * nits
  * update makefile
  * fixup
  * update order of the checks!
  * fix
  * fix docstring that has a call inside
  * fix conversion check
  * style
  * add some initial documentation
  * update
  * update doc
  * some fixup
  * updates
  * yups
  * Mostly todo, gimme a minute
  * update
  * fixup
  * revert some stuff
  * Review docs for the modular transformers (#33472): Docs
  * good update
  * fixup
  * mmm current updates lead to this code
  * okay, this fixes it
  * cool
  * fixes
  * update
  * nit
  * updates
  * nits
  * fix doc
  * update
  * revert bad changes
  * update
  * updates
  * proper update
  * update
  * update?
  * up
  * update
  * cool
  * nits
  * nits
  * bon bon
  * fix
  * ?
  * minimise changes
  * update
  * update
  * update
  * updates?
  * fixed gemma2
  * kind of a hack
  * nits
  * update
  * remove `diffs` in favor of `modular`
  * fix make fix copies
  Co-authored-by: Lysandre Debut <hi@lysand.re>
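For context on what these commits build: a "modular" file is a compact definition that inherits from an existing model and is expanded into a full, standalone modeling file by a converter script. A minimal sketch with hypothetical model names; the converter path (`utils/modular_model_converter.py` in current transformers) should be treated as an assumption here, not as this PR's exact layout:

```python
# Hypothetical modular_my_model.py: everything not overridden is inherited
# from Llama, and the generator copies it out into modeling_my_model.py.
from transformers.models.llama.modeling_llama import LlamaForCausalLM, LlamaModel


class MyModelModel(LlamaModel):  # reuse Llama's architecture unchanged
    pass


class MyModelForCausalLM(LlamaForCausalLM):
    pass
```

The standalone modeling file would then be generated with something like `python utils/modular_model_converter.py --files_to_parse src/transformers/models/my_model/modular_my_model.py`.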
- Yoni Gozlan authored
  * uniformize git processor
  * update docstring
- Tibor Reiss authored
  * Fix error string after refactoring into get_chat_template
  * Take suggestion from CR
    Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
  Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
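For orientation, the error string in question surfaces through the public chat-template API; a minimal sketch, assuming a tokenizer that ships without a configured chat template:

```python
# apply_chat_template raises a ValueError carrying the (now corrected) error
# message when tokenizer.chat_template is unset; "gpt2" is only an example
# of such a tokenizer.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
messages = [{"role": "user", "content": "Hello!"}]
try:
    tok.apply_chat_template(messages, tokenize=False)
except ValueError as err:
    print(err)  # the error string fixed in this commit
```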
- jiqing-feng authored
  * enable cpu bnb path
  * fix style
  * fix code style
  * fix 4 bit path
  * Update src/transformers/utils/import_utils.py
    Co-authored-by: Aarni Koskela <akx@iki.fi>
  * add multi backend refactor tests
  * fix style
  * tweak 4bit quantizer + fix corresponding tests
  * tweak 8bit quantizer + *try* fixing corresponding tests
  * fix dequant bnb 8bit
  * account for Intel CPU in variability of expected outputs
  * enable cpu and xpu device map
  * further tweaks to account for Intel CPU
  * fix autocast to work with both cpu + cuda
  * fix comments
  * fix comments
  * switch to testing_utils.torch_device
  * allow for xpu in multi-gpu tests
  * fix tests 4bit for CPU NF4
  * fix bug with is_torch_xpu_available needing to be called as func
  * avoid issue where test reports attr err due to other failure
  * fix formatting
  * fix typo from resolving of merge conflict
  * polish based on last PR review
    Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
  * fix CI
  * Update src/transformers/integrations/integration_utils.py
    Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  * Update src/transformers/integrations/integration_utils.py
    Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  * fix error log
  * fix error msg
  * add \n in error log
  * make quality
  * rm bnb cuda restriction in doc
  * cpu model doesn't need dispatch
  * fix doc
  * fix style
  * check cuda available in testing
  * fix tests
  * Update docs/source/en/model_doc/chameleon.md
    Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
  * Update docs/source/en/model_doc/llava_next.md
    Co-authored-by: Aarni Koskela <akx@iki.fi>
  * Update tests/quantization/bnb/test_4bit.py
    Co-authored-by: Aarni Koskela <akx@iki.fi>
  * Update tests/quantization/bnb/test_4bit.py
    Co-authored-by: Aarni Koskela <akx@iki.fi>
  * fix doc
  * fix check multibackends
  * fix import sort
  * remove check torch in bnb
  * docs: update bitsandbytes references with multi-backend info
  * docs: fix small mistakes in bnb paragraph
  * run formatting
  * revert bnb check
  * move bnb multi-backend check to import_utils
  * Update src/transformers/utils/import_utils.py
    Co-authored-by: Aarni Koskela <akx@iki.fi>
  * fix bnb check
  * minor fix for bnb
  * check lib first
  * fix code style
  * Revert "run formatting" (this reverts commit ac108c6d6b34f45a5745a736ba57282405cfaa61)
  * fix format
  * give warning when bnb version is low and no cuda found
  * fix device assignment check to be multi-device capable
  * address akx feedback on get_avlbl_dev fn
  * revert partially, as we don't want the function to be public (docs would then be enforced)
  Co-authored-by: Aarni Koskela <akx@iki.fi>
  Co-authored-by: Titus von Koeller <9048635+Titus-von-Koeller@users.noreply.github.com>
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
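What the multi-backend path enables, as a minimal sketch using the standard quantization API (assumes a bitsandbytes build with CPU/XPU support installed; the checkpoint is illustrative):

```python
# 4-bit NF4 loading without a CUDA GPU; before this work the bnb path was
# effectively CUDA-only.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m",              # illustrative checkpoint
    quantization_config=quant_config,
    device_map="cpu",                 # CPU/XPU device maps enabled by this PR
)
```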
- 23 Sep, 2024 7 commits
- Joao Gante authored
- Yoni Gozlan authored
  * Add optional kwargs and uniformize udop
  * cleanup Unpack
  * nit Udop
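The kwargs-uniformization work routes per-modality processor arguments through a single typed signature; a rough usage sketch under that assumption (checkpoint illustrative; the call is shown commented out since it needs a real document image):

```python
# After uniformization, shared arguments such as padding/truncation/
# return_tensors are accepted uniformly in one processor call.
from transformers import AutoProcessor

processor = AutoProcessor.from_pretrained("microsoft/udop-large")
# inputs = processor(images=image, text=words, boxes=boxes,
#                    padding=True, return_tensors="pt")
```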
- Isotr0py authored
  fix llavaqwen2 model conversion
- chengchengpei authored
  * add back `self.max_position_embeddings = config.max_position_embeddings`
  * fix-copies
- Pablo Montalvo authored
  * handle dependency errors in check_imports
  * change log level to warning
- Pablo Montalvo authored
  * fallback to eager if output attentions
  * fix copies
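The fallback described above follows the pattern transformers uses for SDPA attention generally; a minimal sketch with hypothetical class and helper names, not this PR's exact code:

```python
import torch.nn.functional as F


class SdpaAttention(EagerAttention):  # hypothetical names, for illustration
    def forward(self, hidden_states, attention_mask=None, output_attentions=False):
        if output_attentions:
            # SDPA cannot return attention probabilities, so delegate to the
            # eager implementation whenever they are requested.
            return super().forward(hidden_states, attention_mask, output_attentions)
        q, k, v = self.project_qkv(hidden_states)  # hypothetical helper
        out = F.scaled_dot_product_attention(q, k, v, attn_mask=attention_mask)
        return out, None  # no attention weights on the fast path
```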
- Pablo Montalvo authored
  clean up Unpack imports
- 21 Sep, 2024 2 commits
- Avishai Elmakies authored
  * add sdpa to dinov2
  * fixup
  * add dinov2 to sdpa doc
  * update doc order
  * [run-slow] dinov2
  * common to eager
  * [run-slow] dinov2
  * update attn implementation in common
  * update test_modeling_dinov2 to have mask_ratio, num_masks and mask_length similar to vit
  * [run-slow] dinov2
  Co-authored-by: Avishai Elmakies <avishai.elma@cs.huji.ac.il>
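Opting into the new DINOv2 SDPA path uses the standard `attn_implementation` switch:

```python
# Load DINOv2 with the newly added SDPA attention backend.
from transformers import AutoModel

model = AutoModel.from_pretrained("facebook/dinov2-base", attn_implementation="sdpa")
```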
- amyeroberts authored
  * Update pixtral example checkpoint
  * Fix typo
- 20 Sep, 2024 18 commits
- Mayank Mishra authored
  * first commit
  * drop tokenizer
  * drop tokenizer
  * drop tokenizer
  * drop convert
  * granite
  * drop tokenization test
  * mup
  * fix
  * reformat
  * reformat
  * reformat
  * fix docs
  * stop checking for checkpoint
  * update support
  * attention multiplier
  * update model
  * tiny drop
  * saibo drop
  * skip test
  * fix test
  * fix test
  * drop
  * drop useless imports
  * update docs
  * drop flash function
  * copied from
  * drop pretraining tp
  * drop pretraining tp
  * drop pretraining tp
  * drop unused import
  * drop code path
  * change name
  * softmax scale
  * head dim
  * drop legacy cache
  * rename params
  * cleanup
  * fix copies
  * comments
  * add back legacy cache
  * multipliers
  * multipliers
  * multipliers
  * text fix
  * fix copies
  * merge
  * multipliers
  * attention multiplier
  * drop unused imports
  * add granitemoe
  * add decoration
  * remove moe from sequenceclassification
  * fix test
  * fix
  * fix
  * fix
  * move rope?
  * merge
  * drop bias
  * drop bias
  * Update src/transformers/models/granite/configuration_granite.py
    Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  * fix
  * Update src/transformers/models/granite/modeling_granite.py
    Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
  * fix
  * fix
  * fix
  * fix
  * drop
  * drop
  * fix
  * fix
  * cleanup
  * cleanup
  * fix
  * fix granite tests
  * fp32 test
  * fix
  * drop jitter
  * fix
  * rename
  * rename
  * fix config
  * add gen test
  Co-authored-by: Yikang Shen <yikang.shn@gmail.com>
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
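A rough sketch of the mup-style scaling knobs the commits above mention (multipliers on attention, embeddings, residuals, and logits); the parameter names are inferred from the commit messages and should be treated as assumptions:

```python
# Tiny randomly initialized Granite model exercising the scaling parameters.
from transformers import GraniteConfig, GraniteForCausalLM

config = GraniteConfig(
    hidden_size=256,
    num_hidden_layers=2,
    num_attention_heads=4,
    attention_multiplier=1.0,   # scales attention scores (replaces 1/sqrt(d))
    embedding_multiplier=12.0,  # scales token embeddings
    residual_multiplier=0.22,   # scales each residual branch
    logits_scaling=8.0,         # divides the output logits
)
model = GraniteForCausalLM(config)
```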
- jiqing-feng authored
  * enable low-precision pipeline
  * fix parameter for ASR
  * reformat
  * fix asr bug
  * fix bug for zero-shot
  * add dtype check
  * rm useless comments
  * add np.float16 check
  * Update src/transformers/pipelines/image_classification.py
    Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
  * Update src/transformers/pipelines/token_classification.py
    Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
  * fix comments
  * fix asr check
  * make fixup
  * No more need for is_torch_available()
  Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
  Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
  Co-authored-by: Matt <rocketknight1@gmail.com>
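What "low-precision pipeline" means in user terms, as a minimal sketch (model id illustrative):

```python
# Run a pipeline end to end in half precision; the fixes above make task
# pipelines (ASR, zero-shot, classification, ...) handle fp16/bf16 outputs.
import torch
from transformers import pipeline

pipe = pipeline(
    "image-classification",
    model="google/vit-base-patch16-224",
    torch_dtype=torch.float16,
)
```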
- litianjian authored
  Co-authored-by: litianjian <litianjian@bytedance.com>
- GeLee authored
  * fix qwen2vl float16 inference bug
  * [run-slow] qwen2_vl
- Yih-Dar authored
  * update
  * re-enable daily CI
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Yih-Dar authored
  * fix
  * fix
  * fix
  * fix
  * skip
  * skip more
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Joao Gante authored
- Duc-Viet Hoang authored
  * fix: handle padding in contrastive search for decoder-only models
  * fix: handle padding in contrastive search for encoder-decoder models
  * tests: move padding contrastive test to test_util, add t5 test
  * fix: handle if model_kwargs["decoder_attention_mask"] is None
  * refactor: improve padding input contrastive search generation tests
  * chore: _ranking_fast to use LongTensor for cosine_matrix_mask
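The fixed scenario, sketched with the standard generation API: batched, left-padded inputs under contrastive search (activated by `penalty_alpha` + `top_k`); the checkpoint is illustrative:

```python
# Contrastive search on a padded batch; per the commits above, padded
# positions previously could skew the degeneration penalty (_ranking_fast).
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2", padding_side="left")
tok.pad_token = tok.eos_token
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tok(["Hello world", "Hi"], return_tensors="pt", padding=True)
out = model.generate(**inputs, penalty_alpha=0.6, top_k=4, max_new_tokens=20)
print(tok.batch_decode(out, skip_special_tokens=True))
```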
- Yoni Gozlan authored
  * add check and prepare args for BC to ProcessorMixin, improve ProcessorTesterMixin
  * change size and crop_size in processor kwargs tests to do_rescale and rescale_factor
  * remove unnecessary llava processor kwargs test overwrite
  * nit
  * change data_arg_name to input_name
  * Remove unnecessary test override
  * Remove unnecessary Paligemma tests
  * Move test_prepare_and_validate_optional_call_args to TesterMixin, add docstring
- Yih-Dar authored
  fix missing tests
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Joao Gante authored
- Omar Salman authored
  * Add sdpa for BioGpt
  * Updates
  * Add the docs
  * [run_slow] biogpt
  * Use the copy mechanism to ensure consistency
  * [run_slow] biogpt
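The "copy mechanism" here is transformers' `# Copied from` convention, enforced by `make fix-copies`; an illustrative sketch (the source class is hypothetical, not necessarily what BioGpt actually copies from):

```python
# A marker comment ties a definition to a canonical implementation elsewhere;
# CI then regenerates/validates the body so the two never drift apart.

# Copied from transformers.models.bart.modeling_bart.BartSdpaAttention with Bart->BioGpt
class BioGptSdpaAttention(BioGptAttention):
    ...
```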
- amyeroberts authored
  Remove model tests
- Joao Gante authored
  almost zero is not zero
- Lake Lee authored
  * Update modeling_mamba2.py: fix pad_size calculation to ensure it's less than self.chunk_size
  * [run_slow] mamba2
  * [run-slow] mamba2
  * [run-slow] Add @require_read_token decorator to failing tests for token propagation
  * [run_slow] mamba2
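The arithmetic behind the fix, sketched: pad the sequence length up to the next multiple of `chunk_size`, with the outer modulo guaranteeing `pad_size < chunk_size` (without it, an already-aligned length would get a full extra chunk of padding):

```python
def compute_pad_size(seq_len: int, chunk_size: int) -> int:
    # The outer modulo keeps the result strictly below chunk_size.
    return (chunk_size - seq_len % chunk_size) % chunk_size

assert compute_pad_size(256, 256) == 0  # already aligned: no padding
assert compute_pad_size(250, 256) == 6  # pad up to the next multiple
```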
- Fanli Lin authored
  * enable
  * fix
  * add xpu skip
  * add marker
  * skip for xpu
  * add more
  * enable on accelerator
  * add more cases
  * add more tests
  * add more
- Yih-Dar authored
  fix
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
- Yih-Dar authored
  fix
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>