Commits · 19fb1e22d2bdadf6611e029a6ae82606d1520c5f · zhusg / transformers-new

06 Mar, 2024 10 commits

added the max_matching_ngram_size to GenerationConfig (#29131) · 19fb1e22


* added the max_matching_ngram_size parameter into the GenerationConfig, for the PromptLookupCandidateGenerator

* switched back to keyword arguments

* added PromptLookupCandidateGenerator docstring for its parameters

* ruff reformat

* Update src/transformers/generation/configuration_utils.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

19fb1e22

Generate: torch.compile-ready generation config preparation (#29443) · ddb4fda3
Joao Gante authored 1 year ago

ddb4fda3
Fix test failure on DeepSpeed (#29444) · 9322576e
Zach Mueller authored 1 year ago
```
* Fix test failure

* use item
```
9322576e
Avoid dummy token in PLD to optimize performance (#29445) · 0a5b0516
Ofir Zafrir authored 1 year ago

0a5b0516
Generate: get generation mode from the generation config instance 🧼 (#29441) · 700d48fb
Joao Gante authored 1 year ago

700d48fb
Generate: add tests for caches with `pad_to_multiple_of` (#29462) · 41f7b7ae
Joao Gante authored 1 year ago

41f7b7ae

Fix TrainingArguments regression with torch <2.0.0 for dataloader_prefetch_factor (#29447) · 2890116a

Matthew Hoffman authored 1 year ago

* Fix TrainingArguments regression with torch <2.0.0 for dataloader_prefetch_factor

dataloader_prefetch_factor was added to TrainingArguments in #28498 with the default value None, but  versions of torch<2.0.0 do not accept None and will raise an error if num_workers == 0 and prefetch_factor != 2

* Add is_torch_available() check

* Use is_torch_greater_or_equal_than_2_0

add back check for dataloader_prefetch_factor

2890116a

[`docs`] Add starcoder2 docs (#29454) · b27aa206

Younes Belkada authored 1 year ago


* add accelerate docs

* Apply suggestions from code review

Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

* Update starcoder2.md

* add correct generation

---------

Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

b27aa206

[`Docs` / `Awq`] Add docs on exllamav2 + AWQ (#29474) · 2a002d07
Younes Belkada authored 1 year ago
```
* add docs on exllamav2 + AWQ

* Update docs/source/en/quantization.md
```
2a002d07
[FIX] `offload_weight()` takes from 3 to 4 positional arguments but 5 were given (#29457) · 00bf4427
Fanli Lin authored 1 year ago
```
* use require_torch_gpu

* enable on XPU

* fix
```
00bf4427

05 Mar, 2024 16 commits

[i18n-KO] Translated generation_strategies.md to Korean (#29086) · 7b01579f

AI4Harmony authored 1 year ago


* Update ko _toctree.yml

* Create ko: generation_strategies.md

* Apply suggestions from code review

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

* Apply suggestions from code review

Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

7b01579f

[i18n-zh] Translate add_new_pipeline.md into Chinese (#29432) · 638c423c
Michael authored 1 year ago
```
* [i18n-zh] Translate add_new_pipeline.md into Chinese

* apply suggestions from Fan-Lin
```
638c423c

Automatic safetensors conversion when lacking these files (#29390) · a69cbf4e

Lysandre Debut authored 1 year ago

* Automatic safetensors conversion when lacking these files

* Remove debug

* Thread name

* Typo

* Ensure that raises do not affect the main thread

a69cbf4e

Update pytest `import_path` location (#29154) · 9c5e5609

Logan Adams authored 1 year ago

* Update to pull function from proper lib

* Fix ruff formatting error

* Remove accidently added file

9c5e5609

Fix bug with passing capture_* args to neptune callback (#29041) · 8f3f8e67

AleksanderWWW authored 1 year ago

* Fix bug with passing capture_* args to neptune callback

* ruff happy?

* instantiate (frozen)set only once

* code review

* code review 2

* ruff happy?

* code review

8f3f8e67

[`Add Mamba`] Adds support for the `Mamba` models (#28094) · fb1c62e9

Arthur authored 1 year ago


* initial-commit

* start cleaning

* small nits

* small nits

* current updates

* add kernels

* small refactoring little step

* add comments

* styling

* nit

* nits

* Style

* Small changes

* Push dummy mambda simple slow

* nit

* Use original names

* Use original names and remove norm

* Updates for inference params

* Style nd updates

* nits

* Match logits

* Add a test

* Add expected generated text

* nits doc, imports and styling

* style

* oups

* dont install kernels, invite users to install the required kernels

* let use use the original packages

* styling

* nits

* fix some copieds

* update doc

* fix-copies

* styling done

* nits

* fix import check

* run but wrong cuda ress

* mamba CUDA works :)

* fix the fast path

* config naming nits

* conversion script is not required at this stage

* finish fixing the fast path: generation make sense now!

* nit

* Let's start working on the CIs

* style

* better style

* more nits

* test nit

* quick fix for now

* nits

* nit

* nit

* nit

* nits

* update test rest

* fixup

* update test

* nit

* some fixes

* nits

* update test values

* fix styling

* nit

* support peft

* integrations tests require torchg

* also add slow markers

* styling

* chose forward wisely

* nits

* update tests

* fix gradient checkpointing

* fixup

* nit

* fix doc

* check copies

* fix the docstring

* fix some more tests

* style

* fix beam search

* add init schene

* update

* nit

* fix

* fixup the doc

* fix the doc

* fixup

* tentative update but slow is no longer good

* nit

* should we always use float32?

* nits

* revert wrong changes

* res in float32

* cleanup

* skip fmt for now

* update generation values

* update test values running original model

* fixup

* update tests + rename inference_params to cache_params + make sure training does not use cache_params

* small nits

* more nits

* fix final CIs

* style

* nit doc

* I hope final doc nits

* nit

* 🫠

* final touch!

* fix torch import

* Apply suggestions from code review

Co-authored-by: Lysandre Debut <hi@lysand.re>

* Apply suggestions from code review

* fix fix and fix

* fix base model prefix!

* nit

* Update src/transformers/models/mamba/__init__.py

* Update docs/source/en/model_doc/mamba.md

Co-authored-by: Lysandre Debut <hi@lysand.re>

* nit

---------

Co-authored-by: Lysandre Debut <hi@lysand.re>

fb1c62e9

Generate: inner decoding methods are no longer public (#29437) · 87a0783d
Joao Gante authored 1 year ago

87a0783d
[`Udop imports`] Processor tests were not run. (#29456) · 4d892b72
Arthur authored 1 year ago
```
* fix udop imports

* sort imports
```
4d892b72
Revert-commit 0d52f9f5 (#29455) · 57d007b9
Arthur authored 1 year ago
```
* style

* revert with RP

* nit

* exact revert
```
57d007b9
more fix · 0d52f9f5
Arthur Zucker authored 1 year ago

0d52f9f5
[`UdopTokenizer`] Fix post merge imports (#29451) · 13285220
Arthur authored 1 year ago
```
* update

* ...

* nits

* arf

* 🧼

* beat the last guy

* style everyone
```
13285220

[tests] enable test_pipeline_accelerate_top_p on XPU (#29309) · fa7f3cf3

Fanli Lin authored 1 year ago


* use torch_device

* Update tests/pipelines/test_pipelines_text_generation.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix style

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

fa7f3cf3

[docs] Update starcoder2 paper link (#29418) · ebccb091
Joshua Lochner authored 1 year ago
```
Update starcoder2 paper link
```
ebccb091

Fix max length for BLIP generation (#29296) · bd891aed

Raushan Turganbay authored 1 year ago


* fix mal_length for blip

* update also min length

* fixes

* add a comment

* Update src/transformers/models/instructblip/modeling_instructblip.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/models/blip_2/modeling_blip_2.py

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* make fixup

* fix length when user passed

* remove else

* remove brackets

---------

Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

bd891aed

Exllama kernels support for AWQ models (#28634) · 4fc708f9

Ilyas Moutawwakil authored 1 year ago


* added exllama kernels support for awq models

* doc

* style

* Update src/transformers/modeling_utils.py

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* refactor

* moved exllama post init to after device dispatching

* bump autoawq version

* added exllama test

* style

* configurable exllama kernels

* copy exllama_config from gptq

* moved exllama version check to post init

* moved to quantization dockerfile

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

4fc708f9

FIX [`Generation`] Fix some issues when running the MaxLength criteria on CPU (#29317) · 81c8191b
Younes Belkada authored 1 year ago
```
fix the bitwise or issue
```
81c8191b

04 Mar, 2024 14 commits

[Docs] Spanish Translation -Torchscript md & Trainer md (#29310) · e9476832

njackman-2344 authored 1 year ago

* torchscript and trainer md es translation

* corrected md es files and even corrected spelling in en md

* made es corrections to trainer.md

* deleted entrenamiento... title on yml

* placed entrenamiento in right place

e9476832

Add UDOP (#22940) · 836921fd

NielsRogge authored 1 year ago


* First draft

* More improvements

* More improvements

* More fixes

* Fix copies

* More improvements

* More fixes

* More improvements

* Convert checkpoint

* More improvements, set up tests

* Fix more tests

* Add UdopModel

* More improvements

* Fix equivalence test

* More fixes

* Redesign model

* Extend conversion script

* Use real inputs for conversion script

* Add image processor

* Improve conversion script

* Add UdopTokenizer

* Add fast tokenizer

* Add converter

* Update README's

* Add processor

* Add fully fledged tokenizer

* Add fast tokenizer

* Use processor in conversion script

* Add tokenizer tests

* Fix one more test

* Fix more tests

* Fix tokenizer tests

* Enable fast tokenizer tests

* Fix more tests

* Fix additional_special_tokens of fast tokenizer

* Fix tokenizer tests

* Fix more tests

* Fix equivalence test

* Rename image to pixel_values

* Rename seg_data to bbox

* More renamings

* Remove vis_special_token

* More improvements

* Add docs

* Fix copied from

* Update slow tokenizer

* Update fast tokenizer design

* Make text input optional

* Add first draft of processor tests

* Fix more processor tests

* Fix decoder_start_token_id

* Fix test_initialization

* Add integration test

* More improvements

* Improve processor, add test

* Add more copied from

* Add more copied from

* Add more copied from

* Add more copied from

* Remove print statement

* Update README and auto mapping

* Delete files

* Delete another file

* Remove code

* Fix test

* Fix docs

* Remove asserts

* Add doc tests

* Include UDOP in exotic model tests

* Add expected tesseract decodings

* Add sentencepiece

* Use same design as T5

* Add UdopEncoderModel

* Add UdopEncoderModel to tests

* More fixes

* Fix fast tokenizer

* Fix one more test

* Remove parallelisable attribute

* Fix copies

* Remove legacy file

* Copy from T5Tokenizer

* Fix rebase

* More fixes, copy from T5

* More fixes

* Fix init

* Use ArthurZ/udop for tests

* Make all model tests pass

* Remove UdopForConditionalGeneration from auto mapping

* Fix more tests

* fixups

* more fixups

* fix the tokenizers

* remove un-necessary changes

* nits

* nits

* replace truncate_sequences_boxes with truncate_sequences for fix-copies

* nit current path

* add a test for input ids

* ids that we should get taken from c9f7a32f57440d90ff79890270d376a1cc0acb68

* nits converting

* nits

* apply ruff

* nits

* nits

* style

* fix slow order of addition

* fix udop fast range as well

* fixup

* nits

* Add docstrings

* Fix gradient checkpointing

* Update code examples

* Skip tests

* Update integration test

* Address comment

* Make fixup

* Remove extra ids from tokenizer

* Skip test

* Apply suggestions from code review

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update year

* Address comment

* Address more comments

* Address comments

* Add copied from

* Update CI

* Rename script

* Update model id

* Add AddedToken, skip tests

* Update CI

* Fix doc tests

* Do not use Tesseract for the doc tests

* Remove kwargs

* Add original inputs

* Update casting

* Fix doc test

* Update question

* Update question

* Use LayoutLMv3ImageProcessor

* Update organization

* Improve docs

* Update forward signature

* Make images optional

* Remove deprecated device argument

* Add comment, add add_prefix_space

* More improvements

* Remove kwargs

---------

Co-authored-by: ArthurZucker <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

836921fd

DeformableDETR support bfloat16 (#29232) · ed74d978

Donggeun Yu authored 1 year ago


* Update ms_deform_attn_cuda.cu

* Update ms_deform_attn_cuda.cuh

* Update modeling_deformable_detr.py

* Update src/transformers/models/deformable_detr/modeling_deformable_detr.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update modeling_deformable_detr.py

* python utils/check_copies.py --fix_and_overwrite

* Fix dtype missmatch error

* Update test_modeling_deformable_detr.py

* Update test_modeling_deformable_detr.py

* Update modeling_deformable_detr.py

* Update modeling_deformable_detr.py

* Support DeformableDETR with bfloat16

* Add test code

* Use AT_DISPATCH_FLOATING_TYPES_AND2

Use AT_DISPATCH_FLOATING_TYPES_AND2

* Update tests/models/deformable_detr/test_modeling_deformable_detr.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/models/deformable_detr/test_modeling_deformable_detr.py

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Fix not found require_torch_bf16 function

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

ed74d978

Avoid edge case in audio utils (#28836) · bcd23a54
Yoach Lacombe authored 1 year ago

bcd23a54

Fix grad_norm unserializable tensor log failure (#29212) · 7941769e

Sven Schultze authored 1 year ago

* Fix grad_norm unserializable tensor log failure

* Fix origin of grad_norm logs to be in deepspeed get_global_grad_norm()

7941769e

Fully revert atomic checkpointing (#29370) · 1681a6d4
Zach Mueller authored 1 year ago
```
Fully revert atomic checkpointing
```
1681a6d4

Fix OneFormer `post_process_instance_segmentation` for panoptic tasks (#29304) · 8ef98628

Nick DeGroot authored 1 year ago

*  Fix oneformer instance post processing when using panoptic task type

* 

 Add unit test for oneformer instance post processing panoptic bug

---------

Co-authored-by: Nick DeGroot <1966472+nickthegroot@users.noreply.github.com>

8ef98628

Fix: Fixed the previous tracking URI setting logic to prevent clashes with... · 81220cba

Sean (Seok-Won) Yi authored 1 year ago

Fix: Fixed the previous tracking URI setting logic to prevent clashes with original MLflow code. (#29096)

* Changed logic for setting the tracking URI.

The previous code was calling the `mlflow.set_tracking_uri` function
regardless of whether or not the environment variable
`MLFLOW_TRACKING_URI` is even set. This led to clashes with the original
MLflow implementation and therefore the logic was changed to only
calling the function when the environment variable is explicitly set.

* Check if tracking URI has already been set.

The previous code did not consider the possibility that the tracking URI
may already be set elsewhere and was therefore (erroneously) overriding
previously set tracking URIs using the environment variable.

* Removed redundant parentheses.

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Fix docstring to reflect library convention properly.

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Fix docstring to reflect library convention properly.

"Unset by default" is the correct expression rather than "Default to `None`."

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------

Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

81220cba

Convert SlimSAM checkpoints (#28379) · 5e4b69dc

NielsRogge authored 1 year ago


* First commit

* Improve conversion script

* Convert more checkpoints

* Update src/transformers/models/sam/convert_sam_original_to_hf_format.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Rename file

* More updates

* Update docstring

* Update script

---------

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

5e4b69dc

Workaround for #27758 to avoid ZeroDivisionError (#28756) · c38a1227
Traun Leyden authored 1 year ago

c38a1227

Add mlx support to BatchEncoding.convert_to_tensors (#29406) · 704b3f74

Y4hL authored 1 year ago

* Add mlx support

* Fix import order and use def instead of lambda

* Another fix for ruff format :)

* Add detecting mlx from repr, add is_mlx_array

704b3f74

[Mixtral] Fixes attention masking in the loss (#29363) · 39ef3fb2
Siming Dai authored 1 year ago
```
Fix mixtral load balancing loss

Co-authored-by: dingkunbo <dingkunbo@baidu.com>
```
39ef3fb2

update path to hub files in the error message (#29369) · 38953a75

Poedator authored 1 year ago

update path to hub files

need to add `tree/` to path to files at HF hub.
see example path:
`https://huggingface.co/meta-llama/Llama-2-7b-hf/tree/main`

38953a75

[tests] enable automatic speech recognition pipeline tests on XPU (#29308) · aade711d
Fanli Lin authored 1 year ago
```
* use require_torch_gpu

* enable on XPU
```
aade711d