Commits · test-seentok · 某某某 / transformers-new

21 Nov, 2024 4 commits
- nits · d5756709
  Arthur Zucker authored 7 months ago
  
  d5756709
- fix · b96810d8
  Arthur Zucker authored 7 months ago
  
  b96810d8
- utiles? · 0eb34435
  Arthur Zucker authored 7 months ago
  
  0eb34435
- test · a8093c65
  Arthur Zucker authored 7 months ago
  
  a8093c65
18 Nov, 2024 15 commits

Yih-Dar authored 7 months ago

* Revert "Revert "Fix Whisper CI" (#34605)"

This reverts commit 74d3824c

.

* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

eed11f34

Allow handling files as args for a tool created with Tool.from_space (#34687) · 759a378e
Aymeric Roucher authored 7 months ago
```
* Allow handling files as args for a tool created with `Tool.from_space`
```
759a378e

Simplify Tensor Parallel implementation with PyTorch TP (#34184) · 20142ab5

Ke Wen authored 7 months ago

* Simplify Tensor Parallel implementation with PyTorch TP

* Move tp_plan to config

* Lint

* Format and warning

* Disable copy-from check

* Conditionally get attr from config

* make fix-copies

* Move base_model_tp_plan to PretrainedConfig

* Move TP into from_pretrained

* Add device context for load

* Do not serialize

* Move _tp_plan setting to post_init

* Add has_tp_plan

* Add test_tp

* Add 'Multi-gpu inference' doc

* Add backward support for device type identification

* Auto-detect accelerator

* supports_tp_plan

* copyright year

* Fix copy

20142ab5

fix: Wrong task mentioned in docs (#34757) · 7df93d6f
ecyht2 authored 7 months ago

7df93d6f
Fix callback key name (#34762) · 7693b622
Hun-soo Jung authored 7 months ago
```
Fixes typo.
```
7693b622
fix: Update pixel_values parameter in hf_model input (#34782) · 1ef6c5f1
Eon Kim authored 7 months ago

1ef6c5f1
[tests] add XPU part to testing (#34778) · e80a65ba
Fanli Lin authored 7 months ago
```
add XPU part to testing

Signed-off-by: Lin, Fanli <fanli.lin@intel.com>
```
e80a65ba
[docs] add XPU besides CUDA, MPS etc. (#34777) · 9568a9df
Fanli Lin authored 7 months ago
```
add XPU
```
9568a9df
[docs] make `empty_cache` device-agnostic (#34774) · 8568bf1b
Fanli Lin authored 7 months ago
```
make device-agnostic
```
8568bf1b
make sure to disable gradients for integer tensor (#32943) · 36759f33
Wing Lian authored 7 months ago

36759f33

Fix skip of test_training_gradient_checkpointing (#34723) · 1c471fc3

Dmitry Rogozhkin authored 7 months ago

19d58d31 has introduced a context manager to manage subtests of
test_training_gradient_checkpointing. However, test body was not
moved under "with" statement. Thus, while tests are correctly
marked as skipped, test bodies were still executed. In some cases,
as with llama this caused attribute errors.

Fixes: #34722
Fixes: 19d58d31

 ("Add MLLama (#33703)")

Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>

1c471fc3

fix a typo bug where 'id2label' was incorrectly written as 'i2label' when reading config (#34637) · c772d4d9
ZuoChen_BUPT authored 7 months ago
```
fix a bug where 'id2label' was incorrectly written as 'i2label' when reading the config from pretrained config
```
c772d4d9
Fix broken link (#34618) · eb0ab3ed
Ofek Lev authored 7 months ago

eb0ab3ed
VLMs: `patch_size` -> `num_image_tokens` in processing (#33424) · 1646ffb4
Raushan Turganbay authored 7 months ago
```
* use num additional tokens

* fix copies + docs

* another fix copies :)

* add docs

* move order for BC
```
1646ffb4

Add OLMo November 2024 (#34551) · 3ee24e22

Shane A authored 7 months ago

* Add model skeletion with transformers-cli add-new-model-like

* Convert config to modular, add rms_norm_eps, delete clip_qkv

* Convert model to modular, add RMSNorm

* Add flash attention with qk norm and no qkv clipping

* Add decoder layer with RMSNorm after attention/feedforward layers

* Add base and causal model

* Add converter improvements from OLMo repo

* Update weight loading in OLMo to HF converter

* Set correct default for rms_norm_eps

* Set correct pipeline_model_mapping in test

* Run make fixup

* Fix model type

* Re-run modular conversion

* Manually set config docs to fix build errors

* Convert olmo-1124 to olmo_1124 to fix flash attention docs errors

* Start updating tests

* Update tests

* Copy upstream test_eager_matches_sdpa_inference_1_bfloat16 changes to olmo_1124

* Rename input_layernorm and post_attention_layernorm to reflect their ops better

* Use correct tokenizer

* Remove test unsupported by GPT2 tokenizer

* Create GenerationConfig outside of from_pretrained call

* Use simpler init file structure

* Add explicit __all__ to support simplified init

* Make safetensor serialization the default

* Update OLMo November 2024 docs

3ee24e22

15 Nov, 2024 7 commits

🧼 remove v4.44 deprecations (#34245) · 13493215

Joao Gante authored 7 months ago


* remove v4.44 deprecations

* PR comments

* deprecations scheduled for v4.50

* hub version update

* make fiuxp

---------

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

13493215

Remove FSDP wrapping from sub-models. (#34452) · 8d50fda6

AbdelKarim ELJANDOUBI authored 7 months ago

* Remove FSDP wrapping from sub-models.

* solve conflict trainer.py

* make fixup

* add unit test for fsdp_auto_wrap_policy when using auto_find_batch_size

* put back extract_model_from_parallel

* use transformers unwrap_model

8d50fda6

FSDP grad accum fix (#34645) · b0c0ba7b

Wing Lian authored 7 months ago

* add gradient accumulation steps tests for fsdp

* invert no_sync context to fix training for fsdp

b0c0ba7b

add xpu path for awq (#34712) · 52ea4aa5
jiqing-feng authored 7 months ago
```
* add xpu path for awq

* update readme
```
52ea4aa5
fix(wandb): pass fake dataset to avoid exception in trainer (see #34455) (#34720) · 7b3d615b
CezaPasc authored 7 months ago

7b3d615b
Update llava.md (#34749) · f5dbfab7
Lysandre Debut authored 7 months ago
```
LLava -> Llava
```
f5dbfab7

Retain newlines in chat template when `continue_final_message=True` (#34253) · 8ba3e150

lewtun authored 7 months ago


* Retain newlines in chat template when

* Add try/except

* Add regression test

* Simplify test

* Apply suggestions from code review

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

---------

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

8ba3e150

13 Nov, 2024 4 commits

[docs] add xpu device check (#34684) · a3d69a89

Fanli Lin authored 7 months ago


* add XPU path

* use accelerate API

* Update docs/source/en/tasks/semantic_segmentation.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* update more places with accelerate API

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

a3d69a89

Fix example in EsmConfig docstring (#34653) · 68f8186a
Xiao Yuan authored 7 months ago

68f8186a
[docs] Broken link in generation_strategies (#34717) · e7c36a9d
Pedro Cuenca authored 7 months ago
```
[docs] Broken link
```
e7c36a9d
🌐 [i18n-KO] Translated marian.md to Korean (#34698) · be8748a5
MaCAT authored 7 months ago
```
* initial translation

* removed english

* Fixed Trivial Typos, updated _toctree.yml
```
be8748a5

11 Nov, 2024 3 commits

Agents: Small fixes in streaming to gradio + add tests (#34549) · 33eef992
Aymeric Roucher authored 7 months ago
```
* Better support transformers.agents in gradio: small fixes and additional tests
```
33eef992

[i18n-ar] Translated file : `docs/source/ar/torchscript.md` into Arabic (#33079) · 6de2a4d1

Ahmed Almaghz authored 7 months ago


* Add docs/source/ar/torchscript.md to Add_docs_source_ar_torchscript.md

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/torchscript.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Merge troubleshooting.md with this Branch

* Update _toctree.yml

* Update torchscript.md

* Update troubleshooting.md

---------

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

6de2a4d1

[docs] update not-working model revision (#34682) · 25f510a9
Fanli Lin authored 7 months ago
```
update revision
```
25f510a9

10 Nov, 2024 2 commits

Agents: turn any Space into a Tool with `Tool.from_space()` (#34561) · 3ea3ab62
Aymeric Roucher authored 7 months ago
```
* Agents: you can now load a Space as a tool
```
3ea3ab62

Update llm_engine.py (#33332) · 134ba90d

Louis Brulé Naudet authored 7 months ago

* Update llm_engine.py
- Added support for optional token and max_tokens parameters in the constructor.
- Provided usage examples and detailed documentation for each method.

134ba90d

09 Nov, 2024 1 commit

[i18n-ar] Translated file : `docs/source/ar/trainer.md` into Arabic (#33080) · 768f3c01

Ahmed Almaghz authored 7 months ago


* Add docs/source/ar/trainer.md to Add_docs_source_ar_trainer.md

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>

* Update trainer.md

* Update trainer.md

* Update trainer.md

* Create _toctree.yml

* Delete docs/source/ar/_toctree.yml

* Update _toctree.yml - add trainer

* Update _toctree.yml

* merge serialization.md into this branch

* merge sagemaker.md into this PR

* Update _toctree.yml

* Update docs/source/ar/trainer.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ar/trainer.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------

Co-authored-by: Abdullah Mohammed <554032+abodacs@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

768f3c01

08 Nov, 2024 1 commit

🌐

[i18n-KO] Translated bert.md to Korean (#34627) · a06a0d12

MaCAT authored 7 months ago


* Translated bert.md, Need additional check

* Translation 2nd ver, changed _toctree.yml

* Fixed Typo

* Update bert.md

Co-authored-by: YONGSANG <71686691+4N3MONE@users.noreply.github.com>

* Update bert.md

Co-authored-by: YONGSANG <71686691+4N3MONE@users.noreply.github.com>

* Update bert.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update bert.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------

Co-authored-by: YONGSANG <71686691+4N3MONE@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

a06a0d12

07 Nov, 2024 2 commits
- 🌐 [i18n-KO] Translated `timesformer.md` to Korean (#33972) · 1cf17077
  Jiwook Han authored 7 months ago
```
* docs: ko: model_doc/timesformer.md

* feat: nmt draft

* fix: manual edits

* fix_toctree

* fix toctree on Video Models
```
  1cf17077
- fix(dvclive): pass fake dataset to avoid exception in trainer init (#34455) · 6938524a
  Ivan Shcheklein authored 7 months ago
```
fix(dvclive): pass fake dataset to avoid exception in trainer
```
  6938524a
05 Nov, 2024 1 commit
- 🌐 [i18n-KO] Translated `convbert.md` to Korean (#34599) · 7bbc6247
  Ahnjj_DEV authored 7 months ago
```
* docs: ko: convbert.md

* Update _toctree.yml

* feat: nmt draft
```
  7bbc6247