zhusg / transformers-new
Commit 75bbfd5b (Unverified)
Authored 1 year ago by Joao Gante; committed by GitHub 1 year ago
Cache: Static cache as a standalone object (#30476)
Parent: 0ae789e0
Showing 20 changed files with 381 additions and 428 deletions:

docs/source/en/internal/generation_utils.md (+1, -0)
docs/source/en/llm_optims.md (+38, -27)
src/transformers/cache_utils.py (+41, -43)
src/transformers/generation/utils.py (+40, -20)
src/transformers/models/cohere/modeling_cohere.py (+24, -47)
src/transformers/models/dbrx/modeling_dbrx.py (+75, -82)
src/transformers/models/gemma/modeling_gemma.py (+31, -47)
src/transformers/models/jamba/modeling_jamba.py (+2, -2)
src/transformers/models/llama/modeling_llama.py (+33, -55)
src/transformers/models/mistral/modeling_mistral.py (+1, -1)
src/transformers/models/mixtral/modeling_mixtral.py (+1, -1)
src/transformers/models/olmo/modeling_olmo.py (+24, -52)
src/transformers/models/persimmon/modeling_persimmon.py (+1, -1)
src/transformers/models/phi/modeling_phi.py (+1, -1)
src/transformers/models/phi3/modeling_phi3.py (+1, -1)
src/transformers/models/qwen2_moe/modeling_qwen2_moe.py (+1, -1)
src/transformers/models/stablelm/modeling_stablelm.py (+1, -1)
src/transformers/models/starcoder2/modeling_starcoder2.py (+1, -1)
tests/models/llama/test_modeling_llama.py (+40, -40)
tests/quantization/aqlm_integration/test_aqlm.py (+24, -5)