Skip to content
GitLab
Projects
Groups
Topics
Snippets
/
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
educg-net-43734-3136859
project59内存受限环境的大语言模型推理优化-3549
Repository
Branches
Overview
Active
Stale
All
FlexInfer-v4.0
aa297ab6
·
doc: add sophomore grade to team members on cover page
·
Jun 18, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
ahkvm
917bad1f
·
complete ahkvm-v1.0
·
Jun 15, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
sharegpt-cgroup-latency-bench
c9d73d06
·
scripts: add native macOS ShareGPT benchmark runner
·
Jun 11, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
main
default
protected
0c1bb0b8
·
docs: add Qwen3 memory optimization presentation
·
Jun 11, 2026
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
lazy-kvcache
62fee6c2
·
add source code list
·
Jun 08, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
FlexInfer-v3.0
merged
501ce8f4
·
Asynchronous Prefetching with layer streaming
·
Jun 07, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
FlexInfer-v2.0
merged
b8d2a753
·
do prefetch works
·
Jun 05, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
zbm-flexinfer-experiments
dc3e883b
·
Validate runtime prefetch eviction under memory limits
·
May 28, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
FlexInfer-v1.0
2b3789ad
·
new feature: based on paper FlexInfer
·
May 21, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
measure
merged
7c3169af
·
add a python measure script
·
May 17, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar