Skip to content
GitLab
Projects
Groups
Topics
Snippets
/
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
内存不够脑子凑
project59内存受限环境的大语言模型推理优化
Repository
Branches
Overview
Active
Stale
All
main
default
protected
287126a7
·
docs: 修正第九节仓库目录结构,对齐实际core-code目录
·
Jun 30, 2026
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
ahkvm
merged
940bfb16
·
do reduction of codes
·
Jun 27, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
FlexInfer-v4.0
aa297ab6
·
doc: add sophomore grade to team members on cover page
·
Jun 18, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
sharegpt-cgroup-latency-bench
c9d73d06
·
scripts: add native macOS ShareGPT benchmark runner
·
Jun 11, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
lazy-kvcache
62fee6c2
·
add source code list
·
Jun 08, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
FlexInfer-v3.0
merged
501ce8f4
·
Asynchronous Prefetching with layer streaming
·
Jun 07, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
FlexInfer-v2.0
merged
b8d2a753
·
do prefetch works
·
Jun 05, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
zbm-flexinfer-experiments
dc3e883b
·
Validate runtime prefetch eviction under memory limits
·
May 28, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
FlexInfer-v1.0
2b3789ad
·
new feature: based on paper FlexInfer
·
May 21, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar
measure
merged
7c3169af
·
add a python measure script
·
May 17, 2026
Compare
Select Archive Format
Download source code
zip
tar.gz
tar.bz2
tar