InternLMLearning

全链条开放体系 | 部署

大模型特点
- 内存开销巨大：本身参数量大、采用自回归生成token，需要缓存k/v
- 动态shape：单个token * length，length不定，请求数不固定
- 模型结构是Decoder only ![
技术挑战
- 设备：低存储设备的部署（消费级显卡、移动端）如何部署？
- 推理：加速token的生成熟读、动态shape的推理、如何有效管理和利用内存
- 服务：提高整体吞吐量，降低请求的平均响应时间
部署方案
- 模型并行，低比特量化，Attention优化，计算和访存优化，Continuous Batching

...（完整笔记见下文）

InternLM Lesson-1

https://g4w90egc30.feishu.cn/docx/IJLxd5TnyoQTwkxZ1tJcLajAnQe?from=from_copylink

InternLM lesson-2 web_demo部署 + Agent Prompt与运行逻辑研读

https://g4w90egc30.feishu.cn/docx/KIw4ddOrQoIeWCxJMjbciXb8nbf?from=from_copylink

Lagent源码阅读 ReAct、Rewoo、AutoGPT

https://g4w90egc30.feishu.cn/docx/QhDLdN5BUou8sAxXJ2rccOotntd?from=from_copylink

InternLM Lesson-3 简单RAG

https://g4w90egc30.feishu.cn/docx/LbIydMCbYoF7YsxI8egcAIznnwe?from=from_copylink

InternLM Lesson-4

https://g4w90egc30.feishu.cn/docx/K8qodGQ27ojqWrxJxIycyin6nRg?from=from_copylink

InternLM Lesson-5 量化部署

https://g4w90egc30.feishu.cn/docx/S7lPdG6F5ojQcxxFhQ1cYSC6nnh?from=from_copylink

InternLM Lesson-6 Opencompass评测

https://g4w90egc30.feishu.cn/docx/ZOJZdv1duotLW4x0Nv8cWpyVnEb?from=from_copylink

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
InternLM Lesson-1		InternLM Lesson-1
InternLM Lesson-4		InternLM Lesson-4
InternLM Lesson-5		InternLM Lesson-5
InternLM Lesson-6		InternLM Lesson-6
InternLM lesson-2笔记+作业+源码阅读		InternLM lesson-2笔记+作业+源码阅读
InternLM lesson-3		InternLM lesson-3
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

InternLMLearning

全链条开放体系 | 部署

InternLM Lesson-1

InternLM lesson-2 web_demo部署 + Agent Prompt与运行逻辑研读

Lagent源码阅读 ReAct、Rewoo、AutoGPT

InternLM Lesson-3 简单RAG

InternLM Lesson-4

InternLM Lesson-5 量化部署

InternLM Lesson-6 Opencompass评测

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

InternLMLearning

全链条开放体系 | 部署

InternLM Lesson-1

InternLM lesson-2 web_demo部署 + Agent Prompt与运行逻辑研读

Lagent源码阅读 ReAct、Rewoo、AutoGPT

InternLM Lesson-3 简单RAG

InternLM Lesson-4

InternLM Lesson-5 量化部署

InternLM Lesson-6 Opencompass评测

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages