InternLM / lmdeploy Public

Pull requests: InternLM/lmdeploy

Labels 34 Milestones 0

53 Open 2,017 Closed

Pull requests list

ci(lint): skip flaky deadlink test for python wiki page

#4357 opened Feb 13, 2026 by windreamer

build(docker): skip FA2 when use cu13

#4356 opened Feb 12, 2026 by windreamer

support glm5

#4355 opened Feb 12, 2026 by grimoire

Improve proxy server (label: improvement)

#4354 opened Feb 12, 2026 by lvhan028

[WIP] Qwen3.5

#4351 opened Feb 11, 2026 by grimoire • Draft

bump version to v0.12.1

#4350 opened Feb 11, 2026 by lvhan028

fix: add clear_grammar to remove grammar from reused model_request

#4349 opened Feb 11, 2026 by windreamer

[WIP]: support glm4.7 with mtp (label: WIP)

#4346 opened Feb 10, 2026 by RunningLeon • Draft

Support MiniMax-M2 in TurboMind engine

#4343 opened Feb 10, 2026 by zh-nj

Fix authorization (label: Bug:P1)

#4338 opened Feb 9, 2026 by lvhan028

[WIP]Support torch compile

#4336 opened Feb 8, 2026 by grimoire • Draft

add preliminary support for EP(single-node) of turbomind backend

#4332 opened Feb 6, 2026 by irexyc

Qwen/Internlm/Llama Dense/Moe model fp8 quant online (label: enhancement)

#4324 opened Feb 5, 2026 by 43758726

Compatible with transformers 5.0 at TurboMind side (label: improvement)

#4304 opened Jan 28, 2026 by lvhan028

change ascend paged attention from BSH format to TND format for better performance

#4295 opened Jan 27, 2026 by jinminxi104 • Draft

return BadRequest for all invalid inputs (label: Bug:P2)

#4291 opened Jan 26, 2026 by lvhan028

support repetition ngram logits processor (label: enhancement)

#4288 opened Jan 23, 2026 by grimoire

fix dllm mask on set_step

#4278 opened Jan 18, 2026 by grimoire

[ascend] fix awq and smoothq

#4277 opened Jan 16, 2026 by wanfengcxz • Draft

test: add mixing guided and non-guided tests

#4267 opened Jan 12, 2026 by windreamer

Update benchmark serving script for proxy_server

#4173 opened Dec 1, 2025 by lvhan028

[WIP]: Support prefix caching with routed experts

#4171 opened Nov 28, 2025 by RunningLeon • Draft

Support fp32 head for qwen and internlm models (label: improvement)

#4160 opened Nov 27, 2025 by RunningLeon

fix: fix lora weight loading for internvl

#4106 opened Nov 6, 2025 by windreamer • Draft

Update installation.md

#4095 opened Nov 3, 2025 by krescent
