Compatible with transformers 5.0 at TurboMind side #4304

lvhan028 wants to merge 1 commit into InternLM:main
Conversation
Pull request overview
This PR aims to make lmdeploy compatible with transformers 5.0 at the TurboMind side. Transformers 5.0 introduced a breaking change where rope-related parameters (including rope_theta and rope_scaling) are now aggregated into a single rope_parameters dictionary in the model configuration.
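For illustration, the two config layouts look roughly like this (the scaling keys shown are assumptions for a concrete example; the actual keys depend on the rope type):

```python
# transformers < 5.0: rope fields sit at the top level of the model config
legacy_config = {
    'rope_theta': 10000.0,
    'rope_scaling': {'rope_type': 'linear', 'factor': 2.0},  # assumed example keys
}

# transformers >= 5.0: rope fields are aggregated under 'rope_parameters'
new_config = {
    'rope_parameters': {
        'rope_theta': 10000.0,
        'rope_type': 'linear',  # assumed example keys
        'factor': 2.0,
    },
}
```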
Changes:
- Removed unused utility functions from `lmdeploy/vl/model/utils.py`, including weight loading helpers and import hacks
- Added support for the new `rope_parameters` structure in transformers 5.0 for LlamaModel while maintaining backward compatibility with older versions (see the sketch below)
- Added similar `rope_parameters` support for DeepSeek2Model
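A minimal sketch of that backward-compatible extraction (the helper name and return shape are illustrative, not lmdeploy's actual API; `model_arg` is the parsed model config dict, as in the diff quoted below):

```python
def read_rope_params(model_arg: dict):
    """Pull rope_theta and rope_scaling out of a model config dict,
    handling both the transformers 5.0 layout and the legacy one."""
    if 'rope_parameters' in model_arg:
        # transformers >= 5.0: everything lives under 'rope_parameters'
        rope_scaling = model_arg['rope_parameters']
        rope_theta = float(rope_scaling.get('rope_theta', 10000.0))
    else:
        # transformers < 5.0: separate top-level fields
        rope_theta = float(model_arg.get('rope_theta', 10000.0))
        rope_scaling = model_arg.get('rope_scaling', None)
    return rope_theta, rope_scaling
```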
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| `lmdeploy/vl/model/utils.py` | Cleaned up by removing unused functions: `load_weight_ckpt`, `get_used_weight_files`, `load_model_from_weight_files`, `add_sys_path`, and `hack_import_with` |
| `lmdeploy/turbomind/deploy/source_model/llama.py` | Added conditional logic to check for `rope_parameters` (transformers 5.0) and extract `rope_theta` and `rope_scaling` from it, with a fallback to the old structure for backward compatibility |
| `lmdeploy/turbomind/deploy/source_model/deepseek2.py` | Added similar conditional logic to handle `rope_parameters` for DeepSeek2 models |
```python
if 'rope_parameters' in model_arg:
    # transformers v5.0.0 aggregates all rope-related parameters into 'rope_parameters'
    rope_scaling = model_arg['rope_parameters']
    rope_theta = rope_scaling.get('rope_theta', 10000.0)
```
The `rope_theta` value retrieved from `rope_parameters` is not explicitly converted to float, unlike line 140, which uses `float(model_arg.get('rope_theta', 10000.0))`. For consistency and type safety, this should be: `rope_theta = float(rope_scaling.get('rope_theta', 10000.0))`.
Suggested change:

```diff
-    rope_theta = rope_scaling.get('rope_theta', 10000.0)
+    rope_theta = float(rope_scaling.get('rope_theta', 10000.0))
```
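The cast matters because JSON-loaded configs may carry `rope_theta` as an int; a quick, self-contained illustration:

```python
import json

cfg = json.loads('{"rope_parameters": {"rope_theta": 10000}}')
theta = cfg['rope_parameters'].get('rope_theta', 10000.0)
print(type(theta))         # <class 'int'> -- downstream code expecting float may misbehave
print(type(float(theta)))  # <class 'float'> -- the explicit cast keeps the type consistent
```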
RunningLeon left a comment:
v5 broke the VL model:
```
lmdeploy/lmdeploy/vl/model/utils.py", line 12, in <module>
    from transformers.utils import (SAFE_WEIGHTS_INDEX_NAME, SAFE_WEIGHTS_NAME, WEIGHTS_INDEX_NAME, WEIGHTS_NAME,
ImportError: cannot import name 'is_safetensors_available' from 'transformers.utils' (/nvme1/xxx/miniconda3/envs/py310/lib/python3.10/site-packages/transformers/utils/__init__.py)
```
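The PR's actual fix is to delete the dead code in `utils.py` that pulls in this import. For context only, a caller that still needed `is_safetensors_available` could check for the package directly (a sketch, not part of this PR):

```python
import importlib.util

def is_safetensors_available() -> bool:
    # Stand-in for the helper removed from transformers.utils in v5:
    # report whether the safetensors package is importable.
    return importlib.util.find_spec('safetensors') is not None
```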