[bugfix] fix qwen3.5 gpt_bridge lora #28
Conversation
Code Review
This pull request updates the weight loading and exporting logic in gpt_bridge.py to support interleaved layouts for linear attention components, specifically affecting QKV, Z, B, and A projections. The changes include refactoring dimension calculations and updating tensor reshaping and splitting logic. Feedback identifies a critical issue where FP8 scale factors (in_scale_inv) are not interleaved, which will lead to incorrect scaling in Megatron-Core. Additionally, the reviewer suggests adding safeguards against ZeroDivisionError when num_key_heads is zero and handling edge cases in tensor slicing when a_dim is zero to prevent incorrect weight exports.
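A minimal sketch of the interleaving concern raised above: when per-head blocks of a packed projection are reordered into an interleaved layout, the per-row FP8 scale factors must be reordered with them, and a zero head count should be rejected before any division. All names (`interleave_linear_attn`, the K/V split) are illustrative, not the actual `gpt_bridge.py` API.

```python
import numpy as np

def interleave_linear_attn(weight, scale_inv, num_key_heads, key_dim, value_dim):
    """Hypothetical sketch: reorder [all-K, all-V] rows into per-head
    interleaved [K_h0, V_h0, K_h1, V_h1, ...] blocks, applying the SAME
    reordering to the per-row FP8 scale factors (scale_inv)."""
    if num_key_heads == 0:
        # Guard against ZeroDivisionError for models with no key heads.
        raise ValueError("num_key_heads must be > 0")
    k_per = key_dim // num_key_heads
    v_per = value_dim // num_key_heads
    # Split the packed projection into K and V, then group rows by head.
    k = weight[:key_dim].reshape(num_key_heads, k_per, -1)
    v = weight[key_dim:key_dim + value_dim].reshape(num_key_heads, v_per, -1)
    out = np.concatenate([k, v], axis=1).reshape(key_dim + value_dim, -1)
    # Reorder the scale factors identically; leaving them in the original
    # order would scale the wrong rows after interleaving.
    sk = scale_inv[:key_dim].reshape(num_key_heads, k_per)
    sv = scale_inv[key_dim:key_dim + value_dim].reshape(num_key_heads, v_per)
    s_out = np.concatenate([sk, sv], axis=1).reshape(-1)
    return out, s_out

# Tiny demo: 2 heads, 1 K row and 1 V row per head.
w = np.arange(12.0).reshape(4, 3)        # rows: k0, k1, v0, v1
s = np.array([1.0, 2.0, 3.0, 4.0])       # one scale per row
w_i, s_i = interleave_linear_attn(w, s, num_key_heads=2, key_dim=2, value_dim=2)
# rows become k0, v0, k1, v1 and scales follow: [1, 3, 2, 4]
```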
/gemini review
Code Review
This pull request refactors the weight conversion logic in gpt_bridge.py to support an interleaved layout for linear attention, specifically adjusting how key and value dimensions are handled across multiple heads for both standard and LoRA weights. The review identified a critical NameError where in_scale_inv is referenced at line 1271 despite its definition being removed in the same patch. Additionally, the reviewer noted that scale_inv has become an unused variable following the removal of FP8 scale export logic and suggested using an underscore to ignore the return value.
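The unused-variable suggestion above follows the common Python idiom of binding an ignored return value to an underscore. The helper below is purely illustrative (the real function and its return shape live in `gpt_bridge.py`):

```python
def split_weight_and_scale(packed):
    """Hypothetical helper returning (weight_rows, scale_inv)."""
    return packed[:-1], packed[-1]

# If the FP8 scale export logic is removed, bind the now-unused
# scale_inv to `_` instead of a named variable:
weight, _ = split_weight_and_scale([1.0, 2.0, 0.5])
```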
/gemini review
Code Review
This pull request refactors the weight loading and export logic in _set_linear_attn_state by adjusting the calculation of key_dim and value_dim and updating tensor splitting and reshaping operations to account for the number of key heads. However, the changes remove the handling of scale inversion factors (hf_scale_inv and scale_inv), which are essential for maintaining FP8 quantization support. It is recommended to restore this logic to ensure compatibility with quantized models during both weight loading and export.
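Why dropping the scale inversion factors breaks FP8 support, in one sketch: a quantized weight is only meaningful alongside its `scale_inv`, since dequantization multiplies the low-precision values back by that factor. The function name and int8 stand-in for FP8 are assumptions for illustration:

```python
import numpy as np

def dequantize_fp8(weight_q, scale_inv):
    """Hedged sketch: recover high-precision weights from a quantized tensor.
    If scale_inv is dropped during load/export, this step is impossible and
    the exported weights are silently wrong."""
    return weight_q.astype(np.float32) * scale_inv

# int8 stands in for FP8 here; the scale factor is per-tensor for simplicity.
w_q = np.array([2, 4, -6], dtype=np.int8)
w_hp = dequantize_fp8(w_q, scale_inv=0.5)   # [1.0, 2.0, -3.0]
```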
No description provided.