Conversation
Code Review
This pull request updates the GatedDeltaNet module to support the Megatron-Core modular API by adding an explicit init method and importing GatedDeltaNetSubmodules. Feedback suggests addressing potential backward compatibility issues with the new import and handling runtime errors that occur if the base class falls back to object.
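The backward-compatibility concern above can be sketched as a guarded import: if the new GatedDeltaNetSubmodules symbol is missing in an older Megatron-Core, fail with a clear message at use time rather than letting the base class silently fall back to object. The import path and helper name are illustrative assumptions, not the PR's actual code.

```python
# Hedged sketch: guard the new import so older Megatron-Core versions still
# load this module; raise a descriptive error only when the symbol is needed.
try:
    # Assumed import path for illustration only.
    from megatron.core.ssm.gated_delta_net import GatedDeltaNetSubmodules
    HAVE_GDN_SUBMODULES = True
except ImportError:
    GatedDeltaNetSubmodules = None
    HAVE_GDN_SUBMODULES = False

def require_gdn_submodules():
    """Fail loudly instead of surfacing a confusing AttributeError later."""
    if not HAVE_GDN_SUBMODULES:
        raise RuntimeError(
            "GatedDeltaNetSubmodules is unavailable; upgrade Megatron-Core "
            "to a version that provides the modular GatedDeltaNet API."
        )
    return GatedDeltaNetSubmodules
```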
/gemini review
Code Review
This pull request refactors the GatedDeltaNet implementation by splitting the input projection into two separate modules, in_proj_qkvz and in_proj_ba, and integrates Transformer Engine components like TEColumnParallelLinear and TENorm. It also introduces support for FP8 scale inversion during weight conversion and updates the transformers dependency range. The review feedback identifies several critical issues in the weight conversion logic within gpt_bridge.py, including an AttributeError caused by accessing a deleted attribute, potential KeyErrors from incorrect HuggingFace state dict keys, and a logic error where scale inversion values were overwriting weight tensors instead of being assigned to the correct scale_inv keys.
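The scale-inversion logic error called out above can be illustrated with a small state-dict transform: the inverted scale must be written under its own scale_inv key, not assigned over the weight tensor it describes. Key names and the function are illustrative, not the actual gpt_bridge.py code.

```python
# Hedged sketch of the corrected FP8 conversion step: the quantized weight
# stays under its original key, and the scale inversion is stored under a
# separate "_scale_inv" key (the bug was overwriting the weight instead).
def convert_fp8_entry(state_dict, weight_key, scale):
    out = dict(state_dict)
    out[weight_key] = state_dict[weight_key]        # weight is untouched
    out[weight_key + "_scale_inv"] = 1.0 / scale    # inversion goes to its own key
    return out

sd = {"layers.0.in_proj_qkvz.weight": "fp8-bytes"}
converted = convert_fp8_entry(sd, "layers.0.in_proj_qkvz.weight", 4.0)
```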
…' into support_qwen3_5_fp8
/gemini review
Code Review
This pull request introduces support for decoupled input projections within the GatedDeltaNet module, enabling separate linear layers for QKVZ and BA components. The changes include updates to the model configuration, state dictionary conversion logic in the bridge to handle both standard and LoRA weights, and specific layer specifications for Qwen 3.5 GDN. Critical feedback identifies a potential KeyError in the bridge due to redundant prefix handling and a NameError in the GatedDeltaNet forward pass where a variable is accessed outside its conditional definition scope.
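The NameError pattern flagged in the forward pass reduces to a variable bound only inside a conditional and then read unconditionally. A minimal sketch of the fix, with illustrative names rather than the real GatedDeltaNet code:

```python
# Hedged sketch: bind the variable before the conditional so the later
# read cannot raise NameError on the code path that skips the branch.
def forward(x, use_cache=False):
    cache_state = None          # fix: defined on every path
    if use_cache:
        cache_state = {"x": x}
    # Without the initialization above, this return raises NameError
    # whenever use_cache is False.
    return x, cache_state
```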
/gemini review
Code Review
This pull request introduces support for decoupled input projections in the GatedDeltaNet architecture, specifically for Qwen 3.5 models, including configuration updates and weight conversion logic for LoRA and FP8. Review feedback highlights a potential ImportError due to a top-level import of the optional transformer_engine library and a possible regression in word_embeddings export logic that may affect various models. Additionally, suggestions were provided to replace hardcoded CUDA device references with portable device selection logic to support non-GPU environments.
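Both suggestions above can be sketched together: defer the optional transformer_engine import into a function so that merely importing the module cannot raise ImportError, and select the device portably instead of hardcoding "cuda". The availability flag is passed in here so the sketch stays framework-free; in real code it would come from torch.cuda.is_available().

```python
# Hedged sketch, not the PR's actual code.
def get_te_linear():
    """Return transformer_engine's Linear if installed, else None.

    Importing lazily keeps the optional dependency from breaking module
    import on machines without Transformer Engine.
    """
    try:
        from transformer_engine.pytorch import Linear
        return Linear
    except ImportError:
        return None

def pick_device(cuda_available: bool) -> str:
    # Real code would use: "cuda" if torch.cuda.is_available() else "cpu"
    return "cuda" if cuda_available else "cpu"
```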
No description provided.