[model] Support bailing v2 5 by Jintao-Huang · Pull Request #85 · modelscope/mcore-bridge

Jintao-Huang · 2026-05-20T02:13:19Z

No description provided.

gemini-code-assist

Code Review

This pull request introduces support for the bailing_hybrid model, including its configuration mapping and a specialized loader that handles hybrid attention layers. Review feedback highlights the need for safer and more efficient logic when retrieving transformer layer specifications, specifically recommending a try...finally block to ensure configuration state is restored. Additionally, it was suggested to remove redundant method overrides in the LinearAttention class and clean up several unused imports.

gemini-code-assist · 2026-05-20T02:16:19Z

+        layer_specs = super().get_transformer_layer_spec(vp_stage=vp_stage)
+        multi_latent_attention = self.config.multi_latent_attention
+        self.config.multi_latent_attention = False
+        linear_layer_specs = super().get_transformer_layer_spec(vp_stage=vp_stage)
+        self.config.multi_latent_attention = multi_latent_attention


The current implementation for getting linear_layer_specs by temporarily modifying self.config.multi_latent_attention has a couple of issues:

Safety: If super().get_transformer_layer_spec() raises an exception, self.config.multi_latent_attention will not be restored to its original value. This could lead to unexpected behavior in subsequent operations. Using a try...finally block is recommended for safety.

Efficiency: super().get_transformer_layer_spec() is called twice. If self.config.multi_latent_attention is False to begin with, both calls are identical, which is redundant and inefficient.

Consider refactoring this logic to be safer and more efficient.

Suggested change

layer_specs = super().get_transformer_layer_spec(vp_stage=vp_stage)

multi_latent_attention = self.config.multi_latent_attention

self.config.multi_latent_attention = False

linear_layer_specs = super().get_transformer_layer_spec(vp_stage=vp_stage)

self.config.multi_latent_attention = multi_latent_attention

multi_latent_attention = self.config.multi_latent_attention

if multi_latent_attention:

layer_specs = super().get_transformer_layer_spec(vp_stage=vp_stage)

try:

self.config.multi_latent_attention = False

linear_layer_specs = super().get_transformer_layer_spec(vp_stage=vp_stage)

finally:

self.config.multi_latent_attention = multi_latent_attention

else:

layer_specs = super().get_transformer_layer_spec(vp_stage=vp_stage)

linear_layer_specs = layer_specs

gemini-code-assist · 2026-05-20T02:16:19Z

+class LinearAttention(SelfAttention):
+    def __init__(
+        self,
+        config: TransformerConfig,
+        *args, **kwargs,
+    ):
+        super().__init__(config, *args, **kwargs)
+
+    def forward(
+        self,
+        hidden_states: Tensor,
+        attention_mask: Tensor,
+        **kwargs,
+    ) -> Tuple[Tensor, Tensor]:
+        return super().forward(hidden_states, attention_mask, **kwargs)


The __init__ and forward methods in the LinearAttention class are redundant as they just call the superclass methods with the same arguments. You can remove them for cleaner and more concise code.

After this change, TransformerConfig and Tuple will become unused imports and should also be removed, along with other unused imports in this file (BaseInferenceContext, PackedSeqParams, Union, and SelfAttentionSubmodules).

class LinearAttention(SelfAttention): pass

Jintao-Huang added 3 commits May 19, 2026 21:30

support bailing v2_5

c327356

update

d86b366

update

558b4d4

gemini-code-assist Bot reviewed May 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[model] Support bailing v2 5#85

[model] Support bailing v2 5#85
Jintao-Huang wants to merge 3 commits into
modelscope:mainfrom
Jintao-Huang:support_bailing_v2_5

Jintao-Huang commented May 20, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 20, 2026

Uh oh!

gemini-code-assist Bot May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Jintao-Huang commented May 20, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant