Skip to content

Fix qwen35 dp#4535

Open
grimoire wants to merge 2 commits intoInternLM:mainfrom
grimoire:fix-qwen35-tp
Open

Fix qwen35 dp#4535
grimoire wants to merge 2 commits intoInternLM:mainfrom
grimoire:fix-qwen35-tp

Conversation

@grimoire
Copy link
Copy Markdown
Collaborator

dp/cudagraph might padding state_ids with -1, which would be clamp to 0 in model.

0 is reserved state for dummy inputs, multiple dummy inputs might write to the same state, leads to invalid output (nan/inf).

@grimoire grimoire requested a review from yao-fengchen April 18, 2026 08:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant