[Feature]: return routed experts to reuse #4090

RunningLeon · 2025-10-31T03:25:04Z

Motivation

Please describe the motivation of this PR and the goal you want to achieve through this PR.

Modification

Cache the routed expert IDs during generation and return them when a sequence finishes.
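To illustrate the caching idea described above, here is a minimal, self-contained sketch. It is not the code in this PR: the `RoutedExpertCache` class, its method names, and the `[num_layers][top_k]` layout of each step are assumptions made up for illustration. It only shows the pattern of accumulating expert IDs per sequence, copying them out of a reusable buffer, and handing them back once the sequence finishes.

```python
from collections import defaultdict
from typing import Dict, List

# Hypothetical helper for illustration only; not lmdeploy's implementation.
Step = List[List[int]]  # assumed layout: [num_layers][top_k] expert IDs


class RoutedExpertCache:
    """Accumulate per-step routed expert IDs for each running sequence."""

    def __init__(self) -> None:
        # seq_id -> list of recorded steps
        self._steps: Dict[int, List[Step]] = defaultdict(list)

    def append(self, seq_id: int, step_experts: Step) -> None:
        # Copy the data so that later in-place reuse of the source buffer
        # cannot change what has already been cached.
        self._steps[seq_id].append([list(layer) for layer in step_experts])

    def finish(self, seq_id: int) -> List[Step]:
        # Return everything recorded for the sequence and drop its entry.
        return self._steps.pop(seq_id, [])


cache = RoutedExpertCache()
buf = [[0, 3], [1, 2]]   # 2 layers, top-2 experts per layer
cache.append(7, buf)
buf[0][0] = 5            # the buffer is overwritten by the next step
cache.append(7, buf)
result = cache.finish(7)
print(result)
```

The copy in `append` plays the same role as cloning out of a reused buffer: without it, every cached step would alias the same list and show only the latest values.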

Use cases

from lmdeploy import pipeline, GenerationConfig, PytorchEngineConfig

if __name__ == '__main__':
    # Enable routed-expert tracking in the engine
    backend_config = PytorchEngineConfig(tp=1, enable_return_routed_experts=True)
    # One long prompt and one short prompt
    prompts = ['Hello who are you?' * 1024, 'Can you write a poem about AI?']
    # Request the routed experts for each generation
    gen_config = GenerationConfig(return_routed_experts=True)
    model_path = 'Qwen/Qwen3-30B-A3B'
    pipe = pipeline(model_path, backend_config=backend_config)
    responses = pipe(prompts, gen_config=gen_config)
    for res in responses:
        print(res)

BC-breaking (Optional)

Does the modification introduce changes that break the backward-compatibility of the downstream repositories?
If so, please describe how it breaks the compatibility and how the downstream projects should modify their code to keep compatibility with this PR.

Checklist

Pre-commit or other linting tools are used to fix the potential lint issues.
The modification is covered by complete unit tests. If not, please add more unit tests to ensure the correctness.
If the modification has a dependency on downstream projects of a newer version, this PR should be tested with all supported versions of downstream projects.
The documentation has been modified accordingly, like docstring or example tutorials.

RunningLeon added 7 commits October 31, 2025 10:29

return routed experts

efa13b9

only on tp rank0

986fd96

no change to model config

4e654ee

clone experts from cudagraph buffer

7b0c0c4

use ray to transfer experts

36b9b7c

Merge remote-tracking branch 'upstream/main' into return-routed-experts

217bde6

Merge remote-tracking branch 'upstream/main' into return-routed-experts

c1da9f6
