Conversation

@kyuyeunk (Collaborator) commented Nov 7, 2025

Description

Support bias and SwiGLU activation in the MoE layer.

Additionally, this PR fixes an issue where the program fails when attempting to load gpt-oss with expert parallelism.
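For context, here is a minimal sketch of what a biased SwiGLU expert computes, in plain PyTorch. The tensor names and the fused gate/up layout are illustrative assumptions, not the exact layout this PR implements:

```python
import torch
import torch.nn.functional as F

def swiglu_expert(x, w_gate_up, b_gate_up, w_down, b_down):
    # x: [tokens, hidden]; w_gate_up: [hidden, 2 * intermediate] with the
    # gate and up halves fused; w_down: [intermediate, hidden].
    gate_up = x @ w_gate_up + b_gate_up   # biased fused gate/up projection
    gate, up = gate_up.chunk(2, dim=-1)   # split the fused halves
    hidden = F.silu(gate) * up            # SwiGLU: silu(gate) * up
    return hidden @ w_down + b_down       # biased down projection
```

In a full MoE layer this computation runs once per routed expert, with the router's weights combining the expert outputs.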

Tests

VLLM_DISABLE_SHARED_EXPERTS_STREAM=1 MODEL_IMPL_TYPE=vllm vllm serve --model=unsloth/gpt-oss-120b-BF16  --max-model-len=8192 --max-num-batched-tokens 1024 --max-num-seqs=256 --no-enable-prefix-caching --disable-log-requests --gpu-memory-utilization 0.8 --tensor-parallel-size 8 --enable-expert-parallel
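The command above enables --enable-expert-parallel, the configuration under which loading previously failed. As a hypothetical illustration of the kind of per-expert slicing that expert-parallel loading requires (the function name and the even-split scheme are assumptions, not the PR's actual fix):

```python
import torch

def shard_expert_param(param: torch.Tensor, ep_rank: int, ep_size: int) -> torch.Tensor:
    # param: [num_experts, ...], e.g. a per-expert bias of shape
    # [num_experts, 2 * intermediate]. Each EP rank keeps only its slice.
    num_experts = param.shape[0]
    assert num_experts % ep_size == 0, "assumes experts divide evenly across ranks"
    per_rank = num_experts // ep_size
    start = ep_rank * per_rank
    return param[start:start + per_rank]  # this rank's local experts only
```

If a loader indexes the full [num_experts, ...] tensor without such per-rank slicing, shapes no longer line up once expert parallelism is enabled, which is one way such a load can fail.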

Checklist

Before submitting this PR, please make sure:

  • I have performed a self-review of my code.
  • I have added necessary comments in my code, particularly in hard-to-understand areas.
  • I have made or will make corresponding changes to any relevant documentation.

github-actions bot commented Nov 7, 2025

Description

Start with a short description of what the PR does and how this is a change from
the past.

The rest of the description includes relevant details and context, examples:

  • why is this change being made,
  • the problem being solved and any relevant context,
  • why this is a good solution,
  • some information about the specific implementation,
  • shortcomings of the solution and possible future improvements.

If the change fixes a bug or a GitHub issue, please include a link, e.g.:
FIXES: b/123456
FIXES: #123456

Tests

Please describe how you tested this change, and include any instructions and/or
commands to reproduce.

Checklist

Before submitting this PR, please make sure:

  • I have performed a self-review of my code.
  • I have added necessary comments in my code, particularly in hard-to-understand areas.
  • I have made or will make corresponding changes to any relevant documentation.

@OhadRubin

@kyuyeunk when can we expect kernel support and fully functional gpt-oss-120b inference?

@kyuyeunk (Collaborator, Author) commented Nov 7, 2025

> @kyuyeunk when can we expect kernel support and fully functional gpt-oss-120b inference?

For the unquantized version, I would expect the torchax path to fully support it by the end of this week.

I'm uncertain about the JAX path, because I foresee heavy refactoring, not just to make the kernel work but also to add optimizations that fully take advantage of it.

@kyuyeunk force-pushed the load_moe_bias branch 2 times, most recently from 544b317 to e0e7ffd on November 8, 2025 at 10:20.
@kyuyeunk changed the title from "[Torchax] Add ability to load MoE bias" to "[Torchax] Support bias and swiglu in MoE" on November 8, 2025.
@kyuyeunk force-pushed the load_moe_bias branch 3 times, most recently from 30e412a to 622d07a on November 9, 2025 at 01:51.
@kyuyeunk merged commit 6d0c11c into main on November 10, 2025.
3 checks passed