-
Notifications
You must be signed in to change notification settings - Fork 159
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[perf] Improve Prefill Performance by Removing Redundant Padding and Optimizing Alltoall Communication
module:quantization
#949
opened May 25, 2025 by
SlightwindSec
Loading…
Improve Prefill Performance by Removing Redundant Padding and Optimizing Alltoall Communication
module:quantization
#948
opened May 25, 2025 by
SlightwindSec
Loading…
[perf][WIP] Support MOE Multi-stream in Deepseek
module:ops
module:quantization
#947
opened May 24, 2025 by
David9857
Loading…
[Bugfix] Adjust inputbatch to be compatible with latest vllm
#945
opened May 24, 2025 by
MengqingCao
Loading…
[BugFix] Fix a problem of failing to utilize custom kv cache dtype.
module:core
#944
opened May 24, 2025 by
whx-sjtu
Loading…
[Scheduler][P/D] Add support for disaggregated prefill in AsecendScheduler.
#943
opened May 24, 2025 by
whx-sjtu
Loading…
[MLA][Graph] Improve assertion on Graph mode with MLA
#933
opened May 22, 2025 by
MengqingCao
Loading…
[Do Not Merge]add grouped_matmul_swiglu_quant
module:quantization
#930
opened May 22, 2025 by
Angazenn
Loading…
[Performance] Add EPLB expert map import capabilities
module:ops
#919
opened May 21, 2025 by
songshanhu07
Loading…
[ModelRunner] Support embedding inputs
module:tests
ready
read for review
#916
opened May 21, 2025 by
Potabk
Loading…
[WIP][Platform] Add support for Ascend 310P
module:core
module:ops
#914
opened May 21, 2025 by
farawayboat
Loading…
[Fix] Fix update_aclgraph_sizes when running MoE models
module:core
#913
opened May 21, 2025 by
yiz-liu
Loading…
[perf]: add NZ transformation for QuantMatmul and use dequant_swiglu_…
module:quantization
ready
read for review
#907
opened May 20, 2025 by
linfeng-yuan
Loading…
[perf][WIP]: using NZ optimization for quantized GMM
module:quantization
#906
opened May 20, 2025 by
linfeng-yuan
Loading…
[Bugfix] Fix deepseek V0 percision issue and add acc ci for it
module:ops
module:quantization
module:tests
#905
opened May 20, 2025 by
MengqingCao
Loading…
[1/N][UT][v1 MTP] add basic v1 mtp features
module:ops
module:tests
#890
opened May 17, 2025 by
XWFAlone
Loading…
[CI/UT][PD Disaggreate] Initialize PD Disaggreate UT
module:pd
PD disaggregation related
module:tests
#889
opened May 17, 2025 by
MengqingCao
Loading…
Fix the device error when using ray as vllm-acend backend
module:core
module:ops
module:tests
#884
opened May 16, 2025 by
zhuo97
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.