-
Notifications
You must be signed in to change notification settings - Fork 168
Issues: kvcache-ai/Mooncake
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Will disaggregated vllm with MooncakeStoreConnector support TP and PP?
#127
opened Mar 5, 2025 by
c-guo16
[DOCS][BUG]Can not run the examples follow
vllm-integration-v0.2.md
and vllm-integration.md
#124
opened Feb 28, 2025 by
maobaolong
[DocsRequest]: Need vllm integration mooncake and run examples step-by-step
#122
opened Feb 27, 2025 by
maobaolong
What is the communication protocol for two GPUs within the same node?
#90
opened Jan 22, 2025 by
Rookie-Kai
vllm use mooncake as pipline by RDMA is not work, error message: transport retry counter exceeded
#82
opened Jan 16, 2025 by
yueyuep
How to set
protocol
parameter when prefill and decode in the same node?
#68
opened Jan 7, 2025 by
gujingit
Why do we need to disable CUDA graph (enforce_eager) for Tensor parallelism?
#64
opened Jan 6, 2025 by
skyCreateXian
对于p2p-store-example,用阿里云eRDMA,设置MC_GID_INDEX=1时无法启动,只有0才能启动,不过ibv_modify_qp仍然报错
#53
opened Dec 26, 2024 by
power-more
Inquiry Regarding "Mooncake- A KVCache-centric Disaggregated Architecture for LLM Serving"
#50
opened Dec 25, 2024 by
VegetaPn
[RoadMap] Mooncake Roadmap Q1 & Q2 2025
Roadmap
Future roadmap or plan for new features
#44
opened Dec 18, 2024 by
stmatengss
4 of 40 tasks
"Address not registered by any device(s)" Error when block_size is large
#38
opened Dec 16, 2024 by
c-guo16
Previous Next
ProTip!
no:milestone will show everything without a milestone.