kvcache-ai / Mooncake Public

Issues: kvcache-ai/Mooncake

[RoadMap] Mooncake Roadmap Q1 & Q2 2025

#44 opened Dec 18, 2024 by stmatengss

29 Open 38 Closed

Issues list

Potential Cache Hit Rate Reduction in Cache-Aware Scheduling

#131 opened Mar 7, 2025 by cy953708688

Will disaggregated vLLM with MooncakeStoreConnector support TP and PP?

#127 opened Mar 5, 2025 by c-guo16

[DOCS][BUG] Cannot run the examples by following vllm-integration-v0.2.md and vllm-integration.md

#124 opened Feb 28, 2025 by maobaolong

link warn : connection already connected

#123 opened Feb 27, 2025 by zx-ai

[DocsRequest]: Need step-by-step instructions for integrating vLLM with Mooncake and running the examples

#122 opened Feb 27, 2025 by maobaolong

Are there plans to provide prebuilt container images?

#121 opened Feb 27, 2025 by cheferrari

WC status and QP errors occur during stress testing

#117 opened Feb 22, 2025 by power-more

Some questions about communication time

#103 opened Feb 13, 2025 by sitabulaixizawaluduo

target node error

#97 opened Feb 11, 2025 by zx-ai

What is the communication protocol for two GPUs within the same node?

#90 opened Jan 22, 2025 by Rookie-Kai

Support Model

#88 opened Jan 21, 2025 by duzw9311

Installing error!!

#85 opened Jan 21, 2025 by IEI-mjx

vLLM using Mooncake as a pipeline over RDMA does not work; error message: transport retry counter exceeded

#82 opened Jan 16, 2025 by yueyuep

proposal: improve the topology metadata

#78 opened Jan 13, 2025 by doujiang24

Libfabric transport layer support

#76 opened Jan 10, 2025 by xendo

Questions about transfer_engine_bench

#75 opened Jan 10, 2025 by cyLi-Tiger

How to set the protocol parameter when prefill and decode are on the same node?

#68 opened Jan 7, 2025 by gujingit

Why do we need to disable CUDA graph (enforce_eager) for Tensor parallelism?

#64 opened Jan 6, 2025 by skyCreateXian

Why is ITL's first token so long?

#62 opened Jan 3, 2025 by sunshenao

For p2p-store-example with Alibaba Cloud eRDMA, it fails to start when MC_GID_INDEX=1 and starts only with 0, but ibv_modify_qp still reports an error

#53 opened Dec 26, 2024 by power-more

Inquiry Regarding "Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving"

#50 opened Dec 25, 2024 by VegetaPn

CXL/Shared memory

#48 opened Dec 22, 2024 by stmatengss

[RoadMap] Mooncake Roadmap Q1 & Q2 2025

Label: Roadmap (Future roadmap or plan for new features)

#44 opened Dec 18, 2024 by stmatengss

4 of 40 tasks

"Address not registered by any device(s)" Error when block_size is large

#38 opened Dec 16, 2024 by c-guo16

vllm-integration error with multiple RDMA devices

#35 opened Dec 13, 2024 by junna2016
