Skip to content

feat: add mlu mooncake pd push support.#1246

Merged
phantomlei3 merged 3 commits intojd-opensource:mainfrom
phantomlei3:feat/pd-support-mlu-push
Apr 14, 2026
Merged

feat: add mlu mooncake pd push support.#1246
phantomlei3 merged 3 commits intojd-opensource:mainfrom
phantomlei3:feat/pd-support-mlu-push

Conversation

@phantomlei3
Copy link
Copy Markdown
Collaborator

  • add backend-agnostic layer synchronizer wiring for layer-wise push overlap
  • enable mooncake push on mlu with descriptor-based memory registration
  • enforce mlu pd push flag validation and reject unsupported pull mode

Copy link
Copy Markdown
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for the MLU backend in the KV cache transfer mechanism, including necessary updates to the Mooncake transfer engine, synchronization logic, and model input parameters. The review comment regarding the spin-wait loop in the MLU layer synchronizer was addressed by suggesting the use of thread yielding to improve efficiency.

Comment thread xllm/core/platform/mlu/mlu_layer_synchronizer.cpp
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp Outdated
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp Outdated
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp Outdated
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp Outdated
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp Outdated
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp
Comment thread xllm/proto/mooncake_transfer_engine.proto Outdated
Comment thread xllm/proto/mooncake_transfer_engine.proto Outdated
Comment thread xllm/xllm.cpp
Comment thread xllm/core/platform/mlu/mlu_layer_synchronizer.cpp
@phantomlei3 phantomlei3 force-pushed the feat/pd-support-mlu-push branch from d196113 to f33ce54 Compare April 11, 2026 08:52
Copy link
Copy Markdown
Collaborator Author

@phantomlei3 phantomlei3 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed for current reviews

Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp Outdated
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp Outdated
Comment thread xllm/xllm.cpp
Comment thread xllm/proto/mooncake_transfer_engine.proto Outdated
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp Outdated
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.cpp
@phantomlei3 phantomlei3 force-pushed the feat/pd-support-mlu-push branch from f33ce54 to fcab1ad Compare April 13, 2026 12:49
Comment thread xllm/core/framework/kv_cache/mooncake_transfer_engine.h Outdated
@phantomlei3 phantomlei3 merged commit eae1623 into jd-opensource:main Apr 14, 2026
7 of 31 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants