Pull requests: jd-opensource/xllm

Pull requests list

bugfix: add ILU process group ctor for explicit local rank. (#1240)
#1271 opened Apr 13, 2026 by liutongxuan Collaborator
feat: add fused_gdn_gating kernel in tilelang-ascend.
#1267 opened Apr 13, 2026 by zhang-minchao Collaborator
feat: add startup profile run for mlu llm engine.
#1266 opened Apr 13, 2026 by phantomlei3 Collaborator
bugfix: correctly reuse residuals in FBCache.
#1265 opened Apr 13, 2026 by z-jun03 Collaborator
bugfix: fix mtp prefix cache prefill starvation.
#1264 opened Apr 13, 2026 by phantomlei3 Collaborator
bugfix: unify device_index logging conversion with helper overloads
#1263 opened Apr 12, 2026 by kuma-loong Contributor
bugfix: fix DeepSeek V3 crash and V3.2 prefix-cache OOM.
#1258 opened Apr 10, 2026 by DongheJin Collaborator
bugfix: fix CFG negative prompt judgment & simplify variable names fo…
#1256 opened Apr 10, 2026 by yiming-l21 Collaborator
feat: add mlu mooncake pd push support.
#1246 opened Apr 10, 2026 by phantomlei3 Collaborator
perf: Qwen Image Optimize.
#1242 opened Apr 9, 2026 by shan-chen-feng Collaborator
feat: support in-batch prefix cache.
#1240 opened Apr 9, 2026 by Clement-Wang26 Collaborator
bugfix: optimize multi-modal preprocess accuracy.
#1235 opened Apr 9, 2026 by wly-115 Collaborator
feat: add configurable decode ACL-graph fallback threshold.
#1233 opened Apr 8, 2026 by DongheJin Collaborator
feat: support tensor parallel for Flux model on npu device.
#1231 opened Apr 8, 2026 by z-jun03 Collaborator
perf: Qwen image optimize.
#1230 opened Apr 8, 2026 by shan-chen-feng Collaborator
feat: enable rec fast sampler for llm beam search.
#1224 opened Apr 8, 2026 by RobbieLeung Collaborator
feat: improve cuda shared memory tensor handling.
#1222 opened Apr 8, 2026 by RobbieLeung Collaborator
bugfix: support per-fork disagg pd port for fork master.
#1220 opened Apr 8, 2026 by Clement-Wang26 Collaborator