-
Notifications
You must be signed in to change notification settings - Fork 179
Pull requests: jd-opensource/xllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
bugfix: add ILU process group ctor for explicit local rank. (#1240)
#1271
opened Apr 13, 2026 by
liutongxuan
Collaborator
Loading…
feat: support Qwen down_proj fallback for compressed-tensors ignored modules.
#1270
opened Apr 13, 2026 by
yingxudeng
Collaborator
Loading…
feat: add fused_gdn_gating kernel in tilelang-ascend.
#1267
opened Apr 13, 2026 by
zhang-minchao
Collaborator
Loading…
feat: add startup profile run for mlu llm engine.
#1266
opened Apr 13, 2026 by
phantomlei3
Collaborator
Loading…
bugfix: correctly reuse residuals in FBCache.
#1265
opened Apr 13, 2026 by
z-jun03
Collaborator
Loading…
bugfix: fix mtp prefix cache prefill starvation.
#1264
opened Apr 13, 2026 by
phantomlei3
Collaborator
Loading…
bugfix: unify device_index logging conversion with helper overloads
#1263
opened Apr 12, 2026 by
kuma-loong
Contributor
Loading…
bugfix: init ssm_cache by config ssm_cache_type and unify compute precision to fp32.
#1259
opened Apr 10, 2026 by
JC-ut0
Contributor
Loading…
bugfix: fix DeepSeek V3 crash and V3.2 prefix-cache OOM.
#1258
opened Apr 10, 2026 by
DongheJin
Collaborator
Loading…
bugfix: fix CFG negative prompt judgment & simplify variable names fo…
#1256
opened Apr 10, 2026 by
yiming-l21
Collaborator
Loading…
feat: support Qwen down_proj fallback for compressed-tensors ignored modules.
#1254
opened Apr 10, 2026 by
yingxudeng
Collaborator
Loading…
bugfix: remove spurious backslash breaking output redirection in launch scripts.
#1248
opened Apr 10, 2026 by
kuishou68
Loading…
feat: add mlu mooncake pd push support.
#1246
opened Apr 10, 2026 by
phantomlei3
Collaborator
Loading…
feat: support in-batch prefix cache.
#1240
opened Apr 9, 2026 by
Clement-Wang26
Collaborator
Loading…
bugfix: optimize multi-modal preprocess accuracy.
#1235
opened Apr 9, 2026 by
wly-115
Collaborator
Loading…
feat: add configurable decode ACL-graph fallback threshold.
#1233
opened Apr 8, 2026 by
DongheJin
Collaborator
Loading…
feat: support tensor parallel for Flux model on npu device.
#1231
opened Apr 8, 2026 by
z-jun03
Collaborator
Loading…
feat: expose startup runtime flags through c and python apis.
#1229
opened Apr 8, 2026 by
RobbieLeung
Collaborator
•
Draft
feat: enable rec fast sampler for llm beam search.
#1224
opened Apr 8, 2026 by
RobbieLeung
Collaborator
Loading…
feat: improve cuda shared memory tensor handling.
#1222
opened Apr 8, 2026 by
RobbieLeung
Collaborator
Loading…
bugfix: support per-fork disagg pd port for fork master.
#1220
opened Apr 8, 2026 by
Clement-Wang26
Collaborator
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.