-
Notifications
You must be signed in to change notification settings - Fork 208
Pull requests: jd-opensource/xllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: support dumping xllm server flags to json file.
#1518
opened May 22, 2026 by
XuZhang99
Collaborator
Loading…
feat:remove unused func and support deepseek_v4_mtp graph on npu.
#1517
opened May 22, 2026 by
panxua
Contributor
Loading…
feat: expose cached token usage in responses.
#1514
opened May 21, 2026 by
zhang-minchao
Collaborator
Loading…
bugfix: fix precision issue of TileLang operator chunk_gated_delta_ru…
#1510
opened May 21, 2026 by
fengz72
Loading…
feat: enable REC XAttention for Qwen3 MoE on cuda device.
#1500
opened May 20, 2026 by
LMX-xin
Collaborator
Loading…
feat: support vae parallel for qwen-image-edit-plus.
#1499
opened May 20, 2026 by
shan-chen-feng
Collaborator
Loading…
feat: add TileLang chunk_gated_delta_rule_fwd_h kernel.
#1498
opened May 20, 2026 by
fengz72
Loading…
bugfix: use max_concurrent_requests for single block and linear state allocation.
#1496
opened May 20, 2026 by
pjgao
Loading…
feat: add rate limit, monitor, multiple images and input validation for dit.
#1483
opened May 19, 2026 by
xiao-yu-chen
Collaborator
Loading…
feat: support customized multimodal preprocess configs.
#1481
opened May 19, 2026 by
xanecdotex
Collaborator
•
Draft
refactor: remove negative condition when choosing decode or prefill
#1475
opened May 18, 2026 by
rauletorresc
Contributor
Loading…
feat: parallelize multimodal decode in request transfer.
#1474
opened May 18, 2026 by
wly-115
Collaborator
Loading…
refactor: split forward inputs from model input params [3 / 3].
#1469
opened May 18, 2026 by
RobbieLeung
Collaborator
Loading…
bugfix: ensure tensor contiguous layout before protobuf serialization.
#1468
opened May 18, 2026 by
a120092009
Collaborator
Loading…
fused_sigmoid_gating_tilelang tilelang adapt in qwen3.x
#1465
opened May 15, 2026 by
BikingNow
Loading…
bugfix: reduce acl graph memory overhead.
#1457
opened May 15, 2026 by
RobbieLeung
Collaborator
Loading…
docs: exporting a draft model from a quantized model.
#1455
opened May 14, 2026 by
rauletorresc
Contributor
Loading…
【WIP】feat: onerec xattn npu multistream.
#1453
opened May 14, 2026 by
DragonFive
Collaborator
•
Draft
feat: support the wan22's dit and tp parallel.
#1445
opened May 14, 2026 by
ethan686
Contributor
Loading…
feat: support the /v1/video/generation for wan2.2.
#1444
opened May 13, 2026 by
ethan686
Contributor
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.