-
Notifications
You must be signed in to change notification settings - Fork 47
Pull requests: ROCm/FlyDSL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[FlyOp] Add stages args for MmaMakeFragment to simplify multi-stages allocation
#437
opened Apr 25, 2026 by
sjfeng1999
Collaborator
Loading…
1 task
optimized fast path by reducing register use and vectorize on normal …
#436
opened Apr 24, 2026 by
kudomcho
Loading…
Adds Grouped and Batched GEMM kernels with blockscaling matching DeepGEMM API
#433
opened Apr 23, 2026 by
aryaman-gupta
Loading…
4 tasks done
CI: add standalone ATOM integration workflow
#428
opened Apr 22, 2026 by
gyohuangxin
Member
Loading…
3 of 4 tasks
feat: align quant and fused kernels with Triton in FlyDSL
#421
opened Apr 21, 2026 by
cschenjunlin
Loading…
1 of 7 tasks
feat: support mori IR JIT compilation with shmem
#418
opened Apr 21, 2026 by
yanboshao
Contributor
Loading…
1 task
[AOT] Add dump_to_object, export_to_c, and load_module for AOT export
#414
opened Apr 20, 2026 by
coderfeli
Collaborator
Loading…
Add gfx950 (MI355X) preload tuning table for preshuffle GEMM
#411
opened Apr 16, 2026 by
andyluo7
Contributor
Loading…
Unify MoE test perf timing to CUDA Event bench and fix minor issues
#408
opened Apr 16, 2026 by
XingerZhu
Collaborator
Loading…
1 task
[MI450][Kernel] add Deepseek MHA bf16 kernel verified on MI450
#393
opened Apr 13, 2026 by
jli-melchior
Contributor
Loading…
1 task
[FLYDSL]: if dispatch dynamic tests refactor
#346
opened Apr 3, 2026 by
xudoyuan
Contributor
Loading…
1 task
[Feature] Add JAX integration for FlyDSL kernels
#257
opened Mar 21, 2026 by
wenchenvincent
Loading…
1 task done
ProTip!
Follow long discussions with comments:>50.