Skip to content

flydsl implementation of a16w16 gemm#2725

Draft
omuhamma wants to merge 4 commits intomainfrom
omuhamma/fly_a16w16
Draft

flydsl implementation of a16w16 gemm#2725
omuhamma wants to merge 4 commits intomainfrom
omuhamma/fly_a16w16

Conversation

@omuhamma
Copy link
Copy Markdown
Contributor

@omuhamma omuhamma commented Apr 13, 2026

Motivation

To write the gemm for gfx1250 in flydsl (WIP)

FlyDsl Pr: ROCm/FlyDSL#401

Test Plan

Unit tests

Test Result

All tests pass

Submission Checklist

@github-actions
Copy link
Copy Markdown
Contributor

🏷️ CI Guide

Runs automatically on every PR:

  • ✅ Pre-checks (submodule verification, code formatting)
  • ✅ Aiter op tests (gfx942 + gfx950)
  • ✅ Triton tests (only when aiter/ops/triton/** or related paths are changed)

Extended tests (opt-in via labels):

Label Tests
ci:triton-355 Run Triton tests on MI355 in addition to MI325
ci:sglang SGLang integration tests
ci:atom ATOM benchmark (DeepSeek-R1 + GPT-OSS)
ci:vllm vLLM benchmark
ci:all All of the above

Add labels via the sidebar or gh pr edit 2725 --add-label <label>

@omuhamma omuhamma self-assigned this Apr 13, 2026
@coderfeli
Copy link
Copy Markdown
Collaborator

@omuhamma is this validated on silicon? could you plz also create a pr to flydsl if ready? Also, @yadaish is also trying naive gemm here.

@omuhamma
Copy link
Copy Markdown
Contributor Author

@coderfeli Its only been tested on AM workflows as of now, I will further optimize the code and test it on hardware soon.

Added FlyDSL PR to the description

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants