Muhammad Awad (maawad)

👋 Hi there — I'm Muhammad Awad

I'm a Senior Member of Technical Staff at AMD Research, where I conceive, architect, and tech-lead system-level software, libraries, and runtimes for next-generation computing. I lead cross-organizational teams and coordinate initiatives across ML frameworks, kernel engineering, distributed systems, and research teams.

Before joining AMD, I completed my Ph.D. in Electrical and Computer Engineering at UC Davis, advised by John Owens. My research there focused on building dynamic concurrent data structures on GPUs, such as B-trees, dynamic graphs, multiversioned trees, and high-performance hash tables.

💼 Current Focus @ AMD

At AMD, I lead and contribute to projects spanning:

AI-powered tools for GPU performance and productivity
Libraries and runtimes for heterogeneous and distributed systems
Programming models for AMD GPUs and Ryzen™ AI NPUs

Notable projects:

🔁 Iris: Conceived, architected, and tech-led a Triton-based multi-GPU programming framework from scratch. Led a cross-organizational team that demonstrated significant speedups in production LLM workloads.
🧠 IntelliKit: Conceived and architected an open-source LLM-ready profiling toolkit for AMD CPUs and GPUs. Serving as tooling lead for AMD's company-wide ML-for-performance-engineering initiative.
🧠 IntelliPerf: Conceived and led development of an LLM-powered autonomous GPU performance engineering framework that profiles, diagnoses, and optimizes kernels end-to-end.
⚙️ IRON: Contributor to IRON, a low-level development stack for AMD Ryzen™ AI NPUs with Python APIs and MLIR compiler passes.

🔬 Academic Research

I'm broadly interested in parallel computing, concurrent data structures, performance analysis, and low-level GPU programming. As a Ph.D. student, I designed and built several GPU-native data structures:

🌳 Dynamic GPU B-Tree
⌛ Multiversion GPU B-Tree
🕸 Dynamic graph data structure (integrated into Gunrock)
🧹 Epoch-based memory reclamation for safe memory reuse on GPUs
#️⃣ Static GPU hash tables — including cuckoo, Po2, and iceberg hashing
👨‍💻 PTX-level bucketed cuckoo hash set — written in PTX and JIT-compiled

📫 Connect

✉️ Email: muhaawad at amd dot com
🌐 Website
📄 CV
🧑‍💼 LinkedIn
📚 Google Scholar
