dflash: refactor draft model conversion by ruixiang63 · Pull Request #25110 · ggml-org/llama.cpp

ruixiang63 · 2026-06-28T17:33:19Z

Overview

Addresses the review comments from #22105 on DFlash conversion.

Additional information

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: Yes, for code review and to figure out what to change

ruixiang63 · 2026-06-28T17:34:18Z

cc @CISC @ggerganov
Tested the newly converted GGUF model locally; it works well.

ruixiang63 · 2026-06-28T18:18:34Z

I guess we need another approval, right?

CISC · 2026-06-28T18:28:25Z

> I guess we need another approval, right?

Or wait for someone to notice the merge ready label. :)

ggerganov · 2026-06-28T18:29:49Z

@ruixiang63 You should be able to merge. Use squash-merge.

CISC · 2026-06-28T18:30:19Z

@ruixiang63 All yours. :)

ruixiang63 · 2026-06-28T18:32:05Z

yeah, my first commit! Thanks guys!

nipeone · 2026-06-29T14:30:40Z

holy shit! finally merged!

* dflash: refactor draft model conversion
* apply fix for eagle3 convert

dflash: refactor draft model conversion

01129e7

ruixiang63 requested a review from CISC as a code owner June 28, 2026 17:33

github-actions Bot added the conversion label Jun 28, 2026

CISC approved these changes Jun 28, 2026

Comment thread gguf-py/gguf/constants.py

Comment thread conversion/qwen.py

apply fix for eagle3 convert

be0bb24

CISC approved these changes Jun 28, 2026

CISC added the merge ready label Jun 28, 2026

ggerganov approved these changes Jun 28, 2026

CISC removed the merge ready label Jun 28, 2026

ruixiang63 merged commit fa72bc6 into ggml-org:master Jun 28, 2026
5 checks passed

waldirsp11 mentioned this pull request Jun 29, 2026

Eval bug: QWEN3.6 27b DFlash draft model fails to load #25116

Closed

Ankk98 added a commit to Ankk98/llama.cpp that referenced this pull request Jun 30, 2026

Merge upstream master (dflash conversion refactor ggml-org#25110)

210882f

turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jul 1, 2026

dflash: refactor draft model conversion (ggml-org#25110)

f15f406

* dflash: refactor draft model conversion
* apply fix for eagle3 convert

TheTom mentioned this pull request Jul 1, 2026

eval: cherry-pick upstream DFlash spec-decode (#22105 + #25110, dep #24707) — CI check TheTom/llama-cpp-turboquant#201

Draft

TheTom pushed a commit to TheTom/llama-cpp-turboquant that referenced this pull request Jul 2, 2026

dflash: refactor draft model conversion (ggml-org#25110)

a4fd3ad

* dflash: refactor draft model conversion
* apply fix for eagle3 convert

ruixiang63 merged 2 commits into ggml-org:master from ruixiang63:dflash_convert

4 participants
