
docs: exporting a draft model from a quantized model. #1455

Open
rauletorresc wants to merge 3 commits into jd-opensource:main from rauletorresc:rauletorresc-patch-001

Conversation

@rauletorresc
Contributor

The MTP documentation assumes the base model is not quantized. However, a draft model extracted from a quantized base model is not quantized by default. This PR documents the steps needed to quantize the draft model so that speculative decoding is served correctly.
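
One way to confirm the mismatch described above is to compare the `config.json` files of the two checkpoints. The sketch below assumes Hugging Face-style checkpoint directories in which a quantized model records its settings under a `quantization_config` key. That key and both function names are assumptions made for illustration; they are not taken from the MTP documentation or from this PR.

```python
import json
from pathlib import Path


def has_quantization_config(model_dir):
    """Return True if config.json in model_dir has a non-empty
    'quantization_config' entry (assumed key, see the note above)."""
    config_path = Path(model_dir) / "config.json"
    config = json.loads(config_path.read_text())
    return bool(config.get("quantization_config"))


def draft_needs_quantization(base_dir, draft_dir):
    """Return True when the base checkpoint is marked as quantized
    but the exported draft checkpoint is not."""
    return has_quantization_config(base_dir) and not has_quantization_config(draft_dir)
```

If `draft_needs_quantization(base_dir, draft_dir)` returns `True` for an exported draft, the draft would presumably still need the quantization steps this PR adds to the documentation.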

Contributor

@gemini-code-assist gemini-code-assist Bot left a comment


Code Review

This pull request updates the MTP documentation to include a comprehensive guide on exporting draft models from quantized base models, including necessary scripts and configuration patches. The reviewer provided feedback to improve the robustness and portability of the documentation's scripts by recommending the removal of hardcoded paths and the use of standard working directories instead of /root/.

Comment thread docs/en/features/mtp.md Outdated
Comment thread docs/en/features/mtp.md
Comment thread docs/en/features/mtp.md Outdated
@liutongxuan changed the title from "docs: Exporting a draft model from a quantized model" to "docs: exporting a draft model from a quantized model." on May 15, 2026
@rauletorresc
Contributor Author

Hi @liutongxuan, could you please review this PR again?
