-
Notifications
You must be signed in to change notification settings - Fork 375
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fixes for fused moe (qwen3.6, GLM5.1 + MSE calibration
#1382
opened May 2, 2026 by
Fridah-nv
Contributor
Loading…
[DeepSeek] Default to top-k calibration with peer-max input amax sync
#1380
opened May 1, 2026 by
cjluo-nv
Collaborator
Loading…
3 tasks done
feat(launcher): add DFlash support for DeepSeek-V4-Flash target model
#1379
opened Apr 30, 2026 by
ChenhanYu
Collaborator
Loading…
Use trtexec_safe on safety platforms when using remoteAutoTuning
#1378
opened Apr 30, 2026 by
dthienan-nv
Contributor
Loading…
Enable active-param and memory based Minitron pruning constraint
#1377
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
Loading…
Add Nemotron-3-Nano-30B-A3B-BF16 e2e tutorial: Prune + Distill + Quantize + Nemo Evaluator + vLLM deployment
#1376
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
•
Draft
Fix sparsity-only export emitting empty hf_quant_config.json
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1375
opened Apr 29, 2026 by
kaix-nv
Contributor
Loading…
fix: guard against None chat_template in _post_process_chat_template
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1371
opened Apr 29, 2026 by
yeyu-nvidia
Contributor
Loading…
fix: include medusa in data_module assignment in main.py
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1370
opened Apr 29, 2026 by
yeyu-nvidia
Contributor
Loading…
Added fallback to load extra cudnn dlls in the site packages
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1369
opened Apr 29, 2026 by
hthadicherla
Contributor
Loading…
Support Mixed precision & Static MSE PTQ in MCore export; Nemotron Super v3 NVFP4 recipe
#1363
opened Apr 28, 2026 by
jenchen13
Contributor
Loading…
[SKILL.md Chore] make .agents/ the cannonical agent-skills location
#1362
opened Apr 28, 2026 by
shljessie
Loading…
Add pre-built evaluation recipes for common benchmarks
#1357
opened Apr 27, 2026 by
kaix-nv
Contributor
Loading…
[6106576] Restore llm_export_utils as deprecated shim for edgellm 0.6.1 compat
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1356
opened Apr 27, 2026 by
ajrasane
Contributor
Loading…
2 tasks done
[6110209] Patch zero FP16 scales in INT4_AWQ ONNX export
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1353
opened Apr 27, 2026 by
ajrasane
Contributor
Loading…
[OMNIML-4021]: align local JSONL loading with HF datasets path + keep original behaviour
#1345
opened Apr 24, 2026 by
shengliangxu
Collaborator
Loading…
3 tasks done
[OMNIML-3934] Guidelines and precommit hook for pydantic backward compatbility
#1333
opened Apr 23, 2026 by
jenchen13
Contributor
Loading…
[Refactor] speculative decoding: use mto config subsystem
#1328
opened Apr 23, 2026 by
h-guo18
Contributor
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.