Pull requests: NVIDIA/Model-Optimizer
Add unit test for checking any leak of temporary augmented onnx files, on exception during ONNX INT4 AWQ quantization
#1383
opened May 3, 2026 by
vishalpandya1990
Contributor
fixes for fused moe (qwen3.6, GLM5.1) + MSE calibration
#1382
opened May 2, 2026 by
Fridah-nv
Contributor
[DeepSeek] Default to top-k calibration with peer-max input amax sync
#1380
opened May 1, 2026 by
cjluo-nv
Collaborator
3 tasks done
feat(launcher): add DFlash support for DeepSeek-V4-Flash target model
#1379
opened Apr 30, 2026 by
ChenhanYu
Collaborator
Use trtexec_safe on safety platforms when using remoteAutoTuning
#1378
opened Apr 30, 2026 by
dthienan-nv
Contributor
Enable active-param and memory based Minitron pruning constraint
#1377
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
Add Nemotron-3-Nano-30B-A3B-BF16 e2e tutorial: Prune + Distill + Quantize + Nemo Evaluator + vLLM deployment
#1376
opened Apr 30, 2026 by
kevalmorabia97
Collaborator
Draft
Fix sparsity-only export emitting empty hf_quant_config.json
cherry-pick-0.44.0
After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1375
opened Apr 29, 2026 by
kaix-nv
Contributor
Support Mixed precision & Static MSE PTQ in MCore export; Nemotron Super v3 NVFP4 recipe
#1363
opened Apr 28, 2026 by
jenchen13
Contributor
[SKILL.md Chore] make .agents/ the canonical agent-skills location
#1362
opened Apr 28, 2026 by
shljessie
Add pre-built evaluation recipes for common benchmarks
#1357
opened Apr 27, 2026 by
kaix-nv
Contributor
[6106576] Restore llm_export_utils as deprecated shim for edgellm 0.6.1 compat
cherry-pick-0.44.0
#1356
opened Apr 27, 2026 by
ajrasane
Contributor
2 tasks done
[6110209] Patch zero FP16 scales in INT4_AWQ ONNX export
cherry-pick-0.44.0
#1353
opened Apr 27, 2026 by
ajrasane
Contributor
[OMNIML-4021]: align local JSONL loading with HF datasets path + keep original behaviour
#1345
opened Apr 24, 2026 by
shengliangxu
Collaborator
3 tasks done
[OMNIML-3934] Guidelines and precommit hook for pydantic backward compatibility
#1333
opened Apr 23, 2026 by
jenchen13
Contributor
[Refactor] speculative decoding: use mto config subsystem
#1328
opened Apr 23, 2026 by
h-guo18
Contributor
Quantize lm_head + embedding for Nemotron-H, add NVFP4 W4A16 recipe
#1327
opened Apr 22, 2026 by
ajrasane
Contributor
3 of 5 tasks