-
Notifications
You must be signed in to change notification settings - Fork 426
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix non-deterministic T5 calibration NaN on multi-GPU
#1636
opened Jun 5, 2026 by
kevalmorabia97
Collaborator
Loading…
docs(skills): fix VLM PTQ ViT-quantization + AA eval judge/tool-call gaps
#1632
opened Jun 5, 2026 by
Edwardf0t1
Contributor
Loading…
fix(export): correct unified_export_megatron at EP > 1 and DP > 1
#1631
opened Jun 4, 2026 by
yueshen2016
Contributor
Loading…
3 of 4 tasks
[6058907] Fix ShapeInferenceError in ONNX int8+fp16 quantization of weakly-typed models
#1627
opened Jun 4, 2026 by
ajrasane
Contributor
Loading…
docs(eval skill): vLLM backend env vars + SLURM HF-cache/cpu_partition guidance
#1625
opened Jun 4, 2026 by
cjluo-nv
Collaborator
Loading…
DFlash speculative decoding for MiniMax-M2.7 (FSDP2): auto mask-token, FSDP2 resume fixes, per-checkpoint draft export
#1621
opened Jun 3, 2026 by
yeyu-nvidia
Contributor
Loading…
Add W4A16 NVFP4-MSE Qwen3.5 dense/MoE PTQ recipes
#1620
opened Jun 3, 2026 by
cjluo-nv
Collaborator
Loading…
Fix torch import error to remove circular dependency & move Nemotron configs
#1606
opened Jun 2, 2026 by
jenchen13
Contributor
Loading…
Add NVFP4 + QAD to the Nemotron-3-Nano-30B-A3B tutorial
#1601
opened Jun 2, 2026 by
kevalmorabia97
Collaborator
•
Draft
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.