Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Feat]: Support Dspark
#1849 opened Jun 29, 2026 by h-guo18 Contributor Draft
Add LAQ NVFP4 export support
#1847 opened Jun 28, 2026 by realAsma Contributor Draft
[Fix]: Add Final Norm for vLLM HIddens Extracter
#1846 opened Jun 28, 2026 by h-guo18 Contributor Draft
docs(eval): add NEL v0.3.0 migration guide + example configs
#1845 opened Jun 28, 2026 by hychiang-git Contributor Loading…
feat(mcp): support MFA Slurm submissions
#1844 opened Jun 27, 2026 by ChenhanYu Collaborator Loading…
launcher: fix host=None when _factory_ is dropped by nemo_run --yaml path
#1842 opened Jun 27, 2026 by ChenhanYu Collaborator Loading…
3 tasks
OMNIML-5128 Capture Docker experiment id
#1840 opened Jun 27, 2026 by ChenhanYu Collaborator Loading…
Fix Nemotron-H PTQ failure on Transformers 5.x with --trust_remote_code (moe_latent_size AttributeError) cherry-pick-0.45.0 After code freeze, cherry-pick to release branch for next rc (bulk update). Only for bug fixes / doc
#1839 opened Jun 26, 2026 by Fridah-nv Contributor Loading…
specdec(recipe): add MiniMax-M2.7-DFlash streaming multi-node pipeline
#1835 opened Jun 26, 2026 by yeyu-nvidia Contributor Loading…
3 tasks
Add quant+sparse attention for vLLM serving
#1832 opened Jun 25, 2026 by kaix-nv Contributor Draft
Fix weight-only prequant layernorm export
#1825 opened Jun 25, 2026 by meenchen Contributor Draft
Fix AutoQuantize causal LM score scaling
#1810 opened Jun 23, 2026 by realAsma Contributor Draft
Add NVFP4 Conv3d export for diffusers VAE (Wan 2.2)
#1809 opened Jun 23, 2026 by jingyu-ml Contributor Loading…
Support FP8 per block (weight + dynamic per token activation) export
#1807 opened Jun 23, 2026 by sugunav14 Contributor Loading…
MiniMax-M3 mixed MXFP8-base + NVFP4-experts PTQ export
#1806 opened Jun 23, 2026 by chadvoegele Contributor Loading…
Puzzletron tutorial fixes for runtime optimization
#1803 opened Jun 23, 2026 by grzegorz-k-karch Contributor Loading…
Add puzzletron eval skill
#1802 opened Jun 23, 2026 by danielkorzekwa Contributor Loading…
Support INT block scale learning
#1795 opened Jun 22, 2026 by realAsma Contributor Draft
ProTip! Filter pull requests by the default branch with base:main.