NVIDIA

NVIDIA/Model-Optimizer

Python · Apache-2.0 · active
Health: 75

A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

Stars: 2.9k
Forks: 426
Open Issues: 257
Contributors: 426
Last Push: 0d ago

Health Breakdown

Activity: 25
Community: 13
Maintenance: 12
Popularity: 25
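
The four component scores in the breakdown add up to the headline health score of 75. A minimal sketch of that arithmetic, assuming the overall score is a plain sum of the components (the page does not state the formula or the maximum for each component; the function name `health_score` is hypothetical):

```python
# Component scores as listed in the health breakdown.
components = {
    "Activity": 25,
    "Community": 13,
    "Maintenance": 12,
    "Popularity": 25,
}


def health_score(parts: dict[str, int]) -> int:
    """Assumption: the overall score is the unweighted sum of the parts."""
    return sum(parts.values())


total = health_score(components)
print(f"Health: {total}/100")
```

Under this assumption, Community and Maintenance are the two components holding the score below the top tier.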

Should you contribute to NVIDIA/Model-Optimizer?

NVIDIA/Model-Optimizer has a FoundDev health score of 75/100, which puts it in the active-and-maintained tier. The maintainers have shipped changes recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

The last push was 0 days ago, which signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in Python, so prior Python experience will shorten ramp-up.

The project is licensed under Apache-2.0, a standard OSI-approved license, so it is generally safe to contribute to under normal employer IP policies.

More Python repos

openstates
openstates/openstates-scrapers
source for Open States scrapers
902 stars · health 95
openstack
openstack/nova
OpenStack Compute (Nova). Mirror of code maintained at opendev.org.
3.2k stars · health 92
jupyter
jupyter/docker-stacks
Ready-to-run Docker images containing Jupyter applications
8.4k stars · health 92