NVIDIA/Model-Optimizer

PythonApache-2.0active

Health

A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

Stars3.3k

Forks509

Open Issues318

Contributors509

Last Push0d ago

Health Breakdown

Activity

Community

Maintenance

Should you contribute to NVIDIA/Model-Optimizer?

NVIDIA/Model-Optimizer has a FoundDev health score of 75/100, which puts it in the active-and-maintained tier. The maintainer team is shipping recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

Last push was 0 days ago — that signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in Python, so prior Python experience will shorten ramp-up.

Licensed under Apache-2.0, a standard OSI-approved license — safe to contribute to under normal employer IP policies.