← Explore
TOPIC

#quantization-aware-training

Open source repositories tagged with #quantization-aware-training, ranked by health score.

intel
intel/neural-compressor
Python
89
health

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

2.6k