Open source repositories tagged with #model-serving, ranked by health score.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Olares: An Open-Source Personal Cloud to Reclaim Your Data
A framework for efficient model inference with omni-modality models
Community maintained hardware plugin for vLLM on Ascend