Open-source repositories tagged with #llm-d, ranked by health score.
Kubernetes controllers for fast model actuation using vLLM sleep/wake and launcher-based model swapping