Open source repositories tagged with #nvidia-cuda, ranked by health score.
Fast LLM speculative inference server for consumer hardware.