← Explore
TOPIC

#paged-attention

Open source repositories tagged with #paged-attention, ranked by health score.

openinfer-project
openinfer-project/openinfer
Rust
86
health

Pure Rust + CUDA LLM inference engine — no PyTorch, OpenAI-compatible, serves Qwen3 to Kimi-K2

498