TOPIC

#long-context

Open source repositories tagged with #long-context, ranked by health score.

huawei-csl
huawei-csl/KVarN
Python
Health score: 88

KVarN is a native vLLM KV-cache quantization backend for your agents: 3-5x more context, throughput above FP16, and FP16-level accuracy. It is calibration-free and enabled with one flag.

389