← Back to Discover
raketenkater

raketenkater/llm-server

GoMITactiverising
89Health

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.

Stars223
Forks11
Open Issues0
Contributors11
Last Push0d ago

Health Breakdown

Activity
25
Community
25
Maintenance
14
Popularity
25
#cuda#gguf#golang#inference-server#llama-cpp#llamacpp#llm#local-llm#localllama#metal#moe#multi-gpu#ollama-alternative#openai-api#rocm#self-hosted#speculative-decoding#vulkan
View on GitHub ↗Issues (0) ↗Pull Requests ↗Wiki ↗

Should you contribute to raketenkater/llm-server?

raketenkater/llm-server has a FoundDev health score of 89/100, which puts it in the active-and-maintained tier. The maintainer team is shipping recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

Last push was 0 days ago — that signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in Go, so prior Go experience will shorten ramp-up.

Licensed under MIT, a standard OSI-approved license — safe to contribute to under normal employer IP policies.

Community

raketenkater89

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alternative for multi-GPU rigs.

activerising
22311 contributors
0d ago

More Go repos

canopy-network
canopy-network/canopy
The official go implementation of the Canopy Network protocol
11.2k100
pion
pion/srtp
A Go implementation of SRTP
13793
cloudfoundry
cloudfoundry/gosigar
A Golang implementation of the Sigar API
48293