← Back to Discover
ztxz16

ztxz16/fastllm

C++Apache-2.0active
76Health

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

Stars4.7k
Forks464
Open Issues289
Contributors464
Last Push3d ago

Health Breakdown

Activity
25
Community
13
Maintenance
13
Popularity
25
View on GitHub ↗Issues (289) ↗Pull Requests ↗Wiki ↗

Should you contribute to ztxz16/fastllm?

ztxz16/fastllm has a FoundDev health score of 76/100, which puts it in the active-and-maintained tier. The maintainer team is shipping recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

Last push was 3 days ago — that signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in C++, so prior C++ experience will shorten ramp-up.

Licensed under Apache-2.0, a standard OSI-approved license — safe to contribute to under normal employer IP policies.

Community

ztxz16
ztxz16/fastllm
C++Apache 2.0
76

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型,任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型,单并发20tps;INT4量化模型单并发30tps,多并发可达60+。

active
4.7k464 contributors289 issues
3d ago

More C++ repos

istio
istio/proxy
The Istio proxy components.
894100
MarlinFirmware
MarlinFirmware/Marlin
Marlin is a firmware for RepRap 3D printers optimized for both 8 and 32 bit microcontrollers. Marlin supports all common platforms. Many commercial 3D printers come with Marlin installed. Check with your vendor if you need source code for your specific machine.
17.4k99
PX4
PX4/PX4-Autopilot
PX4 Autopilot Software
11.8k97