guqiong96/Lvllm

Python · Apache-2.0 · active

LvLLM is a specialized NUMA extension of vLLM that makes full use of CPU and memory resources and reduces GPU memory requirements. It features an efficient GPU-parallel and NUMA-parallel architecture and supports hybrid inference for large MoE models.

Stars: 370

Forks: 33

Open Issues: 3

Contributors: 33

Last Push: 0d ago

Should you contribute to guqiong96/Lvllm?

guqiong96/Lvllm has a FoundDev health score of 89/100, which puts it in the active-and-maintained tier. The maintainers have shipped changes recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

The last push was 0 days ago, which signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in Python, so prior Python experience will shorten ramp-up.

Licensed under Apache-2.0, a standard OSI-approved license that is generally compatible with typical employer IP policies.