← Back to Discover
SeraphimSerapis

SeraphimSerapis/tool-eval-bench

PythonMITactiverising
74Health

Tool-calling quality benchmark for LLM serving stacks. 65+ deterministic scenarios testing multi-turn orchestration, safety boundaries, and structured output. Supports vLLM, LiteLLM, and llama.cpp.

Stars80
Forks8
Open Issues0
Contributors8
Last Push0d ago

Health Breakdown

Activity
25
Community
13
Maintenance
14
Popularity
22
View on GitHub ↗Issues (0) ↗Pull Requests ↗Wiki ↗

Should you contribute to SeraphimSerapis/tool-eval-bench?

SeraphimSerapis/tool-eval-bench has a FoundDev health score of 74/100, which puts it in the active-and-maintained tier. The maintainer team is shipping recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

Last push was 0 days ago — that signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in Python, so prior Python experience will shorten ramp-up.

Licensed under MIT, a standard OSI-approved license — safe to contribute to under normal employer IP policies.

Community

SeraphimSerapis74

Tool-calling quality benchmark for LLM serving stacks. 65+ deterministic scenarios testing multi-turn orchestration, safety boundaries, and structured output. Supports vLLM, LiteLLM, and llama.cpp.

activerising
808 contributors
0d ago

More Python repos

SilentDemonSD
SilentDemonSD/WZML-X
A Super Enhanced Telegram bot which can download torrents, mega, google drive links, telegram file, direct links and all yt-dlp sites, upload to google drive, telegram cloud, rclone clouds or ddl servers. Made with Pyrogram in Python by WZML-X Devs.
1.4k100
ZhuLinsen
ZhuLinsen/alphasift
AI-native A-share stock screening engine with full-market discovery, LLM ranking, risk-aware scoring, and auditable evaluation. AI选股
11393
dancinlab
dancinlab/anima
🧠 Living Consciousness Agent — PureField repulsion-field engine · Engine A ⇄ Engine G · Ψ=1/2 fixed point · 2,448 laws + 392 hypotheses
15692