benseverndev-oss/goldenmatch

Language: Python
License: MIT
Status: active, rising
Health: 81

Zero-config entity resolution that scales from a CSV to 100M+ rows on a Ray cluster (verified: 100M deduped in 213s, 0.30 GB driver). Fuzzy + exact + probabilistic dedupe, identity graph, PPRL, LLM boost. Python + full TypeScript port; SQL-native in PostgreSQL & DuckDB; MCP/REST servers, dbt + Airflow recipes.

Stars: 79
Forks: 10
Open Issues: 16
Contributors: 10
Last Push: 0d ago

Health Breakdown

Activity: 25
Community: 25
Maintenance: 9
Popularity: 22
#active-learning #agent #airflow #auto-config #data-engineering #data-quality #deduplication #entity-resolution #fuzzy-matching #human-in-the-loop #llm #mcp-server #negative-evidence #polars #pprl #privacy-preserving #python #record-linkage #typescript #zero-config

Should you contribute to benseverndev-oss/goldenmatch?

benseverndev-oss/goldenmatch has a FoundDev health score of 81/100, which puts it in the active-and-maintained tier. The maintainers have shipped changes recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

The last push was within the past day, which signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in Python, so prior Python experience will shorten ramp-up.

The project is licensed under MIT, a standard OSI-approved license, so it is generally safe to contribute to under normal employer IP policies.


More Python repos

openstates
openstates/openstates-scrapers
source for Open States scrapers
Stars: 902, Health: 95
openstack
openstack/nova
OpenStack Compute (Nova). Mirror of code maintained at opendev.org.
Stars: 3.2k, Health: 92
jupyter
jupyter/docker-stacks
Ready-to-run Docker images containing Jupyter applications
Stars: 8.4k, Health: 92