alasdairforsythe

alasdairforsythe/tokenmonster

Language: Go · License: MIT · Status: active
Health score: 88

Ungreedy subword tokenizer and vocabulary trainer for Python, Go, C++ & Javascript

Stars: 626
Forks: 22
Open Issues: 14
Contributors: 22
Last Push: 0d ago

Health Breakdown

Activity: 25
Community: 25
Maintenance: 13
Popularity: 25
#text-tokenization #tokenisation #tokenization #tokenize #tokenizer #tokenizing #vocabulary #vocabulary-builder #vocabulary-generator

Should you contribute to alasdairforsythe/tokenmonster?

alasdairforsythe/tokenmonster has a FoundDev health score of 88/100, which puts it in the active-and-maintained tier. The maintainers have shipped changes recently, issues are being closed, and a PR you open this week has a realistic chance of being reviewed.

The last push was 0 days ago, which signals an actively maintained project. New issues are likely to get a maintainer response within days. The project is written primarily in Go, so prior Go experience will shorten ramp-up.

The project is licensed under MIT, a standard OSI-approved license, so it is safe to contribute to under normal employer IP policies.
