rustio rustio.net
Health score: 47/100

kitoken

v0.10.1 Growing

Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization

BSD-2-Clause Edition 2021 MSRV 1.82.0

Quick Verdict

  • ✕ Not updated for 1+ year
  • ! Pre-1.0: API may have breaking changes
  • ! Heavy dependency tree (23 direct deps)

Downloads: 37.4K
Dependents: 7
Releases: 2
Size: 44KB

Deep Insights

Strong growth momentum

7.4K downloads in the last 30 days (247/day), up 30% from the previous period.
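
The per-day figure follows from the 30-day total. A minimal Rust sketch of that arithmetic, assuming the rounded 7.4K total shown on the page and assuming "up 30%" means the current total is 1.3 times the previous one. The function names are illustrative and not part of any rustio.net API:

```rust
/// Average downloads per day over a window of `days`.
fn daily_average(total_downloads: f64, days: f64) -> f64 {
    total_downloads / days
}

/// Previous-period total implied by a growth percentage,
/// assuming growth = (current - previous) / previous.
fn implied_previous_total(current_total: f64, growth_percent: f64) -> f64 {
    current_total / (1.0 + growth_percent / 100.0)
}

fn main() {
    let current = 7_400.0; // "7.4K", already rounded on the page
    let per_day = daily_average(current, 30.0);
    let previous = implied_previous_total(current, 30.0);
    println!("{:.0} downloads/day", per_day);
    println!("about {:.0} downloads in the previous period", previous);
}
```

Because the inputs are rounded, the implied previous total is only an estimate.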

Pre-1.0 for over a year

kitoken is more than a year old and has not reached 1.0. Expect possible breaking API changes between versions.

Heavy dependency tree

23 direct dependencies. Consider the impact on compile times and supply chain complexity.

Compact crate

At 43KB, kitoken is lightweight. A small crate size often indicates focused, well-scoped functionality.

Health Breakdown

Maintenance 6/25

Recency, release consistency, active ratio

Quality 16/25

Yanked ratio, deps, size, maturity, features

Community 8/20

Reverse deps, ownership, ecosystem

Popularity 5/15

Downloads, momentum, growth trend

Documentation 12/15

Docs, repo, license, metadata
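
The five category scores can be added up to check the overall figure. A small Rust sketch, assuming the overall score is a plain unweighted sum of the categories (the page does not state the formula); `BREAKDOWN` and `totals` are hypothetical names used only for illustration:

```rust
/// Each row: (category, points earned, points available), copied from the breakdown.
const BREAKDOWN: &[(&str, u32, u32)] = &[
    ("Maintenance", 6, 25),
    ("Quality", 16, 25),
    ("Community", 8, 20),
    ("Popularity", 5, 15),
    ("Documentation", 12, 15),
];

/// Sum earned and available points across all categories.
fn totals(rows: &[(&str, u32, u32)]) -> (u32, u32) {
    rows.iter()
        .fold((0, 0), |(earned, max), &(_, e, m)| (earned + e, max + m))
}

fn main() {
    let (earned, available) = totals(BREAKDOWN);
    println!("{earned}/{available}"); // prints "47/100"
}
```

The sum matches the 47 shown at the top of the page, which is consistent with an unweighted total out of 100.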

Download Trend

Daily downloads · last 90 days
181/day avg, +107%
[Chart: daily downloads, 12/29 to 3/28; y-axis 0 to 500]

Top Dependents

Most downloaded crates that depend on kitoken

Version Adoption

v0.10.1
98%
v0.10.0
2%

Release Timeline

2 releases since 2024
2024: 2 releases

Feature Flags

default = ["std", "serialization", "normalization", "convert", "regex-perf", "multiversion"]

All flags (* = enabled by default):

  • all
  • std*
  • split
  • convert*
  • unstable
  • regex-onig
  • regex-perf*
  • multiversion*
  • normalization*
  • regex-unicode
  • serialization*
  • convert-detect
  • convert-tekken
  • convert-tiktoken
  • convert-tokenizers
  • split-unicode-script
  • convert-sentencepiece
  • normalization-unicode
  • normalization-charsmap
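
To see which flags must be enabled explicitly, the default set can be subtracted from the full flag list. A rough Rust sketch, with both lists transcribed from this page; it assumes `all` is itself a feature flag and not a page filter, and `opt_in_flags` is a hypothetical helper, not part of kitoken:

```rust
/// Flags enabled by default, as listed in `default = [...]`.
const DEFAULT: &[&str] = &[
    "std", "serialization", "normalization", "convert", "regex-perf", "multiversion",
];

/// Every flag shown on the page (assumption: `all` is a real flag).
const ALL_FLAGS: &[&str] = &[
    "all", "std", "split", "convert", "unstable", "regex-onig", "regex-perf",
    "multiversion", "normalization", "regex-unicode", "serialization",
    "convert-detect", "convert-tekken", "convert-tiktoken", "convert-tokenizers",
    "split-unicode-script", "convert-sentencepiece", "normalization-unicode",
    "normalization-charsmap",
];

/// Flags that are not part of the default set and so need explicit opt-in.
fn opt_in_flags<'a>(all: &[&'a str], default: &[&'a str]) -> Vec<&'a str> {
    all.iter().copied().filter(|f| !default.contains(f)).collect()
}

fn main() {
    for flag in opt_in_flags(ALL_FLAGS, DEFAULT) {
        println!("{flag}");
    }
}
```

Whether a given combination of flags builds cleanly is not stated on this page; the crate's own manifest is the authority there.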


Maintainers

Dependencies: 23 direct dependencies
Dependents: 7 crates depend on kitoken

Similar Crates