rustio rustio.net
64

tokenizers

v0.23.1 Growing

Provides an implementation of today's most used tokenizers, with a focus on performances and versatility.

Apache-2.0 Edition 2018
#tokenizer#nlp#wordpiece#huggingface#bpe

Quick Verdict

  • βœ“Actively maintained (updated 28d ago)
  • !Pre-1.0: API may have breaking changes
  • βœ“Massive adoption (5.5K crates depend on it)
  • !Heavy dependency tree (33 direct deps)
  • βœ“Permissive license (Apache-2.0)

Security

Checking security advisories...
Downloads
17.0M
Dependents
5.5K
Releases
40
Size
196KB

Deep Insights

πŸ“ˆ
Strong growth momentum

2.6M downloads in the last 30 days (85.1K/day), up 38% from the previous period.

πŸ”—
Widely adopted

5.5K crates depend on tokenizers. Strong ecosystem adoption means battle-tested code and long-term stability.

πŸ”¬
Pre-1.0 for over a year

Despite being 6+ years old, tokenizers hasn't reached 1.0 yet. Expect potential API changes between versions.

πŸ“¦
Heavy dependency tree

33 direct dependencies. Consider the impact on compile times and supply chain complexity.

🌟
Used by top crates

Notable dependents include candle-core, text-splitter, fastembed, xgrammar, toktrie_hf_tokenizers. When high-quality crates choose tokenizers, it's a strong quality signal.

Health Breakdown

Maintenance 16/25

Recency, release consistency, active ratio

Quality 12/25

Yanked ratio, deps, size, maturity, features

Community 16/20

Reverse deps, ownership, ecosystem

Popularity 8/15

Downloads, momentum, growth trend

Documentation 12/15

Docs, repo, license, metadata

Download Trend

Daily downloads Β· last 90 days
72K/day avg+25%
050K100K2/263/164/34/215/95/26

Top Dependents

Version Adoption

v0.21.4
37%
v0.22.2
33%
v0.21.2
13%
v0.21.1
9%
v0.22.1
8%

Release Timeline

10 releasessince 2024
J
F
M
A
M
J
J
A
S
O
N
D
2024
2
2025
7
2026
1
Less
More

Feature Flags

default =["progressbar", "onig", "esaxx_fast"]

httpesaxx_fast*rustls-tlsprogressbar*unstable_wasm

README

Loading README...

Maintainers

Dependencies
33
direct dependencies
Dependents
5.5K
crates depend on tokenizers

Similar Crates