Decision Workspace
bytepiece_rs vs tokenizers vs text-splitter
Side-by-side comparison of Rust crates
36
bytepiece_rs
growingv0.2.2
The Bytepiece Tokenizer Implemented in Rust
60
tokenizers
growingv0.22.2
Provides an implementation of today's most used tokenizers, with a focus on performances and versatility.
59
text-splitter
growingv0.29.3
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
Core Metrics
| bytepiece_rs | tokenizers | text-splitter | |
|---|---|---|---|
| Health Score | 36 | 60 | 59 |
| Total Downloads | 11.4K | 12.7M | 1.1M |
| 30d Downloads | 27 | 1.9M | 110.8K |
| Dependents | 2 | 3.6K | 654 |
| Releases | 7 | 39 | 60 |
| Last Updated | 866d ago | 115d ago | 87d ago |
| Age | 2y 6m | 6y 7m | 2y 10m |
Health Breakdown
bytepiece_rs
Maintenance
4
Quality
11
Community
6
Popularity
5
Documentation
10
tokenizers
Maintenance
12
Quality
12
Community
16
Popularity
8
Documentation
12
text-splitter
Maintenance
14
Quality
13
Community
13
Popularity
7
Documentation
12
Technical Details
| bytepiece_rs | tokenizers | text-splitter | |
|---|---|---|---|
| Version | 0.2.2 | 0.22.2 | 0.29.3 |
| Stable (≥1.0) | ✗ No | ✗ No | ✗ No |
| License | MIT | Apache-2.0 | MIT |
| Dependencies | 12 | 33 | 21 |
| Crate Size | 1.2MB | 186KB | 59KB |
| Features | 0 | 6 | 4 |
| Yanked % | 0.0% | 2.6% | 1.7% |
| Edition | 2021 | 2018 | 2021 |
| MSRV | — | — | 1.83.0 |
| Owners | 1 | 4 | 1 |
Links
Quick Verdict
- •tokenizers leads with a health score of 60/100, but none of the options score above 80.
- •tokenizers is depended on by 3.6K crates — strongest ecosystem trust.
- •⚠ bytepiece_rs has not been updated in over a year.