Decision Workspace
bytepiece_rs vs tokenizers vs text-splitter
Side-by-side comparison of Rust crates
36
bytepiece_rs
growingv0.2.2
The Bytepiece Tokenizer Implemented in Rust
64
tokenizers
growingv0.23.1
Provides an implementation of today's most used tokenizers, with a focus on performances and versatility.
62
text-splitter
growingv0.30.1
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
Core Metrics
| bytepiece_rs | tokenizers | text-splitter | |
|---|---|---|---|
| Health Score | 36 | 64 | 62 |
| Total Downloads | 11.4K | 17.0M | 1.4M |
| 30d Downloads | 0 | 2.5M | 150.5K |
| Dependents | 2 | 5.5K | 782 |
| Releases | 7 | 40 | 62 |
| Last Updated | 925d ago | 28d ago | 38d ago |
| Age | 2y 8m | 6y 9m | 3y |
Health Breakdown
bytepiece_rs
Maintenance
4
Quality
11
Community
6
Popularity
5
Documentation
10
tokenizers
Maintenance
16
Quality
12
Community
16
Popularity
8
Documentation
12
text-splitter
Maintenance
17
Quality
13
Community
13
Popularity
7
Documentation
12
Technical Details
| bytepiece_rs | tokenizers | text-splitter | |
|---|---|---|---|
| Version | 0.2.2 | 0.23.1 | 0.30.1 |
| Stable (≥1.0) | ✗ No | ✗ No | ✗ No |
| License | MIT | Apache-2.0 | MIT |
| Dependencies | 12 | 33 | 21 |
| Crate Size | 1.2MB | 196KB | 60KB |
| Features | 0 | 6 | 4 |
| Yanked % | 0.0% | 2.5% | 1.6% |
| Edition | 2021 | 2018 | 2021 |
| MSRV | — | — | 1.86.0 |
| Owners | 1 | 4 | 1 |
Links
Quick Verdict
- •tokenizers leads with a health score of 64/100, but none of the options score above 80.
- •tokenizers is depended on by 5.5K crates — strongest ecosystem trust.
- •⚠ bytepiece_rs has not been updated in over a year.