Health score: 58/100
kitoken
v0.11.0 · Growing
Fast tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
BSD-2-Clause Edition 2024 MSRV 1.86.0
Categories: Algorithms, Text processing, Parser implementations, No standard library, WebAssembly
Keywords: #tokenizer #nlp #wordpiece #bpe #unigram
Quick Verdict
- ✓ Actively maintained (updated 15 days ago)
- ! Pre-1.0: API may have breaking changes
- ! Heavy dependency tree (24 direct deps)
Downloads: 38.7K
Dependents: 9
Releases: 3
Size: 64 KB
Deep Insights
Download decline
567 downloads in the last 30 days, down 46% from the previous period. May indicate migration to alternatives.
Pre-1.0 for over a year
Despite being 1+ years old, kitoken hasn't reached 1.0 yet. Expect potential API changes between versions.
Heavy dependency tree
24 direct dependencies. Consider the impact on compile times and supply chain complexity.
Health Breakdown
Maintenance 16/25
Recency, release consistency, active ratio
Quality 16/25
Yanked ratio, deps, size, maturity, features
Community 9/20
Reverse deps, ownership, ecosystem
Popularity 5/15
Downloads, momentum, growth trend
Documentation 12/15
Docs, repo, license, metadata
Download Trend
Daily downloads · last 90 days
0/day avg
Version Adoption
v0.10.1: 98%
v0.10.0: 2%
v0.11.0: 1%
Release Timeline
3 releases since 2024
Releases per year: 2024: 2, 2026: 1 (no count shown for 2025)
Feature Flags
default = ["std", "serialization", "normalization", "convert", "regex-perf", "multiversion"]
All features (* = enabled by default): all, std*, web, split, convert*, unstable, regex-onig, regex-perf*, multiversion*, normalization*, regex-unicode, serialization*, convert-detect, convert-tekken, convert-tiktoken, convert-tokenizers, split-unicode-script, convert-sentencepiece, normalization-unicode, normalization-charsmap
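Since the page flags a heavy dependency tree, one way to trim it is to opt out of the default feature set and enable only what a project needs. The fragment below is a minimal sketch of a `Cargo.toml` entry using Cargo's standard `default-features` and `features` keys. The feature names are taken from the list above, but the particular selection is an assumption for illustration. The page does not show how the features depend on one another, so this combination may need adjusting before it builds.

```toml
[dependencies]
# Illustrative selection only (an assumption, not a recommended configuration):
# drop the default set, then re-enable std, serialization, and tiktoken conversion.
kitoken = { version = "0.11.0", default-features = false, features = ["std", "serialization", "convert-tiktoken"] }
```

Running `cargo tree` after such a change shows which dependencies remain in the build.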
Dependencies: 24 direct dependencies
Dependents: 9 crates depend on kitoken