Text Processing
yeslogic-fontconfig-sys
6.0.1 StableRaw bindings to Fontconfig without a vendored C library
daachorse
3.0.0 StableDaachorse: Double-Array Aho-Corasick
calamine
0.35.0 GrowingAn Excel/OpenDocument Spreadsheet reader and deserializer in pure Rust
regex
1.12.3 StableAn implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
dissimilar
1.0.11 StableDiff library with semantic cleanup, based on Google's diff-match-patch
html-to-markdown-rs
3.5.1 GrowingHigh-performance HTML to Markdown converter using the astral-tl parser. Part of the Kreuzberg ecosystem.
lindera
3.0.7 StableA morphological analysis library.
bstr
1.12.1 StableA string type that is not required to be valid UTF-8.
scraper
0.27.0 GrowingHTML parsing and querying with CSS selectors
lindera-dictionary
3.0.7 StableA morphological dictionary library.
varcon-core
5.0.7 StableVarcon-relevant data structures
spider_agent_html
2.51.197 ExperimentalHTML processing utilities for spider_agent — cleaning, content analysis, and diffing.
kreuzberg-tesseract
4.9.8 GrowingRust bindings for Tesseract OCR with cross-compilation, C++17, and caching improvements
arborium-theme
2.17.0 ExperimentalTheme support for arborium syntax highlighting
arborium-html
2.17.0 ExperimentalHTML grammar for arborium (tree-sitter bindings)
lazy-regex
3.6.0 Stablelazy static regular expressions checked at compile time
comrak
0.52.0 GrowingA 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter
fancy-regex
0.18.0 GrowingAn implementation of regexes, supporting a relatively rich set of features, including backreferences and look-around. Aims to be compatible with Oniguruma syntax when the relevant flag is set.
allsorts
0.17.0 GrowingFont parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
oak-highlight
0.0.11 ExperimentalA lightweight syntax highlighter for Rust with support for multiple programming languages and customizable themes.
const_format
0.2.36 GrowingCompile-time string formatting
chumsky
0.13.0 GrowingA parser library for humans with powerful error recovery
arborium-javascript
2.17.0 ExperimentalJavaScript grammar for arborium (tree-sitter bindings)
dom_query
0.28.0 GrowingHTML querying and manipulation with CSS selectors
skrifa
0.42.1 GrowingMetadata reader and glyph scaler for OpenType fonts.
read-fonts
0.39.2 GrowingReading OpenType font files.
fontconfig
0.10.2 GrowingSafe, higher-level wrapper around the Fontconfig library
lindera-ipadic
3.0.7 StableA Japanese morphological dictionary for IPADIC.
uutils_term_grid
0.8.0 GrowingLibrary for formatting strings into a grid layout. Fork of term_grid.
grok
2.4.1 StableA Rust implementation of the popular Java & Ruby grok library which allows easy text and log file processing with composable patterns.
oak-pretty-print
0.0.11 ExperimentalSyntax highlighter supporting multiple programming languages.
stringzilla
4.6.1 StableSearch, hash, sort, fingerprint, and fuzzy-match strings faster via SWAR, SIMD, and GPGPU
html2text
0.17.1 GrowingRender HTML as plain text.
kreuzberg-paddle-ocr
4.9.8 ExperimentalPaddleOCR via ONNX Runtime for Kreuzberg - high-performance text recognition
arborium-css
2.17.0 ExperimentalCSS grammar for arborium (tree-sitter bindings)
arborium-vue
2.17.0 ExperimentalVue grammar for arborium (tree-sitter bindings)
arborium-scss
2.17.0 ExperimentalSCSS grammar for arborium (tree-sitter bindings)
cruet
1.0.0 StableAdds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize are supported as both traits and pure functions acting on String types.
topiary-queries
0.7.3 Growingtree-sitter query files compatible with Topiary
font-types
0.11.3 GrowingScalar types used in fonts.
chardetng
1.0.0 StableA character encoding detector for legacy Web content
diffy
0.5.0 GrowingTools for finding and manipulating differences between files
toml-test-data
2.10.0 StableTOML test cases
toml-test-harness
1.10.0 StableCargo test harness for verifying TOML parsers
arborium-lua
2.17.0 ExperimentalLua grammar for arborium (tree-sitter bindings)
arborium-c
2.17.0 ExperimentalC grammar for arborium (tree-sitter bindings)
arborium-xml
2.17.0 ExperimentalXML grammar for arborium (tree-sitter bindings)
arborium-json
2.17.0 ExperimentalJSON grammar for arborium (tree-sitter bindings)
arborium-toml
2.17.0 ExperimentalTOML grammar for arborium (tree-sitter bindings)
arborium-starlark
2.17.0 ExperimentalStarlark grammar for arborium (tree-sitter bindings)