rustio rustio.net

Text Processing

75

yeslogic-fontconfig-sys

6.0.1 Stable

Raw bindings to Fontconfig without a vendored C library

9.2M downloads · 26d ago

74

daachorse

3.0.0 Stable

Daachorse: Double-Array Aho-Corasick

1.2M downloads · 20d ago

73

calamine

0.35.0 Growing

An Excel/OpenDocument Spreadsheet reader and deserializer in pure Rust

8.0M downloads · 15d ago

73

regex

1.12.3 Stable

An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.

854.4M downloads · 3mo ago

72

dissimilar

1.0.11 Stable

Diff library with semantic cleanup, based on Google's diff-match-patch

44.4M downloads · 2mo ago

72

html-to-markdown-rs

3.5.1 Growing

High-performance HTML to Markdown converter using the astral-tl parser. Part of the Kreuzberg ecosystem.

416.7K downloads · today

71

lindera

3.0.7 Stable

A morphological analysis library.

1.2M downloads · 1mo ago

70

bstr

1.12.1 Stable

A string type that is not required to be valid UTF-8.

307.0M downloads · 7mo ago

70

scraper

0.27.0 Growing

HTML parsing and querying with CSS selectors

18.6M downloads · 14d ago

70

lindera-dictionary

3.0.7 Stable

A morphological dictionary library.

1.5M downloads · 1mo ago

70

varcon-core

5.0.7 Stable

Varcon-relevant data structures

459.0K downloads · 1mo ago

70

spider_agent_html

2.51.197 Experimental

HTML processing utilities for spider_agent — cleaning, content analysis, and diffing.

10.0K downloads · 4d ago

70

kreuzberg-tesseract

4.9.8 Growing

Rust bindings for Tesseract OCR with cross-compilation, C++17, and caching improvements

23.3K downloads · today

69

arborium-theme

2.17.0 Experimental

Theme support for arborium syntax highlighting

78.7K downloads · 16d ago

69

arborium-html

2.17.0 Experimental

HTML grammar for arborium (tree-sitter bindings)

50.7K downloads · 16d ago

69

lazy-regex

3.6.0 Stable

Lazy static regular expressions checked at compile time

32.2M downloads · 3mo ago

69

comrak

0.52.0 Growing

A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter

5.2M downloads · 1mo ago

69

fancy-regex

0.18.0 Growing

An implementation of regexes, supporting a relatively rich set of features, including backreferences and look-around. Aims to be compatible with Oniguruma syntax when the relevant flag is set.

150.9M downloads · 1mo ago

69

allsorts

0.17.0 Growing

Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2

456.9K downloads · 13d ago

69

oak-highlight

0.0.11 Experimental

A lightweight syntax highlighter for Rust with support for multiple programming languages and customizable themes.

6.5K downloads · 1mo ago

69

const_format

0.2.36 Growing

Compile-time string formatting

108.3M downloads · 1mo ago

69

chumsky

0.13.0 Growing

A parser library for humans with powerful error recovery

21.1M downloads · 20d ago

68

arborium-javascript

2.17.0 Experimental

JavaScript grammar for arborium (tree-sitter bindings)

51.2K downloads · 16d ago

68

dom_query

0.28.0 Growing

HTML querying and manipulation with CSS selectors

1.8M downloads · 8d ago

68

skrifa

0.42.1 Growing

Metadata reader and glyph scaler for OpenType fonts.

10.7M downloads · 1mo ago

68

read-fonts

0.39.2 Growing

Reading OpenType font files.

11.7M downloads · 1mo ago

68

fontconfig

0.10.2 Growing

Safe, higher-level wrapper around the Fontconfig library

268.3K downloads · 7d ago

68

lindera-ipadic

3.0.7 Stable

A Japanese morphological dictionary for IPADIC.

1.2M downloads · 1mo ago

68

uutils_term_grid

0.8.0 Growing

Library for formatting strings into a grid layout. Fork of term_grid.

2.3M downloads · 2mo ago

68

grok

2.4.1 Stable

A Rust implementation of the popular Java and Ruby grok library, which allows easy text and log file processing with composable patterns.

8.3M downloads · 2mo ago

68

oak-pretty-print

0.0.11 Experimental

Syntax highlighter supporting multiple programming languages.

7.0K downloads · 1mo ago

68

stringzilla

4.6.1 Stable

Search, hash, sort, fingerprint, and fuzzy-match strings faster via SWAR, SIMD, and GPGPU

94.4K downloads · 20d ago

68

html2text

0.17.1 Growing

Render HTML as plain text.

3.9M downloads · 1mo ago

68

kreuzberg-paddle-ocr

4.9.8 Experimental

PaddleOCR via ONNX Runtime for Kreuzberg - high-performance text recognition

5.0K downloads · today

67

arborium-css

2.17.0 Experimental

CSS grammar for arborium (tree-sitter bindings)

50.3K downloads · 16d ago

67

arborium-vue

2.17.0 Experimental

Vue grammar for arborium (tree-sitter bindings)

45.7K downloads · 16d ago

67

arborium-scss

2.17.0 Experimental

SCSS grammar for arborium (tree-sitter bindings)

46.3K downloads · 16d ago

67

cruet

1.0.0 Stable

Adds String-based inflections for Rust. Snake, kebab, camel, sentence, class, title, and table cases, as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize, are supported as both traits and pure functions acting on String types.

2.4M downloads · 1mo ago

67

topiary-queries

0.7.3 Growing

tree-sitter query files compatible with Topiary

58.7K downloads · 4mo ago

67

font-types

0.11.3 Growing

Scalar types used in fonts.

10.9M downloads · 1mo ago

67

chardetng

1.0.0 Stable

A character encoding detector for legacy Web content

7.2M downloads · 1mo ago

67

diffy

0.5.0 Growing

Tools for finding and manipulating differences between files

11.3M downloads · 28d ago

67

toml-test-data

2.10.0 Stable

TOML test cases

196.4K downloads · today

67

toml-test-harness

1.10.0 Stable

Cargo test harness for verifying TOML parsers

180.9K downloads · today

66

arborium-lua

2.17.0 Experimental

Lua grammar for arborium (tree-sitter bindings)

45.7K downloads · 16d ago

66

arborium-c

2.17.0 Experimental

C grammar for arborium (tree-sitter bindings)

47.5K downloads · 16d ago

66

arborium-xml

2.17.0 Experimental

XML grammar for arborium (tree-sitter bindings)

52.4K downloads · 16d ago

66

arborium-json

2.17.0 Experimental

JSON grammar for arborium (tree-sitter bindings)

56.2K downloads · 16d ago

66

arborium-toml

2.17.0 Experimental

TOML grammar for arborium (tree-sitter bindings)

64.7K downloads · 16d ago

66

arborium-starlark

2.17.0 Experimental

Starlark grammar for arborium (tree-sitter bindings)

42.0K downloads · 16d ago