rustio rustio.net

Text Processing

76

regex

1.12.3 Stable

An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.

742.0M downloads · 1mo ago
75

dissimilar

1.0.11 Stable

Diff library with semantic cleanup, based on Google's diff-match-patch

37.9M downloads · 12d ago
73

lindera-dictionary

2.3.4 Stable

A morphological dictionary library.

1.2M downloads · 4d ago
73

lindera

2.3.4 Stable

A morphological analysis library.

985.0K downloads · 4d ago
73

calamine

0.34.0 Growing

An Excel/OpenDocument Spreadsheet reader and deserializer in pure Rust

6.4M downloads · 20d ago
71

bstr

1.12.1 Stable

A string type that is not required to be valid UTF-8.

269.3M downloads · 5mo ago
71

lindera-ipadic

2.3.4 Stable

A Japanese morphological dictionary for IPADIC.

1.0M downloads · 4d ago
71

lazy-regex

3.6.0 Stable

lazy static regular expressions checked at compile time

28.1M downloads · 1mo ago
71

comrak

0.51.0 Growing

A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter

3.9M downloads · 17d ago
71

grok

2.4.1 Stable

A Rust implementation of the popular Java & Ruby grok library which allows easy text and log file processing with composable patterns.

8.1M downloads · 8d ago
70

oak-highlight

0.0.10 Experimental

A lightweight syntax highlighter for Rust with support for multiple programming languages and customizable themes.

4.6K downloads · 4d ago
70

uutils_term_grid

0.8.0 Growing

Library for formatting strings into a grid layout. Fork of term_grid.

1.9M downloads · 18d ago
70

read-fonts

0.38.0 Growing

Reading OpenType font files.

8.4M downloads · 7d ago
70

font-types

0.11.1 Growing

Scalar types used in fonts.

7.7M downloads · 7d ago
69

oak-pretty-print

0.0.10 Experimental

Syntax highlighter supporting multiple programming languages.

4.9K downloads · 4d ago
69

topiary-queries

0.7.3 Growing

tree-sitter query files compatible with Topiary

34.4K downloads · 2mo ago
69

unicase

2.9.0 Stable

A case-insensitive wrapper around strings.

245.5M downloads · 2mo ago
69

kreuzberg-tesseract

4.6.3 Experimental

Rust bindings for Tesseract OCR with cross-compilation, C++17, and caching improvements

14.2K downloads · yesterday
69

lingua-japanese-language-model

1.3.0 Stable

The Japanese language model for Lingua, an accurate natural language detection library

925.1K downloads · 18d ago
69

html-to-markdown-rs

2.30.0 Experimental

High-performance HTML to Markdown converter using the astral-tl parser. Part of the Kreuzberg ecosystem.

143.5K downloads · today
69

skrifa

0.41.0 Growing

Metadata reader and glyph scaler for OpenType fonts.

7.5M downloads · 7d ago
68

toml-test-data

2.5.0 Stable

TOML test cases

144.9K downloads · 4d ago
68

lindera-cli

2.3.4 Stable

A morphological analysis CLI.

110.0K downloads · 4d ago
68

logos

0.16.1 Growing

Create ridiculously fast Lexers

41.7M downloads · 1mo ago
68

rphonetic

3.0.6 Stable

Rust port of phonetic Apache commons-codec algorithms

84.9K downloads · 16d ago
68

asimov-prompt

25.1.0 Growing

ASIMOV Software Development Kit (SDK) for Rust

11.9K downloads · 1mo ago
68

indoc

2.0.7 Growing

Indented document literals

212.2M downloads · 5mo ago
68

varcon-core

5.0.6 Stable

Varcon-relevant data structures

398.9K downloads · 1mo ago
68

lindera-tantivy

2.0.0 Stable

Lindera Tokenizer for Tantivy.

173.6K downloads · 2mo ago
68

lingua-marathi-language-model

1.3.0 Stable

The Marathi language model for Lingua, an accurate natural language detection library

864.6K downloads · 18d ago
68

lingua-swahili-language-model

1.3.0 Stable

The Swahili language model for Lingua, an accurate natural language detection library

864.6K downloads · 18d ago
68

lingua-bengali-language-model

1.3.0 Stable

The Bengali language model for Lingua, an accurate natural language detection library

864.8K downloads · 18d ago
68

lingua-hindi-language-model

1.3.0 Stable

The Hindi language model for Lingua, an accurate natural language detection library

883.9K downloads · 18d ago
68

lingua-korean-language-model

1.3.0 Stable

The Korean language model for Lingua, an accurate natural language detection library

905.9K downloads · 18d ago
68

lingua-chinese-language-model

1.3.0 Stable

The Chinese language model for Lingua, an accurate natural language detection library

937.1K downloads · 18d ago
68

lingua-gujarati-language-model

1.3.0 Stable

The Gujarati language model for Lingua, an accurate natural language detection library

865.8K downloads · 18d ago
68

lingua-tamil-language-model

1.3.0 Stable

The Tamil language model for Lingua, an accurate natural language detection library

866.2K downloads · 18d ago
68

lingua-sotho-language-model

1.3.0 Stable

The Sotho language model for Lingua, an accurate natural language detection library

844.3K downloads · 18d ago
68

lingua-telugu-language-model

1.3.0 Stable

The Telugu language model for Lingua, an accurate natural language detection library

865.2K downloads · 18d ago
68

lingua-punjabi-language-model

1.3.0 Stable

The Punjabi language model for Lingua, an accurate natural language detection library

865.4K downloads · 18d ago
68

lingua-tsonga-language-model

1.3.0 Stable

The Tsonga language model for Lingua, an accurate natural language detection library

843.6K downloads · 18d ago
68

lingua-tswana-language-model

1.3.0 Stable

The Tswana language model for Lingua, an accurate natural language detection library

844.3K downloads · 18d ago
68

arborium-theme

2.16.0 Experimental

Theme support for arborium syntax highlighting

35.5K downloads · 16d ago
68

arborium-html

2.16.0 Experimental

HTML grammar for arborium (tree-sitter bindings)

19.8K downloads · 16d ago
67

lindera-cc-cedict

2.3.4 Stable

A Chinese morphological dictionary for CC-CEDICT.

767.3K downloads · 4d ago
67

lindera-ko-dic

2.3.4 Stable

A Korean morphological dictionary for ko-dic.

956.2K downloads · 4d ago
67

lindera-unidic

2.3.4 Stable

A Japanese morphological dictionary for UniDic.

747.2K downloads · 4d ago
67

lindera-ipadic-neologd

2.3.4 Stable

A Japanese morphological dictionary for IPADIC NEologd.

612.1K downloads · 4d ago
67

asimov-patterns

25.1.0 Growing

ASIMOV Software Development Kit (SDK) for Rust

11.7K downloads · 1mo ago
67

ammonia

4.1.2 Growing

HTML Sanitization

10.2M downloads · 6mo ago