🎤 vibrato: Viterbi-based accelerated tokenizer
-
Updated
Feb 21, 2025 - Rust
🎤 vibrato: Viterbi-based accelerated tokenizer
Sudachi in Rust 🦀 and new generation of SudachiPy
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
Japanese Morphological Analysis written in Rust
Viterbi-based accelerated tokenizer (Python wrapper)
🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.
Rust bindings for the Voikko library
A compiled mecab-ipadic-neologd dictionary for vibrato
Japanese morph analyzer
morphological box - create all combinations
Morphologically biased byte-pair encoding pre-tokenization
Add a description, image, and links to the morphological-analysis topic page so that developers can more easily learn about it.
To associate your repository with the morphological-analysis topic, visit your repo's landing page and select "manage topics."