- Notifications
You must be signed in to change notification settings - Fork0
Japanese Morphological Analysis written in Rust
License
Leko/goya
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Goya is a Japanese Morphological Analyzer written in Rust.
The main goal is to compile to WebAssembly for morphological analysis in browsers and other JavaScript runtimes. In addition, it can be used with the CLI and Rust.
Try Goya playground. It uses the Goya-wasm from WebWorker.
Download the latest IPA dictionary fromthe official Mecab website and unzip it.
cargo install goya-cli
Compile the IPA dictionary to generate a binary dictionary for morphological analysis. It may take a few minutes.
goya compile /path/to/ipadic
The binary dictionary will be generated in the~/.goya
directory by default. You can change the destination with the--dicdir
option.
goya --dicdir=/path/to/generated compile /path/to/ipadic
Goya takes input from STDIN. The easiest way is using the echo command and pipe it to the Goya.
$ echo すもももももももものうち | goyaすもも名詞,一般,*,*,*,*,すもも,スモモ,スモモも助詞,係助詞,*,*,*,*,も,モ,モもも名詞,一般,*,*,*,*,もも,モモ,モモも助詞,係助詞,*,*,*,*,も,モ,モもも名詞,一般,*,*,*,*,もも,モモ,モモの助詞,連体化,*,*,*,*,の,ノ,ノうち名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチEOS
If you specified the--dicdir
option when compiling the dictionary, you should also specify it when running the goya command.
echo すもももももももものうち | goya --dicdir=/path/to/generated
cargo release <patch|minor|major> --workspace --no-tag --skip-publish --dependent-version Upgradegit tag v{{VERSION}}git push origin v{{VERSION}}
About
Japanese Morphological Analysis written in Rust
Topics
Resources
License
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Uh oh!
There was an error while loading.Please reload this page.