Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Japanese Morphological Analyzer written in Rust

License

NotificationsYou must be signed in to change notification settings

togatoga/kanpyo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

70 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crates.io

Kanpyo is Japanese morphological analyzer written in Rust inspired byikawaha/Kagome.

Caution

This is a work in progress. I would break the API without notice.

Installation

You can installkanpyo viacargo orgit(development version).

cargo install kanpyo

or

cargo install --git https://github.com/togatoga/kanpyo kanpyo

You need a dictionary to usekanpyo and can build and install a dictionary by the following.

cd kanpyo-dicttar xvf resource/mecab-ipadic-2.7.0-20070801.tar.gz -C resourcecargo run --release --bin ipa-dict-builder -- --dict resource/mecab-ipadic-2.7.0-20070801

A dictionary is installed in the following directory:

  • Linux
    • $HOME/.config/kanpyo/
  • macOS
    • $HOME/Library/Application Support/kanpyo

You're ready to usekanpyo!

Usage

kanpyo --helpJapanese Morphological AnalyzerUsage: kanpyo [COMMAND]Commands:  tokenize  Tokenize input text  graphviz  Output latticein Graphviz formathelp      Print this message or thehelp of the given subcommand(s)Options:  -h, --help     Printhelp  -V, --version  Print version

Tokenize

kanpyo tokenize"すもももももももものうち"          すもも  名詞,一般,*,*,*,*,すもも,スモモ,スモモも      助詞,係助詞,*,*,*,*,も,モ,モもも    名詞,一般,*,*,*,*,もも,モモ,モモも      助詞,係助詞,*,*,*,*,も,モ,モもも    名詞,一般,*,*,*,*,もも,モモ,モモの      助詞,連体化,*,*,*,*,の,ノ,ノうち    名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチEOS

REPL mode

kanpyo自然言語処理自然    名詞,形容動詞語幹,*,*,*,*,自然,シゼン,シゼン言語    名詞,一般,*,*,*,*,言語,ゲンゴ,ゲンゴ処理    名詞,サ変接続,*,*,*,*,処理,ショリ,ショリEOS形態素解析形態素  名詞,一般,*,*,*,*,形態素,ケイタイソ,ケイタイソ解析    名詞,サ変接続,*,*,*,*,解析,カイセキ,カイセキEOS

From piped standard input

echo"自然言語処理"| kanpyo自然    名詞,形容動詞語幹,*,*,*,*,自然,シゼン,シゼン言語    名詞,一般,*,*,*,*,言語,ゲンゴ,ゲンゴ処理    名詞,サ変接続,*,*,*,*,処理,ショリ,ショリEOS

Graphviz

Print lattice in Graphviz format for debugging.

kanpyo graphviz"自然言語処理"| dot -Tpng -o lattice.png

lattice

TODO

  • Support various dictionaries(Sudachi, UniDic, neologd, etc.)
  • Support server mode
  • Support search mode
  • Tests for load dictionary and tokenize

About

Japanese Morphological Analyzer written in Rust

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors2

  •  
  •  

Languages


[8]ページ先頭

©2009-2025 Movatter.jp