cl-tohoku/showcasePublic

NotificationsYou must be signed in to change notification settings
Fork1
Star6

A PyTorch implementation of the Japanese Predicate-Argument Structure (PAS) analyser presented in the paper of Matsubayashi & Inui (2018) with some improvements.

License

MIT license

6 stars 1 fork Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
docs		docs
showcase		showcase
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
setup.py		setup.py

Repository files navigation

Showcase: Japanese Predicate-Argument Structure (PAS) analyzer

Showcase is a Pytorch implementation of the Japanese Predicate-Argument Structure (PAS) analyser presented in the paper of Matsubayashi & Inui (2018) with some improvements.Given a input sentence, Showcase identifies verbal and nominal predicates in the sentence and detects their nominative (が), accusative (を), and dative (に) case arguments.The output case labels are based on the label definition of the NAIST Text Corpus where case markers in different voices are generalized into the case markers of an active voice.

Demo

http://www.cl.ecei.tohoku.ac.jp/showcase/

Usage

echo '今日は雨が降る' | showcase

cat example.txt | showcase

Input file format

One raw sentence per line.
A blank line can be used to segment a document. (Showcase just resets an argument index to zero.)

Requirements

Python 3.5 (or higher)
- We do not support Python 2
CaboCha with JUMAN dict
PyTorch 0.4.0

Instllation

Step 1. Install Showcase

pip install showcase-parser

Step 2: Download Resources

Resources include following files:

10 Model files for predicate detector (pred_model_0{0..9}.h5)
10 Model files for argument detector (arg_model_0{0..9}.h5)
Word embedding Matrix (word_embedding.npz)
POS embedding Matrix (pos_embedding.npz)
Word index file (word.index)
Part-of-Speech tag index file (pos.index)

Resources are all available atGoogle Drive.

train/*.h5: models trained with the training set described in the paper.
train-test/*.h5: models trained with the training and test sets.

Step 3: Create and edit config.json

Runshowcase setup to createconfig.json file in$HOME/.config/showcase.

Then editconfig.json and specify valid paths for:

Resources downloaded in Step 2
CaboCha and its JUMAN dictionary

Originalconfig.json can be found atshowcase/data/config.json of this repo.

You may specify path toconfig.json as follows:

showcase -c /path/to/config/config.json

Note that the apporopriate thresholds (hyperparameters) for arguments differ for each model.The thresholds for the provided models are described in the sample config file in each Google Drive directory.

(Re-)training

TBA

Step1: Train word2vec

TBA

Step2: Train model

TBA

Step3: Convert word2vec

runget_vocab_from_word2vec.py andconvert_word2vec_to_npy.py

Citation

@InProceedings{matsubayashi:2018:coling,  author    = {Matsubayashi, Yuichiroh and Inui, Kentaro},  title     = {Distance-Free Modeling of Multi-Predicate Interactions in End-to-End Japanese Predicate Argument Structure Analysis},  booktitle = {Proceedings of the 27th International Conference on Computational Linguistics (COLING)},  year      = {2018},}

Contributor

About

A PyTorch implementation of the Japanese Predicate-Argument Structure (PAS) analyser presented in the paper of Matsubayashi & Inui (2018) with some improvements.

Languages

Python100.0%

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Showcase: Japanese Predicate-Argument Structure (PAS) analyzer

Demo

Usage

Input file format

Requirements

Instllation

Step 1. Install Showcase

Step 2: Download Resources

Step 3: Create and edit config.json

(Re-)training

Step1: Train word2vec

Step2: Train model

Step3: Convert word2vec

Citation

Contributor

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors2

Uh oh!

Languages

Movatterモバイル変換

License

cl-tohoku/showcase

Folders and files

Latest commit

History

Repository files navigation

Showcase: Japanese Predicate-Argument Structure (PAS) analyzer

Demo

Usage

Input file format

Requirements

Instllation

Step 1. Install Showcase

Step 2: Download Resources

Step 3: Create and edit config.json

(Re-)training

Step1: Train word2vec

Step2: Train model

Step3: Convert word2vec

Citation

Contributor

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Contributors2

Uh oh!

Languages

Packages