ku-nlp/jumanpp-jumandicPublic

NotificationsYou must be signed in to change notification settings
Fork0
Star7

Scripts for training Jumandic Juman++ model

You must be signed in to change notification settings

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
make		make
scripts		scripts
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
configure.py		configure.py

Repository files navigation

Descritption

This repository contains a set of scripts to build a ready-to-useJuman++ model for Jumandic.

Prerequrements

Unix environment (on Windows useWSL orMSYS2/MinGW64)
Juman++ build environment
Python 3.6+
Ruby
Perl
Configured ssh authorization for github (we will clone several repositories via ssh)
32 GB of RAM

How to Use

Run the configuration script:python3 configure.py.It will prompt for the location of Mainichi Shinbun texts.

After that runmake nornn for training a model without RNN component.make rnn produces the model with RNN component.The models will be inside thebld/model folder.

Adding your words to the model

It is possible to add your words to the model.To do it:

Perform the configuration as described above:python3 configure.py
Fetch the repositoriesmake repo.
Go intobld/repos/jumandic folder, it is a local clone ofJumanDIC repository.
Create a new file with the.dic extension in theuserdic folder of thebld/repos/jumandic folder.
Put your words into that file, in JUMAN dictionary format (refer to other files for example).
Executemake clean-dic if you have already built a Juman++ model.
Build your model as shown above.

If the built model does not contain your words, ensure that the binary dictionary was rebuilt after adding new words.

About

Scripts for training Jumandic Juman++ model

Releases1

data-2020.08.12 Latest

Aug 12, 2021

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Descritption

Prerequrements

Recommended

How to Use

Adding your words to the model

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases1

Languages

Movatterモバイル変換

ku-nlp/jumanpp-jumandic

Folders and files

Latest commit

History

Repository files navigation

Descritption

Prerequrements

Recommended

How to Use

Adding your words to the model

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases1

Languages