AllenNLP integration for Shiba: Japanese CANINE model
shunk031/allennlp-shiba-model
AllenNLP Integration for Shiba
allennlp-shiba-model is a Python library that provides AllenNLP integration for shiba-model.
SHIBA is an approximate reimplementation of CANINE [1] in raw PyTorch, pretrained on the Japanese Wikipedia corpus using random span masking. If you are unfamiliar with CANINE, you can think of it as a very efficient (approximately 4x as efficient) character-level BERT model. The name SHIBA comes from the identically named Japanese canine.
Installing the library and its dependencies is simple using pip.
pip install allennlp-shiba
This library lets users specify the SHIBA tokenizer, token indexer, and token embedder in a Jsonnet config file. Here is an example of such a config:
    {
        "dataset_reader": {
            "tokenizer": {
                "type": "shiba",
            },
            "token_indexers": {
                "tokens": {
                    "type": "shiba",
                }
            },
        },
        "model": {
            "shiba_embedder": {
                "type": "basic",
                "token_embedders": {
                    "shiba": {
                        "type": "shiba",
                        "eval_model": true,
                    }
                }
            }
        }
    }
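If you prefer to generate the config from Python (for example, to toggle `eval_model` between runs), the fragment above can be built as a plain dictionary and written out as JSON, which is also valid Jsonnet. This is only a minimal sketch using the standard library: the helper names `build_shiba_config` and `write_config` are hypothetical and not part of this library, and the result is still a fragment, so a real experiment would need the remaining sections of an AllenNLP config (data paths, trainer, and so on).

```python
import json

# Registered component name, taken from the example config above.
SHIBA_TYPE = "shiba"


def build_shiba_config(eval_model: bool = True) -> dict:
    """Return the example config fragment as a plain dict (hypothetical helper)."""
    return {
        "dataset_reader": {
            "tokenizer": {"type": SHIBA_TYPE},
            "token_indexers": {"tokens": {"type": SHIBA_TYPE}},
        },
        "model": {
            "shiba_embedder": {
                "type": "basic",
                "token_embedders": {
                    "shiba": {"type": SHIBA_TYPE, "eval_model": eval_model},
                },
            },
        },
    }


def write_config(path: str, eval_model: bool = True) -> None:
    """Write the fragment to `path` as JSON, which Jsonnet accepts as-is."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(build_shiba_config(eval_model), f, indent=4)
```

Calling `write_config("shiba.jsonnet")` would produce a file with the same structure as the example above, minus the trailing commas that JSON does not allow.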
- Joshua Tanner and Masato Hagiwara (2021). SHIBA: Japanese CANINE model. GitHub repository, GitHub.