# DistilBERT-base-jp
language | license
---|---
Japanese | MIT
A Japanese DistilBERT pretrained model, which was trained on Wikipedia.
See here for a quickstart guide in Japanese.
DistilBERT is a small, fast, cheap, and light Transformer model based on the BERT architecture. It has 40% fewer parameters than BERT-base and runs 60% faster, while preserving 97% of BERT's performance as measured on the GLUE language understanding benchmark.
This model was trained with the official Hugging Face implementation from here for 2 weeks on an AWS p3dn.24xlarge instance.
More details about distillation can be found in the following paper: "DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter" by Sanh et al. (2019).
The teacher model is the pretrained Japanese BERT model from TOHOKU NLP LAB.
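For intuition, the distillation objective in that paper trains the student to match the teacher's softened output distribution alongside the usual masked-LM loss (the full recipe also adds a cosine embedding loss on hidden states). Below is a minimal sketch of the first two terms; the function name, temperature, and weighting are illustrative assumptions, not the authors' exact training code:

```python
# Sketch of a distillation loss: soft-target KL against the teacher
# plus hard-target cross-entropy on the true masked tokens.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft targets: KL divergence between temperature-softened distributions,
    # scaled by T^2 as in Hinton et al. (2015).
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: standard masked-LM cross-entropy (-100 labels are ignored).
    hard = F.cross_entropy(
        student_logits.view(-1, student_logits.size(-1)), labels.view(-1)
    )
    return alpha * soft + (1.0 - alpha) * hard
```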
Currently, only PyTorch-compatible weights are available. TensorFlow checkpoints can be generated by following the official guide.
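As an alternative to the conversion script in that guide, recent versions of transformers can load the PyTorch weights directly into a TensorFlow model class and re-save them. A minimal sketch, assuming TensorFlow is installed in addition to the requirements below; LOCAL_PATH is the unzipped model directory described later, and LOCAL_PATH_TF is a placeholder output directory:

```python
# Convert the PyTorch weights to a TensorFlow checkpoint (sketch).
from transformers import TFAutoModel

# from_pt=True tells transformers to load PyTorch weights into the TF model.
tf_model = TFAutoModel.from_pretrained("LOCAL_PATH", from_pt=True)
tf_model.save_pretrained("LOCAL_PATH_TF")  # writes tf_model.h5 and config.json
```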
```
torch>=1.3.1
torchvision>=0.4.2
transformers>=2.5.0
tensorboard>=1.14.0
tensorboardX==1.8
scikit-learn>=0.21.0
mecab-python3
```
Please download and unzip DistilBERT-base-jp.zip.
```python
# Read from local path
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-japanese-whole-word-masking")
model = AutoModel.from_pretrained("LOCAL_PATH")
```
LOCAL_PATH is the path to the directory where the above file is unzipped. It should contain 3 files:
- pytorch_model.bin
- config.json
- vocab.txt
or
```python
# Download from the model library on huggingface.co
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-japanese-whole-word-masking")
model = AutoModel.from_pretrained("bandainamco-mirai/distilbert-base-japanese")
```
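Either way, the loaded model can then be used as a feature extractor. A minimal usage sketch, where the example sentence and mean pooling are illustrative choices (note the tokenizer requires mecab-python3 from the requirements above):

```python
# Encode a Japanese sentence and pool the hidden states into one vector.
import torch

text = "こんにちは、世界。"
input_ids = tokenizer.encode(text, return_tensors="pt")

with torch.no_grad():
    outputs = model(input_ids)

last_hidden_state = outputs[0]                   # shape: (1, seq_len, 768)
sentence_vector = last_hidden_state.mean(dim=1)  # simple mean pooling
print(sentence_vector.shape)                     # torch.Size([1, 768])
```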
Copyright (c) 2020 BANDAI NAMCO Research Inc.
Released under the MIT license