line/LINE-DistilBERT-Japanese
This is a DistilBERT model pre-trained on 131 GB of Japanese web text. The teacher model is a BERT-base model built in-house at LINE. The model was trained by LINE Corporation.
https://huggingface.co/line-corporation/line-distilbert-base-japanese
README_ja.md is written in Japanese.
```python
from transformers import AutoTokenizer, AutoModel

# The custom Japanese tokenizer is loaded from the Hub, hence trust_remote_code=True
tokenizer = AutoTokenizer.from_pretrained("line-corporation/line-distilbert-base-japanese", trust_remote_code=True)
model = AutoModel.from_pretrained("line-corporation/line-distilbert-base-japanese")

sentence = "LINE株式会社で[MASK]の研究・開発をしている。"
print(model(**tokenizer(sentence, return_tensors="pt")))
```
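Since the example sentence contains a [MASK] token, the checkpoint can also be exercised through the fill-mask pipeline. This is a hedged sketch, not part of the original README: it assumes the published checkpoint includes the masked-LM head and that the custom tokenizer uses [MASK] as its mask token.

```python
from transformers import pipeline

# Sketch only: assumes the Hub checkpoint provides masked-LM weights and a
# [MASK] mask token; trust_remote_code is needed for the custom tokenizer.
fill_mask = pipeline(
    "fill-mask",
    model="line-corporation/line-distilbert-base-japanese",
    trust_remote_code=True,
)
for candidate in fill_mask("LINE株式会社で[MASK]の研究・開発をしている。"):
    print(candidate["token_str"], candidate["score"])
```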
The following packages are required: fugashi, sentencepiece, unidic-lite.
The model architecture is the DistilBERT base model: 6 layers, 768 dimensions of hidden states, 12 attention heads, 66M parameters.
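The stated sizes can be checked directly against the published configuration. A minimal sketch (not from the original README), assuming the checkpoint uses the standard DistilBERT config field names (n_layers, dim, n_heads):

```python
from transformers import AutoConfig, AutoModel

# Inspect the architecture reported above: 6 layers, 768-dim hidden states, 12 heads.
config = AutoConfig.from_pretrained("line-corporation/line-distilbert-base-japanese")
print(config.n_layers, config.dim, config.n_heads)

# Count parameters; compare with the 66M/68M figures quoted in this README,
# which differ depending on how embeddings are counted.
model = AutoModel.from_pretrained("line-corporation/line-distilbert-base-japanese")
print(sum(p.numel() for p in model.parameters()))
```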
The evaluation by JGLUE is as follows:
model name | #Params | Marc_ja (acc) | JNLI (acc) | JSTS (Pearson/Spearman) | JSQuAD (EM/F1) | JCommonSenseQA (acc) |
---|---|---|---|---|---|---|
LINE-DistilBERT | 68M | 95.6 | 88.9 | 89.2/85.1 | 87.3/93.3 | 76.1 |
Laboro-DistilBERT | 68M | 94.7 | 82.0 | 87.4/82.7 | 70.2/87.3 | 73.2 |
BandaiNamco-DistilBERT | 68M | 94.6 | 81.6 | 86.8/82.1 | 80.0/88.0 | 66.5 |
The texts are first tokenized by MeCab with the Unidic dictionary and then split into subwords by the SentencePiece algorithm. The vocabulary size is 32768.
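To see this two-stage tokenization in action, the tokenizer loaded with trust_remote_code exposes the standard tokenize method. A small sketch (illustrative, not from the original README):

```python
from transformers import AutoTokenizer

# MeCab (Unidic) word segmentation followed by SentencePiece subword splitting
# happens inside the custom tokenizer shipped with the Hub repository.
tokenizer = AutoTokenizer.from_pretrained(
    "line-corporation/line-distilbert-base-japanese", trust_remote_code=True
)
print(tokenizer.vocab_size)  # expected to be 32768, per the README
print(tokenizer.tokenize("LINE株式会社で[MASK]の研究・開発をしている。"))
```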
The pretrained models are distributed under the terms of the Apache License, Version 2.0.
We haven't published any paper on this work. Please cite this GitHub repository:
```
@article{LINE DistilBERT Japanese,
  title = {LINE DistilBERT Japanese},
  author = {"Koga, Kobayashi and Li, Shengzhe and Nakamachi, Akifumi and Sato, Toshinori"},
  year = {2023},
  howpublished = {\url{http://github.com/line/LINE-DistilBERT-Japanese}}
}
```