Data Augmentation for Japanese Text on AugLy
chck/AugLy-jp
Data Augmentation for Japanese Text on AugLy
base_text = "あらゆる現実をすべて自分のほうへねじ曲げたのだ"
Augmenter | Augmented | Description |
---|---|---|
SynonymAugmenter | あらゆる現実をすべて自身のほうへねじ曲げたのだ | Substitute similar words according to Sudachi synonyms |
WordEmbsAugmenter | あらゆる現実をすべて関心のほうへねじ曲げたのだ | Leverage word2vec, GloVe or fasttext embeddings to apply augmentation |
FillMaskAugmenter | つまり現実を、未来な未来まで変えたいんだ | Using masked language model to generate text |
BackTranslationAugmenter | そして、ほかの人たちをそれぞれの道に安置しておられた | Leverage two translation models for augmentation |
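To make the first row of the table concrete, here is a minimal, self-contained sketch of synonym substitution. It is not the library's API: `TOY_SYNONYMS` and `substitute_synonyms` are hypothetical names introduced for illustration, the synonym table is hand-written rather than taken from Sudachi, and plain substring matching stands in for proper morphological tokenization.

```python
import random
from typing import Dict, List, Optional

# Hypothetical toy synonym table (assumption for illustration only).
# The SynonymAugmenter described above relies on Sudachi synonym data instead.
TOY_SYNONYMS: Dict[str, List[str]] = {
    "自分": ["自身"],
}


def substitute_synonyms(
    text: str,
    synonyms: Dict[str, List[str]],
    rng: Optional[random.Random] = None,
) -> str:
    """Replace every known word found in `text` with one of its listed synonyms.

    Simplification: words are located by substring search, not by a tokenizer,
    so this can match inside longer words.
    """
    rng = rng or random.Random(0)  # fixed seed keeps the sketch reproducible
    for word, candidates in synonyms.items():
        if candidates and word in text:
            text = text.replace(word, rng.choice(candidates))
    return text


base_text = "あらゆる現実をすべて自分のほうへねじ曲げたのだ"
augmented = substitute_synonyms(base_text, TOY_SYNONYMS)
print(augmented)  # あらゆる現実をすべて自身のほうへねじ曲げたのだ
```

With the single toy entry, the output matches the SynonymAugmenter row above. A realistic implementation would tokenize the sentence first and only replace whole tokens.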
Software | Install Command |
---|---|
Python 3.8.11 | pyenv install 3.8.11 |
Poetry 1.1.* | curl -sSL https://raw.githubusercontent.com/python-poetry/poetry/master/get-poetry.py \| python |
pip install augly-jp
Or clone this repository:
git clone https://github.com/chck/AugLy-jp.git
poetry install
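After either installation route, one quick way to check that the package is visible to the interpreter is to look up its distribution metadata. The sketch below uses only the standard library (`importlib.metadata`, available from Python 3.8); `installed_version` is a hypothetical helper name, and the distribution name `augly-jp` is taken from the pip command above. For a Poetry-based setup, it would presumably need to run inside the project environment, for example through `poetry run python`.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("augly-jp")
if version is None:
    print("augly-jp is not installed in this environment")
else:
    print(f"augly-jp {version} is installed")
```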
poetry run task test
poetry run task fmt
poetry run task lint
- https://github.com/facebookresearch/AugLy
- https://github.com/makcedward/nlpaug
- https://github.com/QData/TextAttack
This software includes work distributed under the Apache License 2.0 [1].