Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.

License

NotificationsYou must be signed in to change notification settings

martysai/artificial-text-detection

Repository files navigation

Build StatusPyPi VersionsWheelcodecovCode style: blackPre-commitDVCLicense

Artificial Text Detection

Python framework for artificial text detection:NLP approaches to compare natural text against generated by neural networks.

Contents

Project description is put into:

Installation steps:

We usepoetry as an enhanced dependency resolver.

make poetry-downloadpoetry install --no-dev

Datasets for artificial text detection

To create datasets for the further classification, it is necessary to collect them.There are 2 available ways for it:

  • ViaData Version Control.Get in touch with@msaidov in order to have the access to the private Google Drive;
  • Via datasets generation. One dataset with a size of 20,000 samples was process with MT model on V100 GPU for 30 mins;

Data Version Control usage:

poetry add"dvc[gdrive]"

Then, rundvc pull. It will download preprocessed translation datasetsfrom the Google Drive.

Datasets generation

To generate translations before artificial text detection pipeline,install thedetection module from the cloned repo or PyPi (TODO):

pip install -e.

Then, run generate script:

python detection/data/generate.py --dataset_name='tatoeba' --size=20000 --device='cuda:0'

Simple run:

To run the artificial text detection classifier, execute the pipeline:

python detection/old.py

[8]ページ先頭

©2009-2025 Movatter.jp