GPT (Decoder only Transformer - from scratch) generated fake/phoney taxonomies (based on NCBI taxonomy dataset)


suvash/taxophoney


Fake (phoney) taxonomies generated by a GPT (decoder-only Transformer, written from scratch) trained on the NCBI taxonomy dataset, which is included in this repository.

Requirements

  • PyTorch 1.12.1+cu116 (with CUDA support, for reasonably short training runs)

Quick training results

$ python gpt.py
Using device: cuda
step 0: train loss 4.4625, val loss 4.4653
step 500: train loss 2.0843, val loss 2.1280
step 1000: train loss 1.5394, val loss 1.5920
step 1500: train loss 1.3097, val loss 1.3789
step 2000: train loss 1.1842, val loss 1.2741
step 2500: train loss 1.1017, val loss 1.2182
step 3000: train loss 1.0408, val loss 1.1938
step 3500: train loss 0.9831, val loss 1.1692
step 4000: train loss 0.9382, val loss 1.1591
step 4500: train loss 0.8935, val loss 1.1392
step 4999: train loss 0.8545, val loss 1.1383
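
To make the loss numbers in the log above easier to read, here is a minimal sketch that converts them to perplexity. It assumes the reported values are mean cross-entropy losses in nats (PyTorch's cross-entropy uses the natural logarithm); the README does not say this explicitly. The `perplexity` helper and the `LOG` list are illustrative names, not part of gpt.py.

```python
import math

# First and last entries copied from the training log: (step, train loss, val loss).
LOG = [
    (0, 4.4625, 4.4653),
    (4999, 0.8545, 1.1383),
]

def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)

for step, train, val in LOG:
    print(f"step {step}: train ppl {perplexity(train):.2f}, val ppl {perplexity(val):.2f}")
```

Under that assumption, the step 0 perplexity of roughly 87 would be consistent with a near-uniform guess over a vocabulary of about that size, and the final validation perplexity is a little above 3. The vocabulary size is an inference from the log, not something the README states.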

Generated phoney taxonomy

The training and sampling script trains the model and can then generate (sample) a large number of names. Some of the generated names are included in the taxophoney.txt file in this repo.
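
The sampling step can be sketched as drawing one character at a time from a next-character distribution until an end marker appears. The example below is not the repository's gpt.py: it swaps the trained Transformer for a toy bigram counter so that it runs with the standard library alone. `CORPUS`, `END`, `fit_bigrams` and `sample_name` are hypothetical names, and the character-level setup is an assumption.

```python
import random
from collections import Counter, defaultdict

# A few names quoted in this README, used as a toy corpus (not the real training set).
CORPUS = [
    "Rhodarius leyi",
    "Oligops erythrotis",
    "Ablenus amaratha",
    "Columbidium metulum",
    "Gobionia rotalorum",
]
END = "\n"  # hypothetical end-of-name marker

def fit_bigrams(names):
    """Count which character follows which; a stand-in for the trained GPT."""
    counts = defaultdict(Counter)
    for name in names:
        text = END + name + END
        for prev, nxt in zip(text, text[1:]):
            counts[prev][nxt] += 1
    return counts

def sample_name(counts, rng, max_len=40):
    """Draw characters one at a time until END is sampled or max_len is reached."""
    out, prev = [], END
    while len(out) < max_len:
        chars, weights = zip(*counts[prev].items())
        nxt = rng.choices(chars, weights=weights, k=1)[0]
        if nxt == END:
            break
        out.append(nxt)
        prev = nxt
    return "".join(out)

counts = fit_bigrams(CORPUS)
rng = random.Random(0)  # fixed seed so runs are repeatable
for _ in range(3):
    print(sample_name(counts, rng))
```

A real GPT conditions on the whole preceding context, not only the previous character, but the outer loop (predict, sample, append, repeat) has the same shape.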

Bonus: Generated images from the phoney names

Naturally, some of these names make one wonder what the creatures could look like. I used the Stable Diffusion v1-5 model by RunwayML to generate images for some of the names. The generation prompt includes only the common name (inside the parentheses) and not the scientific name, since scientific names didn't help produce plausible images.
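
Pulling the common name out of a generated line for use as a prompt could look like the sketch below. It assumes each line has the form "Scientific name (common name)", as in the examples in this README. `common_name` and the regular expression are illustrative and not taken from the repository.

```python
import re

# Matches the last parenthesised group at the end of a generated line.
COMMON_NAME = re.compile(r"\(([^()]*)\)\s*$")

def common_name(line: str):
    """Return the text inside the trailing parentheses, or None if there is none."""
    match = COMMON_NAME.search(line)
    return match.group(1).strip() if match else None

prompt = common_name("Gobionia rotalorum (round horned fringe-fingered gecko)")
print(prompt)  # round horned fringe-fingered gecko
```

Anchoring the pattern at the end of the line keeps stray punctuation in the scientific part, such as an unmatched quote, from affecting the result.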

Rhodarius leyi (Leyn's land weaker caterpillar)

[Image: Leyn's land weaker caterpillar]

Oligops erythrotis (greater-cheeked of leaf-warbler)

[Image: greater-cheeked of leaf-warbler]

Ablenus amaratha (Golden-banded stone-eyellow bat)

[Image: Golden-banded stone-eyellow bat]

Chliostega sp. 'Nawatan (strawberry little emperor)

[Image: strawberry little emperor]

Columbidium metulum (blotcheye columbing beetle)

[Image: blotcheye columbing beetle]

Gobionia rotalorum (round horned fringe-fingered gecko)

[Image: round horned fringe-fingered gecko]
