DEV Community

Wincent Balin

Posted on

Closure

After a pause, this series comes to a conclusion, mostly because of the rapid developments in the area of large language models.

Original intention

At the beginning, I intended to create a language model that would take the prompt "Geschirrabwaschgesetz" (a law about washing dishes) and write a corresponding law text in German.

I was discouraged from training the original char RNN by the scary amount of training time that 110 M of training data would require. Therefore I went with fine-tuning a German GPT-2 (and later the better one; thanks Jo!). The fine-tuning process of such a model is described here or here, for example.

(Un-)expected discovery

I happened to discover that my intended use case is covered almost perfectly by the LLAMA 2 Chat German model (almost, because of a few grammatical errors). This is very likely because it was fine-tuned with the German legal SQuAD dataset, among others.

I do not want to withhold the result from you (produced in LM Studio):

[Image: Output to "Geschirrabwaschgesetz"]

Just look at this beauty! It even defined "Hygiene" in the last subparagraph! And hence this series is concluded.

I have many interests, mostly where computer science or IT meets music or humanities.
  • Location
    Hamburg, Germany