Aleksa Gordić’s Post

View profile for Aleksa Gordić
Aleksa GordićAleksa Gordić is an Influencer

Ship, ship | x-Google DeepMind

Well, it's official. YugoGPT 7B significantly beats Mistral and LLaMA 2 and is now officially the best open-source LLM in the world for Serbian & other HBS (Croatian, Bosnian, Montenegrin) languages.Earlier this summer I was frustrated when I saw how poor the situation is as soon as you leave the English NLP space (w/ few exceptions).Exactly 57 days ago I came up with this idea while working on the Open-NLLB project and training a machine translation system for HBS.Why not train an LLM for HBS languages? And so yugoGPT was born.The problem? I had 1 RTX 3090 GPU on my custom built rig. :) (kindly provided to me byNVIDIA back in London!)Fast forward to today I have 16 80 GB A100s fromTogether AI (whom I seriously recommend - not just for their hardware but also for their support & speed & dedication) and 16 more coming in a week (from a different company -> will share more details then).I had to learn a ton of new stuff as I was basically a single guy (w/Nikola Ljubešić's tremendous help that saved me a ton of time on the data front) building this whole complex system (shout-out to my Discord community was essential for the Serbian LLM eval project).And I must say: I learned more over the past 6 months than I did during my whole time back at DeepMind. Unless you're very very senior - you just don't get the level of agency I'm enjoying right now.If I could do this bootstrapped with 16 A100s, give me a cluster of 1000 GPUs and I'll show you how things are getting done :) (hah!) well more on that later, things are cooking in the background.I'll just say that I'm already working on the 2nd iteration that'll be significantly better.As for yugoGPT I’ll be red teaming it and I'll need help from the community over the next period to test it. more updates soon :) keep an eye outAgain big thanks to:*Together AI - for generous help with compute*Weights & Biases - for generous donation of GPT-4 credits for the Serbian LLM eval project*Slobodan Marković - for spreading the word!!*Nikola Ljubešić - for sharing the data he's been working on over the past many years* and to many individuals & companies from the local region (mostly Serbia) that recently supported me!---Results obtained via Serbian LLM eval that I've recently released:https://lnkd.in/diQDTeh2

  • chart, bar chart
105 Comments
Like Comment
Tomaz Bratanic, graphic
Tomaz Bratanic

Graph ML and GenAI research at Neo4j

1y

Yugo nije za dugo 🤣

Danijel Domazet MSc, graphic
Danijel Domazet MSc

☝️ Signal processing engineer (audio and more) 🔊 Former start-up owner 🛠️ AI (ofcourse)🎰 Tech lover 🖥️... but also a guitar fingerpicker🪕 avid birdwatcher🐦 chess addict♟️ part-time gardener👨🌾 ...and a lll📚.

1y

Čestitam. Ova rečenica gore, da si više naučio na ovom šestomjesečnom samostalnom projektu nego u DeepMindu (!) je iznenađujuća, uzbudljiva, a pomalo i stravična 🙂. Keep on. 👍

Peter Seeberg, graphic
Peter Seeberg

Industrial AI Consultant, Moderator and Podcaster

1y

CongratsAleksa!When / where can the public access YugoGPT?(happy to show friends and family during the Christmas holiday 😉)

Yannick Léo, graphic
Yannick Léo

Partner - AI, Generative AI & Robotics at Emerton Data | Bridging Research & Business | Ph.D. in Computer Science

1y

Seems very impressive, congrats! Question for you Aleksa Gordić, have you excluded the benchmarking datasets from the training?

Smriti Mishra, graphic
Smriti Mishra

Data Science & Engineering | LinkedIn Top Voice Tech & Innovation | Mentor @ Google for Startups | 30 Under 30 STEM & Healthcare

1y

This is so amazingAleksa Gordić! 🚀

Adam Murphy, graphic
Adam Murphy

LLM Engineer | Document Processing Specialist | AWS ML Specialty Certified

1y

Incredible work!

Rishi Khetan, graphic
Rishi Khetan

Applied Data Science @ Galileo | GenAI, Agents, RAG LLMs, Search, Recsys | Scaling AI/ML from Research to Production

1y

Congrats, great work 🙌🏻

Jovan Stojanovic, graphic

Hell yeah... well done!

Tom Aarsen, graphic
Tom Aarsen

🤗 Sentence Transformers, SetFit & NLTK maintainer, MLE @ Hugging Face

1y

Very well done, impressive work as usual.

Blaž Jurišić, graphic
Blaž Jurišić

Head of AI @OnScriptAI | Detecting deepfakes @DeepQ

1y

Svaka cast!! Cestitke :) Ucinio si nezamisliv prvi korak za Yugo jezicnu skupinu i omogucio mnogima da nastave graditi na ovome 👏👏

See more comments

To view or add a comment,sign in

Aleksa Gordić

106,492 followers

View ProfileConnect

Explore topics