EleutherAI (/əˈluːθər/[2]) is a grass-roots non-profit artificial intelligence (AI) research group. The group, considered an open-source version of OpenAI,[3] was formed in a Discord server in 2020 to create an open-source version of GPT-3.[4] In early 2023, it formally incorporated as the EleutherAI Institute, a non-profit research institute.[5] As of 2025, the nonprofit maintains widely used training datasets, conducts research, and is involved in public policy, among other activities.[4]
EleutherAI began as a Discord server on July 7, 2020, under the tentative name "LibreAI" before rebranding to "EleutherAI" later that month,[6][better source needed] in reference to eleutheria, the Greek word for liberty.[3] Its founding members are Connor Leahy, Leo Gao, and Sid Black.[3] They co-wrote the code for EleutherAI to serve as a collection of open-source AI research, creating a machine learning model similar to GPT-3.[5]
On December 31, 2020, EleutherAI released The Pile, a curated dataset of diverse text for training large language models. While the accompanying paper referenced the existence of the GPT-Neo models, the models themselves were not released until March 21, 2021.[7][better source needed] On June 9, 2021, EleutherAI followed this up with GPT-J-6B, a six-billion-parameter language model that was again the largest open-source GPT-3-like model in the world.[8][better source needed] These language models were released under the Apache 2.0 free software license and are considered to have "fueled an entirely new wave of startups".[5]
While EleutherAI initially turned down funding offers, preferring to use Google's TPU Research Cloud program to source their compute,[3] by early 2021 they had accepted funding from CoreWeave (a small cloud computing company) and SpellML (a cloud infrastructure company) in the form of access to the powerful GPU clusters necessary for large-scale machine learning research. On February 10, 2022, they released GPT-NeoX-20B, a model similar to their prior work but scaled up thanks to the resources CoreWeave provided.[9]
In early 2023, EleutherAI incorporated as a non-profit research institute run by Stella Biderman, Curtis Huebner, and Shivanshu Purohit.[5] EleutherAI also announced a shift towards work in interpretability, alignment, and scientific research.[10][non-primary source needed] EleutherAI felt that "there is substantially more interest in training and releasing LLMs than there once was", enabling them to focus on other projects.[11]
In July 2024, an investigation by Proof News found that EleutherAI's The Pile dataset includes subtitles from over 170,000 YouTube videos across more than 48,000 channels. The findings drew criticism and accusations of theft from YouTubers and others whose work had been published on the platform.[12]
In 2025, EleutherAI released a new training dataset, the Common Pile, which omits the controversial copyrighted material contained in its earlier release, The Pile, and trained two models on it.[13] In collaboration with the UK's AI Security Institute, EleutherAI found that filtering training data to remove key concepts can maintain model performance while reducing the ability to provide harmful information.[14][15]
The Pile is an 886 GB dataset designed for training large language models. It was originally developed to train EleutherAI's GPT-Neo models[17] but has become widely used to train other models, including Microsoft's Megatron-Turing Natural Language Generation.[18] Compared to other datasets, the Pile's main distinguishing features are that it is a curated selection of data, chosen by EleutherAI researchers to contain information they thought language models should learn, and that it is the only such dataset thoroughly documented by the researchers who developed it.[19] The initial Pile dataset has come under scrutiny for containing copyrighted material,[13] including books[20][21][22] and subtitles from documentaries, movies, television, and online videos,[23] including from YouTube.[24][12]
Common Pile v0.1, released in June 2025 in partnership with a large number of collaborators, contains only works whose licenses permit their use for training AI models.[13][25]
EleutherAI's most prominent research relates to its work training open-source large language models inspired by OpenAI's GPT-3.[7][better source needed] EleutherAI's "GPT-Neo" model series comprises models of 125 million, 1.3 billion, 2.7 billion, 6 billion, and 20 billion parameters.
GPT-Neo (125M, 1.3B, 2.7B):[26] released in March 2021, it was the largest open-source GPT-3-style language model in the world at the time of release.
GPT-J (6B): released in June 2021, it was the largest open-source GPT-3-style language model in the world at the time of release.
Artificial intelligence art created with VQGAN-CLIP, a text-to-image model created by EleutherAI

Artificial intelligence art created with CLIP-Guided Diffusion, another text-to-image model created by Katherine Crowson of EleutherAI[27][28]
Following the release of DALL-E by OpenAI in January 2021, EleutherAI started working on text-to-image synthesis models. When OpenAI did not release DALL-E publicly, EleutherAI's Katherine Crowson and digital artist Ryan Murdock developed a technique for using CLIP (another model developed by OpenAI) to convert regular image generation models into text-to-image synthesis ones.[29][30][31][32] Building on ideas dating back to Google's DeepDream,[33] they found their first major success combining CLIP with a publicly available model called VQGAN; the resulting model is called VQGAN-CLIP.[34] Crowson released the technology by tweeting notebooks demonstrating the technique, which people could run for free without any special equipment.[35][36][37]
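At its core, the CLIP-guidance technique described above is gradient-based optimization: a generator's latent code is nudged, step by step, so that the CLIP similarity between the generated image and a text prompt increases. The following is a minimal toy sketch of that loop in plain NumPy; the random linear maps `G` and `E` are placeholders standing in for VQGAN's decoder and CLIP's image encoder (assumptions for illustration, not the real networks).

```python
import numpy as np

rng = np.random.default_rng(0)
D_LATENT, D_IMG, D_EMB = 16, 64, 32

# Placeholders for the real models (assumptions, not the actual networks):
# G acts as the "VQGAN decoder" (latent -> image),
# E acts as the "CLIP image encoder" (image -> embedding).
G = rng.normal(size=(D_IMG, D_LATENT))
E = rng.normal(size=(D_EMB, D_IMG))

def normalize(v):
    return v / np.linalg.norm(v)

def clip_score(z, text_emb):
    """Cosine similarity between the embedding of the 'image' G @ z and a unit text embedding."""
    img_emb = E @ (G @ z)
    return float(normalize(img_emb) @ text_emb)

def guide(z, text_emb, lr=1e-2, steps=200):
    """Gradient ascent on the latent z to maximize CLIP similarity (the VQGAN-CLIP idea)."""
    for _ in range(steps):
        u = E @ (G @ z)                  # unnormalized image embedding
        n = np.linalg.norm(u)
        # Analytic gradient of cos-similarity (u . t) / ||u|| with respect to u:
        grad_u = text_emb / n - (u @ text_emb) * u / n**3
        # Chain rule back through the linear "encoder" and "decoder":
        z = z + lr * (G.T @ (E.T @ grad_u))
    return z

# Pretend text_emb encodes a prompt such as "a painting of a lighthouse".
text_emb = normalize(rng.normal(size=D_EMB))
z0 = rng.normal(size=D_LATENT)
z1 = guide(z0, text_emb)
print(clip_score(z0, text_emb), "->", clip_score(z1, text_emb))
```

In the real pipeline the gradient flows by automatic differentiation through CLIP and VQGAN rather than an analytic formula, and the "image" is an actual pixel array, but the control flow is the same: score, backpropagate, update the latent, repeat.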
^ Khan, Mehtab; Hanna, Alex (2023). "The Subjects and Stages of AI Dataset Development: A Framework for Dataset Accountability". Ohio State Technology Law Journal. 19 (2): 171–256. hdl:1811/103549. SSRN 4217148.