OpenAI was the first to apply generative pre-training (GP) to the transformer architecture, introducing the GPT-1 model in 2018.[6] The company has since released many larger GPT models. The popular chatbot ChatGPT, released in late 2022 (using GPT-3.5), was followed by many competitor chatbots using their own "GPT" models to generate text, such as Gemini, DeepSeek, or Claude.[7]
GPTs are primarily used to generate text, but can be trained to generate other kinds of data. For example, GPT-4o can process and generate text, images and audio.[8] To improve performance on complex tasks, some GPTs, such as OpenAI o3, spend more time analyzing the problem before generating an output, and are called reasoning models. In 2025, GPT-5 was released with a router that automatically selects whether to use a faster model or a slower reasoning model based on the task.
According to The Economist, improved algorithms, more powerful computers, and an increase in the amount of digitized material fueled a revolution in machine learning during the 2010s. New techniques in the years before the AI boom resulted in "rapid improvements in tasks", including manipulating language.[9] Modern software models are trained to learn by using millions of examples in artificial neural networks that are inspired by biological neural structures.[9]
Separately, generative pre-training (GP) was a long-established technique in machine learning. GP is a form of self-supervised learning wherein a model is first trained on a large, unlabeled dataset (the "pre-training" step) to learn to generate data points. This pre-trained model is then adapted to a specific task using a labeled dataset (the "fine-tuning" step).[10]
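For illustration, the two-stage recipe can be sketched in a few lines of PyTorch; the toy model, data, and sizes below are placeholder assumptions for exposition, not any actual GPT implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyGPT(nn.Module):
    """A toy causal language model standing in for a GPT."""
    def __init__(self, vocab_size=1000, dim=64):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers=2)
        self.lm_head = nn.Linear(dim, vocab_size)  # used during pre-training
        self.cls_head = nn.Linear(dim, 2)          # added for the fine-tuning task

    def forward(self, tokens):
        # Causal mask so each position only attends to earlier positions (GPT-style).
        mask = nn.Transformer.generate_square_subsequent_mask(tokens.size(1))
        return self.blocks(self.embed(tokens), mask=mask)

model = TinyGPT()
tokens = torch.randint(0, 1000, (8, 32))  # a batch of unlabeled token ids

# 1) Pre-training objective: predict the next token at every position (self-supervised).
hidden = model(tokens[:, :-1])
pretrain_loss = F.cross_entropy(
    model.lm_head(hidden).reshape(-1, 1000), tokens[:, 1:].reshape(-1))

# 2) Fine-tuning objective: a supervised label (e.g. sentiment) for each labeled sequence.
labels = torch.randint(0, 2, (8,))
finetune_loss = F.cross_entropy(model.cls_head(model(tokens)[:, -1]), labels)
```

The same pre-trained backbone is reused in both stages; only the objective (and, here, the output head) changes between pre-training and fine-tuning.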
The transformer architecture for deep learning is the core technology of a GPT. Developed by researchers at Google, it was introduced in the paper "Attention Is All You Need", which was published on June 12, 2017. The transformer architecture solved many of the performance issues that were associated with older recurrent neural network (RNN) designs for natural language processing (NLP). The architecture's use of an attention mechanism allows models to process entire sequences of text at once, enabling the training of much larger and more sophisticated models. Since 2017, numerous transformer-based NLP systems have been available that are capable of processing, mining, organizing, connecting, contrasting, and summarizing texts as well as correctly answering questions from textual input.[11][12]
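As a rough illustration of that attention mechanism, the scaled dot-product attention at the heart of the transformer can be sketched as follows; the shapes and variable names are illustrative, not the original implementation.

```python
import numpy as np

def attention(Q, K, V):
    """Every position attends to every position in the sequence at once."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # query-key similarity for all pairs
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax over the sequence
    return weights @ V                               # weighted combination of value vectors

x = np.random.randn(5, 16)   # a sequence of 5 token vectors of width 16
out = attention(x, x, x)     # self-attention: Q, K and V come from the same sequence
print(out.shape)             # (5, 16): one updated vector per input position
```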
On June 11, 2018, OpenAI researchers and engineers published a paper called "Improving Language Understanding by Generative Pre-Training", which introduced GPT-1, the first GPT model.[13] It was designed as a transformer-based large language model that used generative pre-training (GP) on BookCorpus, a diverse text corpus, followed by discriminative fine-tuning to focus on specific language tasks.[14] This semi-supervised approach was seen as a breakthrough. Previously, the best-performing neural models in natural language processing (NLP) had commonly employed supervised learning from large amounts of manually labeled data – training a large language model with this approach would have been prohibitively expensive and time-consuming.[13]
On February 14, 2019, OpenAI introduced GPT-2, a larger model that could generate coherent text. Created as a direct scale-up of its predecessor, it had both its parameter count and dataset size increased by a factor of 10. GPT-2 has 1.5 billion parameters and was trained on WebText, a 40-gigabyte dataset of 8 million web pages.[15][16][17] Citing risks of malicious use, OpenAI opted for a "staged release", initially publishing smaller versions of the model before releasing the full 1.5-billion-parameter model in November 2019.[18]
On February 10, 2020, Microsoft introduced its Turing Natural Language Generation, which it claimed was the "largest language model ever published at 17 billion parameters." The model outperformed all previous language models at a variety of tasks, including summarizing texts and answering questions.[19]
On May 28, 2020, OpenAI introduced GPT-3, a model with 175 billion parameters that was trained on a larger dataset than GPT-2. It marked a significant advancement in few-shot and zero-shot learning: given only a few examples, it could perform many tasks it was not explicitly trained for.[20][21]
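In practice, few-shot use amounts to writing the task and a handful of examples directly into the prompt, with no parameter updates; the prompt below is an illustrative example of that pattern.

```python
# An illustrative few-shot prompt: the task is specified entirely in-context,
# and the model is simply asked to continue the text. No fine-tuning is involved.
few_shot_prompt = """Translate English to French.

sea otter => loutre de mer
peppermint => menthe poivrée
cheese =>"""
# A sufficiently capable model completes the pattern, e.g. with "fromage".
```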
Following the release of GPT-3, OpenAI began using reinforcement learning from human feedback (RLHF) to align models' behavior more closely with human preferences. This led to the development of InstructGPT, a fine-tuned version of GPT-3. OpenAI further refined InstructGPT to create ChatGPT, its flagship chatbot product, which launched on November 30, 2022.[22] ChatGPT was initially based on GPT-3.5 but later transitioned to the GPT-4 model, which was released on March 14, 2023.[23][24] GPT-4 was also integrated into several applications, including Microsoft Copilot, GitHub Copilot, Snapchat, Khan Academy, and Duolingo.[25]
The immense popularity of ChatGPT spurred widespread development of competing GPT-based systems from other organizations. EleutherAI released a series of open-weight models, including GPT-J in 2021. Other major technology companies later developed their own GPT models, such as Google's PaLM and Gemini as well as Meta AI's Llama.[26]
Many subsequent GPT models have been trained to be multimodal (able to process or to generate multiple types of data). For example, GPT-4o can both process and generate text, images, and audio.[27] Additionally, GPT models like o3 and DeepSeek R1 have been trained with reinforcement learning to generate multi-step chain-of-thought reasoning before producing a final answer, which helps to solve complex problems in domains such as mathematics.[28]
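As a simplified illustration of that chain-of-thought behaviour, a reasoning model emits intermediate steps before its final answer; the text below is invented for illustration and is not output from any particular model.

```python
prompt = "Q: A train travels 60 km in 1.5 hours. What is its average speed?"

# A reasoning model's output interleaves its working with the final answer, e.g.:
response = (
    "Reasoning: speed = distance / time = 60 km / 1.5 h = 40 km/h.\n"
    "Answer: 40 km/h"
)
```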
On August 7, 2025, OpenAI released GPT-5, which includes a router that automatically selects whether to use a faster model or a slower reasoning model based on the task.[29][30]
A foundation model is an AI model trained on broad data at scale such that it can be adapted to a wide range of downstream tasks.[31][32]
Thus far, the most notable GPT foundation models have been from OpenAI's GPT-n series, the most recent of which is GPT-5.[33]
Other such models include Google's PaLM, a broad foundation model that has been compared to GPT-3 and has been made available to developers via an API,[34][35] and Together's GPT-JT, which has been reported as the closest-performing open-source alternative to GPT-3 (and is derived from earlier open-source GPTs).[36] Meta AI (formerly Facebook) also has a generative transformer-based foundational large language model, known as LLaMA.[37]
Foundational GPTs can also employ modalities other than text, for input and/or output. GPT-4 is a multimodal LLM that is capable of processing text and image input (though its output is limited to text).[38] Regarding multimodal output, some generative transformer-based models are used for text-to-image technologies such as diffusion[39] and parallel decoding.[40] Such kinds of models can serve as visual foundation models (VFMs) for developing downstream systems that can work with images.[41]
Image caption: Training workflow of the original ChatGPT/InstructGPT release[42][43]
A foundational GPT model can be further adapted to produce more targeted systems directed to specific tasks and/or subject-matter domains. Methods for such adaptation can include additional fine-tuning (beyond that done for the foundation model) as well as certain forms of prompt engineering.[44]
An important example of this is fine-tuning models to follow instructions, which is a fairly broad task but more targeted than a foundation model. In January 2022, OpenAI introduced "InstructGPT" – a series of models fine-tuned to follow instructions using a combination of supervised training and reinforcement learning from human feedback (RLHF) on base GPT-3 language models.[45][46] Advantages over the bare foundational models included higher accuracy, less negative/toxic sentiment, and generally better alignment with user needs. Hence, OpenAI began using this as the basis for its API service offerings.[47] Other instruction-tuned models have been released by other organizations, including a fully open version.[48][49]
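For illustration, the preference-learning stage of RLHF can be sketched with toy tensors as below; the model, shapes, and data are assumptions made for exposition, not the InstructGPT implementation.

```python
import torch
import torch.nn as nn

# Stage 1 (not shown): supervised fine-tuning on human-written demonstrations.

# Stage 2: train a reward model on human preference pairs. For each prompt, a
# labeler marked one candidate response as better than another; the reward
# model learns to score the preferred response higher.
reward_model = nn.Linear(128, 1)   # toy scorer over (placeholder) response embeddings
better, worse = torch.randn(16, 128), torch.randn(16, 128)
rm_loss = -torch.log(torch.sigmoid(reward_model(better) - reward_model(worse))).mean()

# Stage 3 (not shown): optimise the language model to maximise the learned
# reward, typically with PPO plus a KL penalty toward the original model.
```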
Another, related kind of task-specific model is the chatbot, which engages in human-like conversation. In November 2022, OpenAI launched ChatGPT – an online chat interface powered by an instruction-tuned language model trained in a similar fashion to InstructGPT.[50] OpenAI trained this model using RLHF, with human AI trainers providing conversations in which they played both the user and the AI; this new dialogue dataset was mixed with the InstructGPT dataset to produce a conversational format suitable for a chatbot. Other major chatbots currently include Microsoft's Bing Chat, which uses OpenAI's GPT-4 (as part of a broader close collaboration between OpenAI and Microsoft),[51] and Google's competing chatbot Gemini (initially based on their LaMDA family of conversation-trained language models, with plans to switch to PaLM).[52]
Yet another kind of task that a GPT can be used for is the meta-task of generating its own instructions, such as developing a series of prompts for itself in order to accomplish a more general goal given by a human user.[53] This is known as an AI agent, and more specifically a recursive one, because it uses results from its previous self-instructions to help it form its subsequent prompts; the first major example of this was Auto-GPT (which uses OpenAI's GPT models), and others have since been developed as well.[54]
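A minimal sketch of such a recursive agent loop is shown below; `call_llm` and `run_tool` are hypothetical placeholders standing in for a language-model API call and a tool executor, not real library functions.

```python
def call_llm(prompt: str) -> str:
    # Hypothetical stand-in for a GPT API call; returns a canned reply here.
    return "DONE"

def run_tool(action: str) -> str:
    # Hypothetical stand-in for executing the proposed action (search, code, ...).
    return "result of " + action

def agent(goal: str, max_steps: int = 5) -> list:
    history = []
    for _ in range(max_steps):
        # The model writes its own next instruction, conditioned on the goal
        # and on the results of its earlier self-instructions.
        prompt = f"Goal: {goal}\nPrevious steps: {history}\nNext action:"
        action = call_llm(prompt)
        if action.strip() == "DONE":
            break
        history.append((action, run_tool(action)))
    return history
```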
GPT systems can be directed toward particular fields or domains. Some reported examples of such models and apps are as follows:
EinsteinGPT – for sales and marketing domains, to aid with customer relationship management (uses GPT-3.5)[55][56]
BloombergGPT – for the financial domain, to aid with financial news and information (uses "freely available" AI methods, combined with their proprietary data)[57]
Khanmigo – described as a GPT version for tutoring, in the education domain, it aids students using Khan Academy by guiding them through their studies without directly providing answers (powered by GPT-4)[58][59]
SlackGPT – for the Slack instant-messaging service, to aid with navigating and summarizing discussions on it (uses OpenAI's API)[60]
BioGPT – for the biomedical domain, to aid with biomedical literature text generation and mining (uses GPT-2)[61]
Sometimes domain-specificity is accomplished via software plug-ins or add-ons. For example, several different companies have developed particular plugins that interact directly with OpenAI's ChatGPT interface,[62][63] and Google Workspace has available add-ons such as "GPT for Sheets and Docs" – which is reported to aid use of spreadsheet functionality in Google Sheets.[64][65]
OpenAI, which created the first generative pre-trained transformer (GPT) in 2018, asserted in 2023 that "GPT" should be regarded as a brand of OpenAI.[66] In April 2023, OpenAI revised the brand guidelines in its terms of service to indicate that other businesses using its API to run their AI services would no longer be able to include "GPT" in such names or branding.[67] In May 2023, OpenAI engaged a brand management service to notify its API customers of this policy, although these notifications stopped short of making overt legal claims (such as allegations of trademark infringement or demands to cease and desist).[66] As of November 2023, OpenAI still prohibits its API licensees from naming their own products with "GPT",[68] but it has begun enabling its ChatGPT Plus subscribers to make "custom versions of ChatGPT" called GPTs on the OpenAI site.[69] OpenAI's terms of service say that its subscribers may use "GPT" in the names of these, although it is "discouraged".[68]
Relatedly, OpenAI has applied to the United States Patent and Trademark Office (USPTO) to seek domestic trademark registration for the term "GPT" in the field of AI.[66] OpenAI sought to expedite handling of its application, but the USPTO declined that request in April 2023.[70] In May 2023, the USPTO responded to the application with a determination that "GPT" was both descriptive and generic.[71] As of November 2023, OpenAI continues to pursue its argument through the available processes. Regardless, failure to obtain a registered U.S. trademark does not preclude some level of common-law trademark rights in the U.S.[72] and trademark rights in other countries.[73]
For any given type or scope of trademark protection in the U.S., OpenAI would need to establish that the term is actually "distinctive" to their specific offerings in addition to being a broader technical term for the kind of technology. Some media reports suggested in 2023 that OpenAI may be able to obtain trademark registration based indirectly on the fame of its GPT-based chatbot product, ChatGPT,[70][74] for which OpenAI has separately sought protection (and which it has sought to enforce more strongly).[75] Other reports have indicated that registration for the bare term "GPT" seems unlikely to be granted,[66][76] as it is used frequently as a common term to refer simply to AI systems that involve generative pre-trained transformers.[3][77][78][79] In any event, to whatever extent exclusive rights in the term may exist in the U.S., others would need to avoid using it for similar products or services in ways likely to cause confusion.[76][80] If such rights ever became broad enough to implicate other well-established uses in the field, the trademark doctrine of descriptive fair use could still allow continued non-brand-related usage.[81]
Erhan, Dumitru; Courville, Aaron; Bengio, Yoshua; Vincent, Pascal (March 31, 2010). "Why Does Unsupervised Pre-training Help Deep Learning?". Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics. JMLR Workshop and Conference Proceedings: 201–208. Archived from the original on January 24, 2024. Retrieved January 24, 2024.
Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N.; Kaiser, Łukasz; Polosukhin, Illia (June 12, 2017). "Attention Is All You Need". In I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan and R. Garnett (eds.). 31st Conference on Neural Information Processing Systems. Advances in Neural Information Processing Systems. Vol. 30. Curran Associates, Inc. arXiv:1706.03762.
Ouyang, Long; Wu, Jeff; Jiang, Xu; et al. (November 4, 2022). "Training language models to follow instructions with human feedback". NeurIPS. arXiv:2203.02155.
Luo, Renqian; et al. (April 3, 2023). "BioGPT: Generative pre-trained transformer for biomedical text generation and mining". Briefings in Bioinformatics. 23 (6): bbac409. arXiv:2210.10341. doi:10.1093/bib/bbac409. PMID 36156661.