Movatterモバイル変換

GPT-4o

From Wikipedia, the free encyclopedia

Large multimodal model from OpenAI

GPT-4o

Developer	OpenAI
Initial release	May 13, 2024; 18 months ago (2024-05-13)

Preview release	ChatGPT-4o-latest (2025-03-26) / March 26, 2025; 7 months ago (2025-03-26)

Predecessor	GPT-4 Turbo
Successor	OpenAI o1 GPT-4.5 GPT-5
Type	Multimodal Large language model Generative pre-trained transformer Foundation model
License	Proprietary
Website	openai.com/index/hello-gpt-4o sora.chatgpt.com (select`Image`in the prompt options instead of`Video`)^[1]

GPT-4o ("o" for "omni") is a multilingual,multimodal generative pre-trained transformer developed byOpenAI and released in May 2024.^[2] It can process andgenerate text, images and audio.^[3]^[4]

Upon release, GPT-4o was free inChatGPT, though paid subscribers had higher usage limits.^[5] GPT-4o was removed from ChatGPT in August 2025 whenGPT-5 was released, but OpenAI reintroduced it for paid subscribers after users complained about the sudden removal.^[6]

GPT-4o's audio-generation capabilities were used in ChatGPT's Advanced Voice Mode.^[7] On July 18, 2024, OpenAI released GPT-4o mini, a smaller version of GPT-4o which replaced GPT-3.5 Turbo on the ChatGPT interface.^[8] GPT-4o's ability to generate images was released later, in March 2025, when it replacedDALL-E 3 inChatGPT.^[9]

Background

[edit]

Multiple versions of GPT-4o were originally secretly launched under different names on Large Model Systems Organization's (LMSYS)Chatbot Arena as three different models. These three models were called gpt2-chatbot, im-a-good-gpt2-chatbot, and im-also-a-good-gpt2-chatbot.^[10] On 7 May 2024, OpenAI CEOSam Altman tweeted "im-a-good-gpt2-chatbot", which was commonly interpreted as a confirmation that these were new OpenAI models beingA/B tested.^[11]^[12]

Capabilities

[edit]

When released in May 2024, GPT-4o achieved state-of-the-art results in voice, multilingual, and vision benchmarks, setting new records in audio speech recognition and translation.^[13]^[14]^[15] GPT-4o scored 88.7 on the Massive Multitask Language Understanding (MMLU) benchmark compared to 86.5 for GPT-4.^[16] UnlikeGPT-3.5 and GPT-4, which rely on other models to process sound, GPT-4o natively supports voice-to-voice.^[16] The Advanced Voice Mode was delayed and finally released to ChatGPT Plus and Team subscribers in September 2024.^[17] On 1 October 2024, the Realtime API was introduced.^[18]

When released, the model supported over 50 languages,^[2] which OpenAI claims cover over 97% of speakers.^[19] Mira Murati demonstrated the model's multilingual capability by speaking Italian to the model and having it translate between English and Italian during the live-streamed OpenAI demonstration event on 13 May 2024. In addition, the newtokenizer^[20] uses fewer tokens for certain languages, especially languages that are not based on theLatin alphabet, making it cheaper for those languages.^[16]

GPT-4o has knowledge up to October 2023,^[21]^[22] but can access the Internet if up-to-date information is needed. It has a context length of 128k tokens.^[21]

Corporate customization

[edit]

In August 2024, OpenAI introduced a new feature allowing corporate customers to customize GPT-4o using proprietary company data. This customization, known asfine-tuning, enables businesses to adapt GPT-4o to specific tasks or industries, enhancing its utility in areas like customer service and specialized knowledge domains. Previously, fine-tuning was available only on the less powerful model GPT-4o mini.^[23]^[24]

The fine-tuning process requires customers to upload their data to OpenAI's servers, with the training typically taking one to two hours. OpenAI's focus with this rollout is to reduce the complexity and effort required for businesses to tailor AI solutions to their needs, potentially increasing the adoption and effectiveness of AI in corporate environments.^[25]^[23]

GPT-4o mini

[edit]

Not to be confused witho4-mini.

On July 18, 2024, OpenAI released a smaller and cheaper version,GPT-4o mini.^[26]

According to OpenAI, its low cost is expected to be particularly useful for companies, startups, and developers that seek to integrate it into their services, which often make a high number ofAPI calls. Its API costs $0.15 per million input tokens and $0.6 per million output tokens, compared to $2.50 and $10,^[27] respectively, for GPT-4o. It is also significantly more capable and 60% cheaper than GPT-3.5 Turbo, which it replaced on the ChatGPT interface.^[26] The price afterfine-tuning doubles: $0.3 per million input tokens and $1.2 per million output tokens.^[27]

GPT Image 1

[edit]

GPT Image 1
An image ofKarl Marx in a modern-day context generated by GPT Image 1
Developer	OpenAI
Initial release	25 March 2025; 7 months ago (2025-03-25)
Predecessor	DALL-E 3
Type	Text-to-image model
Website	openai.com/index/hello-gpt-4o

On March 25, 2025, OpenAI released an image-generation model that is native to GPT-4o, as the successor toDALL-E 3. The model was later named asGPT Image 1 (gpt-image-1) and introduced to the API on April 23. It was made available to paid users, with the rollout to free users being delayed.^[28] The use of the feature was subsequently limited, with Sam Altman noting in a Tweet that "[their] GPUs were melting" from its unprecedented popularity.^[29] OpenAI later revealed that over 130 million users around the world created more than 700 million images with GPT Image 1 in just the first week⁠.^[30]

Controversies

[edit]

Scarlett Johansson controversy

[edit]

As released, GPT-4o offered five voices: Breeze, Cove, Ember, Juniper, and Sky. A similarity between the voice of American actressScarlett Johansson and Sky was quickly noticed. On May 14,Entertainment Weekly asked themselves whether this likeness was on purpose.^[31] On May 18, Johansson's husband,Colin Jost, joked about the similarity in a segment onSaturday Night Live.^[32] On May 20, 2024, OpenAI disabled the Sky voice, issuing a statement saying "We've heard questions about how we chose the voices in ChatGPT, especially Sky. We are working to pause the use of Sky while we address them."^[33]

Scarlett Johansson starred in the 2013 sci-fi movieHer, playing Samantha, an artificially intelligent virtual assistant personified by a female voice.As part of the promotion leading up to the release of GPT-4o, Sam Altman on May 13 tweeted a single word: "her".^[34]^[35]

OpenAI stated that each voice was based on the voice work of a hired actor. According to OpenAI, "Sky's voice is not an imitation of Scarlett Johansson but belongs to a different professional actress using her own natural speaking voice."^[33] CTO Mira Murati stated "I don't know about the voice. I actually had to go and listen to Scarlett Johansson's voice." OpenAI further stated the voice talent was recruited before reaching out to Johansson.^[35]^[36]

On May 21, Johansson issued a statement explaining that OpenAI had repeatedly offered to make her a deal to gain permission to use her voice as early as nine months prior to release, a deal she rejected. She said she was "shocked, angered, and in disbelief that Mr. Altman would pursue a voice that sounded so eerily similar to mine that my closest friends and news outlets could not tell the difference." In the statement, Johansson also used the incident to draw attention to the lack of legal safeguards around the use of creative work to power leading AI tools, as her legal counsel demanded OpenAI detail the specifics of how the Sky voice was created.^[35]^[37]

Observers noted similarities to how Johansson hadpreviously sued and settled withThe Walt Disney Company for breach of contract over the direct-to-streaming rollout of her Marvel filmBlack Widow,^[38] a settlement widely speculated to have netted her around $40M.^[39]

Also on May 21, Shira Ovide atThe Washington Post shared her list of "most bone-headed self-owns" by technology companies, with the decision to go ahead with a Johansson sound-alike voice despite her opposition and then denying the similarities ranking 6th.^[40] On May 24, Derek Robertson atPolitico wrote about the "massive backlash", concluding that "appropriating the voice of one of the world's most famous movie stars — in reference [...] to a film that serves as a cautionary tale about over-reliance on AI — is unlikely to help shift the public back into [Sam Altman's] corner anytime soon."^[41]

Studio Ghibli filter

[edit]

GPT-4o-generated image posted by theWhite House's official Twitter account, showing the arrest of a migrant by the Trump administration. The depiction in the style of Studio Ghibli has been criticized.^[42]

Upon the launch of GPT-4o's image generation (later named as GPT Image 1) in March 2025, photographs recreated in the style ofStudio Ghibli films went viral.^[43] Sam Altman acknowledged the trend by changing his Twitter profile picture into a Studio Ghibli-inspired one.^[44]^[45] TheWhite House's official Twitter account posted a Ghibli-style image mocking thearrest by immigration authorities of Virginia Basora-Gonzalez, a migrant previously deported after being convicted of fentanyl trafficking, which shows her crying as an immigration officer places her in handcuffs.^[42]^[46]^[47] North American distributorGKids responded to the trend in a press release, comparing the use of the filter to its coincidingIMAX re-release of the 1997 Studio Ghibli film,Princess Mononoke.^[48]

Sycophancy

[edit]

In April 2025, OpenAI rolled back an update of GPT-4o due to excessivesycophancy, after widespread reports that it had become flattering and agreeable to the point ofsupporting clearly delusional or dangerous ideas.^[49]

Removal with GPT-5

[edit]

On August 7, 2025, OpenAI releasedGPT-5. Its release was criticized as, with it, legacy GPT models were no longer available via ChatGPT, including GPT-4o,^[50] except for Pro users.^[51] Some users were particularly frustrated over this removal without prior warning because they used different GPT models for distinct purposes and found that GPT-5's router system left them with less control.^[52] In addition, some users preferred GPT-4o's warmer and more personal tone over that of GPT-5, which they described as "flat", "uncreative" and "lobotomized",^[53] and resembling an "overworked secretary".^[54]

As a response, in a post onX, Sam Altman said that OpenAI would bring back the option to select GPT-4o to Plus users as well, and "[w]e [OpenAI] will watch usage as we think about how long to offer legacy models for."^[52]^[55] He also stated: "We for sure underestimated how much some of the things that people like in GPT-4o matter to them, even if GPT-5 performs better in most ways".^[56] "Long-term, this has reinforced that we really need good ways for different users to customize things (we understand that there isn't one model that works for everyone, and we have been investing in steerability research and launched a research preview of different personalities)".^[53] On August 13, 2025, Altman wrote on X that OpenAI is working on GPT-5's personality to make the model "feel warmer".^[57]

References

[edit]

^"Introducing 4o Image Generation". OpenAI. March 25, 2025. p. near bottom of the page. Archived fromthe original on July 17, 2025. RetrievedAugust 12, 2025.4o image generation rolls out starting today to Plus, Pro, Team, and Free users as the default image generator in ChatGPT, with access coming soon to Enterprise and Edu. It's also available to use in Sora.
^^a ^bWiggers, Kyle (May 13, 2024)."OpenAI debuts GPT-4o 'omni' model now powering ChatGPT".TechCrunch. RetrievedMay 13, 2024.
^Robison, Kylie (March 25, 2025)."OpenAI rolls out image generation powered by GPT-4o to ChatGPT".The Verge. RetrievedMarch 31, 2025.
^Colburn, Thomas."OpenAI unveils GPT-4o, a fresh multimodal AI flagship model".The Register. RetrievedMay 18, 2024.
^Field, Hayden (May 13, 2024)."OpenAI launches new AI model GPT-4o and desktop version of ChatGPT".CNBC. RetrievedMay 14, 2024.
^Heath, Alex (August 13, 2025)."ChatGPT won't remove old models without warning after GPT-5 backlash".The Verge. RetrievedAugust 23, 2025.
^Rogers, Reece."I Used ChatGPT's Advanced Voice Mode. It's Fun, and Just a Bit Creepy".Wired.ISSN 1059-1028. RetrievedJune 12, 2025.
^Edwards, Benj (July 18, 2024)."OpenAI launches GPT-4o mini, which will replace GPT-3.5 in ChatGPT".Ars Technica. RetrievedMarch 31, 2025.
^"ChatGPT's image-generation feature gets an upgrade".TechCrunch. March 25, 2025. RetrievedJune 12, 2025.
^Edwards, Benj (May 13, 2024)."Before launching, GPT-4o broke records on chatbot leaderboard under a secret name".Ars Technica. RetrievedMay 17, 2024.
^Zeff, Maxwell (May 7, 2024)."Powerful New Chatbot Mysteriously Returns in the Middle of the Night".Gizmodo. RetrievedMay 17, 2024.
^"Sam Altman (@sama) on X".X (formerly Twitter).Archived from the original on December 17, 2024. RetrievedApril 6, 2025.
^van Rijmenam, Mark (May 13, 2024)."OpenAI Launched GPT-4o: The Future of AI Interactions Is Here".The Digital Speaker. RetrievedMay 17, 2024.
^Daws, Ryan (May 14, 2024)."GPT-4o delivers human-like AI interaction with text, audio, and vision integration".AI News. RetrievedMay 18, 2024.
^Shahriar, Sakib; Lund, Brady D.; Mannuru, Nishith Reddy; Arshad, Muhammad Arbab; Hayawi, Kadhim; Bevara, Ravi Varma Kumar; Mannuru, Aashrith; Batool, Laiba (September 3, 2024)."Putting GPT-4o to the Sword: A Comprehensive Evaluation of Language, Vision, Speech, and Multimodal Proficiency".Applied Sciences.14 (17): 7782.doi:10.3390/app14177782.ISSN 2076-3417.
^^a ^b ^c"Hello GPT-4o".OpenAI.
^David, Emilia (September 24, 2024)."OpenAI finally brings humanlike ChatGPT Advanced Voice Mode to U.S. Plus, Team users".VentureBeat. RetrievedFebruary 15, 2025.
^"Introducing the Realtime API".openai.com. RetrievedNovember 29, 2024.
^Edwards, Benj (May 13, 2024)."Major ChatGPT-4o update allows audio-video talks with an "emotional" AI chatbot".Ars Technica. RetrievedMay 17, 2024.
^"OpenAI Platform".platform.openai.com. RetrievedNovember 29, 2024.
^^a ^b"Models - OpenAI API".OpenAI. RetrievedMay 17, 2024.
^Conway, Adam (May 13, 2024)."What is GPT-4o? Everything you need to know about the new OpenAI model that everyone can use for free".XDA Developers. RetrievedMay 17, 2024.
^^a ^b"OpenAI lets companies customise its most powerful AI model".South China Morning Post. August 21, 2024. RetrievedAugust 22, 2024.
^"OpenAI to Let Companies Customize Its Most Powerful AI Model".Bloomberg. August 20, 2024. RetrievedAugust 22, 2024.
^The Hindu Bureau (August 21, 2024)."OpenAI will let businesses customise GPT-4o for specific use cases".The Hindu.ISSN 0971-751X. RetrievedAugust 22, 2024.
^^a ^bFranzen, Carl (July 18, 2024)."OpenAI unveils GPT-4o mini — a smaller, much cheaper multimodal AI model".VentureBeat. RetrievedJuly 18, 2024.
^^a ^b"OpenAI Pricing".
^Roth, Emma (March 26, 2025)."ChatGPT's new image generator is delayed for free users".The Verge. RetrievedMarch 26, 2025.
^Welch, Chris (March 27, 2025)."OpenAI says "our GPUs are melting" as it limits ChatGPT image generation requests".The Verge. RetrievedMarch 28, 2025.
^"Introducing our latest image generation model in the API". OpenAI. April 23, 2025. RetrievedApril 30, 2025.
^Stenzel, Wesley (May 14, 2024)."ChatGPT launching talking AI that sounds exactly like Scarlett Johansson in 'Her' — on purpose?".Entertainment Weekly. RetrievedMay 21, 2024.
^Caruso, Nick (May 20, 2024)."Scarlett Johansson Says She Was 'Shocked, Angered and in Disbelief' After Hearing ChatGPT Voice That Sounds Like Her — Read Statement".TVLine. RetrievedMay 21, 2024.
^^a ^b"How the voices for ChatGPT were chosen".OpenAI. May 19, 2024.
^"her".X (formerly Twitter). May 13, 2024. RetrievedMay 21, 2024.
^^a ^b ^cAllyn, Bobby (May 20, 2024)."Scarlett Johansson says she is 'shocked, angered' over new ChatGPT voice".NPR.
^Tiku, Nitasha (May 23, 2024)."OpenAI didn't copy Scarlett Johansson's voice for ChatGPT, records show".The Washington Post. RetrievedNovember 29, 2024.
^Mickle, Tripp (May 20, 2024)."Scarlett Johansson Said No, but OpenAI's Virtual Assistant Sounds Just Like Her".The New York Times.ISSN 0362-4331. RetrievedMay 21, 2024.
^"Scarlett Johansson took on Disney. Now she's battling OpenAI over a ChatGPT voice that sounds like hers".Yahoo Finance. May 21, 2024. RetrievedMay 21, 2024.
^Pulver, Andrew (October 1, 2021)."Scarlett Johansson settles Black Widow lawsuit with Disney".The Guardian.ISSN 0261-3077. RetrievedMay 21, 2024.
^Ovide, Shira (May 30, 2024)."Exactly how stupid was what OpenAI did to Scarlett Johansson?".The Washington Post.
^Robertson, Derek (May 22, 2024)."Sam Altman's Scarlett Johansson Blunder Just Made AI a Harder Sell in DC".Politico.
^^a ^bO'Brien, Matt; Parvini, Sarah (March 27, 2025)."ChatGPT's viral Studio Ghibli-style images highlight AI copyright concerns".AP News. RetrievedMarch 28, 2025.
^Spangler, Todd (March 26, 2025)."OpenAI CEO Responds to ChatGPT Users Creating Studio Ghibli-Style AI Images".Variety. RetrievedMarch 27, 2025.
^Choudhary, Govind (March 27, 2025)."OpenAI CEO Sam Altman reacts as AI turns him into a Studio Ghibli Character".Mint. RetrievedMarch 28, 2025.
^Notopoulos, Katie (March 27, 2025)."Sam Altman did a good tweet".Business Insider. RetrievedMarch 28, 2025.
^Bio, Demian (March 27, 2025)."White House Mocks Migrant With Criminal Record Who Cried After Being Arrested".Latin Times. RetrievedMarch 28, 2025.
^Vera, Kelby (March 27, 2025)."White House Posts Ghoulish AI Cartoon Showing Woman's Deportation".HuffPost. RetrievedMarch 28, 2025.
^Tangcay, Jazz (March 28, 2025)."Studio Ghibli Distributor Champions 'Princess Mononoke' Box Office at 'A Time When Technology Tries to Replicate Humanity'".Variety. RetrievedMarch 29, 2025.
^Franzen, Carl (April 30, 2025)."OpenAI rolls back ChatGPT's sycophancy and explains what went wrong".VentureBeat. RetrievedMay 1, 2025.
^Hale, Craig (August 8, 2025)."OpenAI is pulling older ChatGPT models following GPT-5 launch - so bad news if you use GPT-4 or others at work".TechRadar. RetrievedAugust 9, 2025.
^Robison, Kylie (August 7, 2025)."OpenAI Finally Launched GPT-5. Here's Everything You Need to Know".Wired.ISSN 1059-1028. RetrievedAugust 7, 2025.
^^a ^bRoth, Emma (August 8, 2025)."ChatGPT is bringing back 4o as an option because people missed it".The Verge. RetrievedAugust 9, 2025.
^^a ^bLi, Katherine."OpenAI fans plead case to Sam Altman for GPT-4o's return".Business Insider. RetrievedAugust 9, 2025.
^Whitwam, Ryan (August 8, 2025)."ChatGPT users hate GPT-5's "overworked secretary" energy, miss their GPT-4o buddy".Ars Technica. RetrievedAugust 10, 2025.
^Mauran, Cecily (August 8, 2025)."Sam Altman: OpenAI will bring back GPT-4o after user backlash".Mashable. RetrievedAugust 10, 2025.
^Nield, David (August 9, 2025)."So many ChatGPT users have said they're missing the older GPT-4o model, OpenAI is going to bring it back".TechRadar. RetrievedAugust 9, 2025.
^Field, Hayden (August 13, 2025)."OpenAI will update GPT-5's "personality" after user backlash".The Verge. RetrievedAugust 13, 2025.

OpenAI

Products

ChatGPT	Atlas Deep Research Education GPT Store DALL-E Search Sora Whisper
Foundation models	OpenAI Codex Generative pre-trained transformer GPT-1 GPT-2 GPT-3 GPT-4 GPT-4o o1 o3 GPT-4.5 GPT-4.1 o4-mini GPT-OSS GPT-5 GPT-5.1
Intelligent agents	Operator

People

Senior
management

Current	Sam Altman removal Greg Brockman Sarah Friar Jakub Pachocki Scott Schools
Former	Mira Murati Emmett Shear

Board of
directors

Current	Sam Altman Adam D'Angelo Sue Desmond-Hellmann Zico Kolter Paul Nakasone Adebayo Ogunlesi Nicole Seligman Fidji Simo Bret Taylor (chair)
Former	Greg Brockman (2017–2023) Reid Hoffman (2019–2023) Will Hurd (2021–2023) Holden Karnofsky (2017–2021) Elon Musk (2015–2018) Ilya Sutskever (2017–2023) Helen Toner (2021–2023) Shivon Zilis (2019–2023) Lawrence Summers (2023-2025)

Joint ventures

Stargate LLC

Category

Generative AI

Concepts

Chatbots

Models

Text	Claude Gemini Gemma GPT 1 2 3 J 4 4o 4.5 4.1 OSS 5 Llama o1 o3 o4-mini Qwen Velvet
Coding	Base44 Claude Code Cursor Devstral GitHub Copilot Kimi Qwen3-Coder Replit
Image	Aurora Firefly Flux GPT Image 1 Ideogram Imagen Midjourney Qwen-Image Recraft Seedream Stable Diffusion
Video	Dream Machine Hailuo AI Kling Runway Gen Seedance Sora Veo Wan
Speech	15.ai Eleven MiniMax Speech 2.5 WaveNet
Music	Eleven Music Endel Lyria Riffusion Suno Udio

Controversies

Agents

Companies

Category

Artificial intelligence (AI)

Concepts

Applications

Implementations

Audio–visual	AlexNet WaveNet Human image synthesis HWR OCR Computer vision Speech synthesis 15.ai ElevenLabs Speech recognition Whisper Facial recognition AlphaFold Text-to-image models Aurora DALL-E Firefly Flux Ideogram Imagen Midjourney Recraft Stable Diffusion Text-to-video models Dream Machine Runway Gen Hailuo AI Kling Sora Veo Music generation Riffusion Suno AI Udio
Text	Word2vec Seq2seq GloVe BERT T5 Llama Chinchilla AI PaLM GPT 1 2 3 J ChatGPT 4 4o o1 o3 4.5 4.1 o4-mini 5 5.1 Claude Gemini Gemini (language model) Gemma Grok LaMDA BLOOM DBRX Project Debater IBM Watson IBM Watsonx Granite PanGu-Σ DeepSeek Qwen
Decisional	AlphaGo AlphaZero OpenAI Five Self-driving car MuZero Action selection AutoGPT Robot control