Movatterモバイル変換

Veo (text-to-video model)

From Wikipedia, the free encyclopedia

Video-generating machine learning model

Veo
A video generated by Veo 3 of an owl and badger
Developer	Google DeepMind
Initial release	May 2024; 1 year ago (2024-05)

Stable release	Veo 3.1 / 15 October 2025; 3 months ago (2025-10-15)

Type	Text-to-video model
Website	deepmind.google/models/veo/

Artificial intelligence (AI)
Part ofa series on

Major goals Artificial general intelligence Intelligent agent Recursive self-improvement Planning Computer vision General game playing Knowledge representation Natural language processing Robotics AI safety
Approaches Machine learning Symbolic Deep learning Bayesian networks Evolutionary algorithms Hybrid intelligent systems Systems integration Open-source AI data centers
Applications Bioinformatics Deepfake Earth sciences Finance Generative AI Art Audio Music Government Healthcare Mental health Industry Software development Translation Military Physics Projects
Philosophy AI alignment Artificial consciousness The bitter lesson Chinese room Friendly AI Ethics Existential risk Turing test Uncanny valley Human–AI interaction
History Timeline Progress AI winter AI boom AI bubble
Controversies Deepfake pornography Taylor Swift deepfake pornography controversy Grok deepfake pornography controversy Google Gemini image generation controversy Pause Giant AI Experiments Removal of Sam Altman from OpenAI Statement on AI Risk Tay (chatbot) Théâtre D'opéra Spatial Voiceverse NFT plagiarism scandal
Glossary Glossary
v t e

Veo, orGoogle Veo, is atext-to-video model developed byGoogle DeepMind and announced in May 2024. As agenerative AI model, it creates videos based on userprompts. Veo 3, released in May 2025, can also generate accompanying audio.

Development

[edit]

In May 2024, amultimodal video generation model called Veo was announced atGoogle I/O 2024.^[1] Google claimed that it could generate1080p videos over a minute long.^[1] In December 2024,Google released Veo 2, available via VideoFX. It supports4K resolution video generation and has an improved understanding of physics.^[2] In April 2025, Google announced that Veo 2 became available for advanced users on the Gemini app.^[3]

In May 2025, Google released Veo 3, which not only generates videos but also creates synchronized audio — including dialogue, sound effects, and ambient noise — to match the visuals.^[4] Google also announcedFlow, a video-creation tool powered by Veo andImagen.^[5]^[6] Google DeepMind CEODemis Hassabis described the release as the moment when AI video generation left the era of thesilent film.^[6]

Capabilities and limitations

[edit]

ALGBTQ romantic thriller short film, generated by Google Veo 3. This video is an example of detailed, diverse, realistic character models; continuity with characters and environments between cuts; music; voice acting; subtitles; andproduct placement.

Google Veo can be purchased at multiple subscription tiers and through Google "AI credits". The software itself can be run by two different consoles,Google Gemini and Google Flow. Gemini being geared towards shorter, quicker, and faster projects, using the Gemini AI chat model, with Google Flow, which is essentially amovie editor allowing users to create longer projects with continuity, using the same characters and actors. Users can create a maximum of eight seconds per clip.^[7] Additionally, video content can be created using Whisk in theGoogle Labs platform.^[8]^[9]

Google Veo has a simple interface and dashboard. However, those who have little to no experience intranscribing orfilmmaking may face issues when writing prompts, with the software misunderstanding what the user intended by their prompt. So prompts, which are the forefront of the software, need to be not only clear but also specific. When it comes to human models, Veo is able to generate several ethnicities and body types. The software is also capable of generatingstand up comedy routines,music videos, animals, cartoons, and animation. Prompts need places, people, and things in each scene, in addition knowledge of film and camera lingo such aspanning,zooming, and terms forcamera angles.^[10]

Veo, however, has strict guidelines and blockades to their software. Before a clip is generated, the algorithm computer software reviews it, and if it is

inappropriate
too graphically sexual
illegal
showcasing graphic abuse, assault, or fighting (unless the prompt specifies that it is a fictitious martial arts scene etc.), gross behaviors
antisemitic
racist
homophobic
depicting currentregimes,rioting,blood, gore, orwarfare, (unless in some cases the prompt specifies that it is fictitiousperiod drama)

the clip will not be generated. In addition, Google Veo cannot and will not generate character actors that look identical to celebrities or real-life individuals. Users have primarily complained that, regardless of how descriptive and detailed their prompts are, Google Veo often misunderstands the input, resulting in completely different outputs. Common issues include the emulation of incorrect subtitles and captions, the generation of complex scenes that are incomplete due to the maximum length, the production of garbled and nonsensical speech, and character models that appear deformed in both appearance and movement. Users have also reported that their prompts and generated content are falsely flagged as violating guidelines, along with a variety of other issues and complaints. However,trial and error may have to be used with Veo for optimal results.^[11]

Reactions

[edit]

A reporter forGizmodo reacted to the release of Veo 3 by observing that users were directing the model to generate low-quality content, such asman on the street interviews orhaul videos of peopleunboxing products.^[12] Another media commentator reported that the tool tended to repeat the same joke in response to different prompts.^[13]

Commentators speculated that Google had trained the service on YouTube videos^[6] orReddit posts.^[13] Google itself had not stated the source of its training content.^[6]

In July 2025,Media Matters for America reported thatracist andantisemitic videos generated using Veo 3 were being uploaded toTikTok.^[14]^[15] Ryan Whitwam ofArs Technica commented, "In a perfect world, Veo 3 would refuse to create these videos, but vagueness in the prompt and the AI's inability to understand the subtleties of racist tropes (i.e., the use of monkeys instead of humans in some videos) make it easy to skirt the rules."^[15]

References

[edit]

^^a ^bWiggers, Kyle (14 May 2024)."Google Veo, a serious swing at AI-generated video, debuts at Google I/O 2024".TechCrunch.
^"Google unveils improved AI video generator Veo 2 to rival OpenAI's Sora".The Hindu. 2024-12-17.ISSN 0971-751X. Retrieved2024-12-20.
^Wiggers, Kyle (2025-04-15)."Google's Veo 2 video generating model comes to Gemini".TechCrunch.Archived from the original on 2025-04-16. Retrieved2025-04-16.
^"Google launches Veo 3, an AI video generator that incorporates audio".CNBC. 2025-05-20. Retrieved2025-05-20.
^Peters, Jay (May 20, 2025)."Google has a new tool just for making AI videos".The Verge.Archived from the original on May 20, 2025. RetrievedMay 20, 2025.
^^a ^b ^c ^dWiggers, Kyle (20 May 2025)."Veo 3 can generate videos — and soundtracks to go along with them".TechCrunch.
^Caswell, Amanda (20 May 2025)."Google Veo 3 and Flow: The future of AI filmmaking is here, and here's how it works".Tomsguide.com.
^"labs.google/fx".labs.google. Retrieved2025-12-31.
^"Whisk - labs.google/fx".labs.google. Retrieved2025-12-31.
^Olteanu, Alex (22 May 2025)."Google's Veo 3: A Guide To Prompts, With Practical Examples".Datacamp.com.
^"Generative AI Prohibited Use Policy".Google.com. 17 December 2024.
^Pero, James (22 May 2025)."Google's Veo 3 Is Already Deepfaking All of YouTube's Most Smooth-Brained Content".Gizmodo.Archived from the original on 23 May 2025. Retrieved23 May 2025.
^^a ^bMaiberg, Emanuel (21 May 2025)."Why Does Google's New Veo 3 AI Video Generator Love This Dad Joke?".404 Media.
^Richards, Abbie (July 1, 2025)."Racist AI-generated videos are the newest slop garnering millions of views on TikTok".Media Matters for America. RetrievedJuly 4, 2025.
^^a ^bWhitwam, Ryan (2025-07-02)."TikTok is being flooded with racist AI videos generated by Google's Veo 3".Ars Technica. Retrieved2025-07-03.

External links

[edit]

Google AI

Computer
programs

AlphaGo

Versions	AlphaGo (2015) Master (2016) AlphaGo Zero (2017) AlphaZero (2017) MuZero (2019)
Competitions	Fan Hui (2015) Lee Sedol (2016) Ke Jie (2017)
In popular culture	AlphaGo (2017) The MANIAC (2023)

Other

AlphaFold (2018)
AlphaStar (2019)
AlphaDev (2023)
AlphaGeometry (2024)
AlphaGenome (2025)

Machine
learning

Neural networks	Inception (2014) WaveNet (2016) MobileNet (2017) Transformer (2017) EfficientNet (2019) Gato (2022)
Other	Quantum Artificial Intelligence Lab TensorFlow Tensor Processing Unit

Generative
AI

Chatbots	Assistant (2016) Sparrow (2022) Gemini (2023) Nano Banana (2025)
Models	BERT (2018) XLNet (2019) T5 (2019) LaMDA (2021) Chinchilla (2022) PaLM (2022) Imagen (2023) Gemini (2023) VideoPoet (2024) Gemma (2024) Veo (2024)
Other	DreamBooth (2022) NotebookLM (2023) Vids (2024) Gemini Robotics (2025) Antigravity (2025)

See also

Google

a subsidiary ofAlphabet

Company

Divisions

Subsidiaries

Active

Defunct

Programs

Events

Infrastructure

People

Current	Krishna Bharat Vint Cerf Jeff Dean John Doerr Sanjay Ghemawat Al Gore John L. Hennessy Urs Hölzle Salar Kamangar Ray Kurzweil Ann Mather Alan Mulally Rick Osterloh Sundar Pichai (CEO) Ruth Porat (CFO) Rajen Sheth Hal Varian Neal Mohan
Former	Andy Bechtolsheim Sergey Brin (co-founder) David Cheriton Matt Cutts David Drummond Alan Eustace Timnit Gebru Omid Kordestani Paul Otellini Larry Page (co-founder) Patrick Pichette Eric Schmidt Ram Shriram Amit Singhal Shirley M. Tilghman Rachel Whetstone Susan Wojcicki

Criticism

General	Censorship DeGoogle FairSearch "Google's Ideological Echo Chamber" No Tech for Apartheid Privacy concerns Street View YouTube Trade unions Alphabet Workers Union YouTube copyright issues
Incidents	Backdoor advertisement controversy Blocking of YouTube videos in Germany Data breach Elsagate Fantastic Adventures scandal Kohistan video case Reactions toInnocence of Muslims San Francisco tech bus protests Services outages Slovenian government incident Walkouts YouTube headquarters shooting

Other

Development

Software

A–C	Accelerated Linear Algebra AMP Actions on Google ALTS American Fuzzy Lop Android Cloud to Device Messaging Android Debug Bridge Android NDK Android Runtime Android SDK Android Studio Angular AngularJS Apache Beam APIs App Engine App Inventor App Maker App Runtime for Chrome AppJet Apps Script AppSheet ARCore Base Bazel BeyondCorp Bigtable BigQuery Bionic Blockly Borg Caja Cameyo Chart API Charts Chrome Frame Chromium Blink Closure Tools Cloud Connect Cloud Dataflow Cloud Datastore Cloud Messaging Cloud Shell Cloud Storage Code Search Compute Engine Cpplint
D–N	Dalvik Data Protocol Dialogflow Exposure Notification Fast Pair Fastboot Federated Learning of Cohorts File System Firebase Firebase Studio Firebase Cloud Messaging FlatBuffers Flutter Freebase Gadgets Ganeti Gears Gerrit Global Cache GLOP gRPC Gson Guava Guetzli Guice gVisor GYP JAX Jetpack Compose Keyhole Markup Language Kubernetes Kythe LevelDB Lighthouse Looker Studio lmctfy MapReduce Mashup Editor Matter Mobile Services Namebench Native Client Neatx Neural Machine Translation Nomulus
O–Z	Open Location Code OpenRefine OpenSocial Optimize OR-Tools Pack PageSpeed Piper Plugin for Eclipse Polymer Programmable Search Engine Project Shield Public DNS reCAPTCHA RenderScript SafetyNet SageTV Schema.org Search Console Shell Sitemaps Skia Graphics Engine Spanner Sputnik Stackdriver Swiffy Tango TensorFlow Tesseract Test Translator Toolkit Urchin UTM parameters V8 VirusTotal VisBug Wave Federation Protocol Weave Web Accelerator Web Designer Web Server Web Toolkit Webdriver Torso WebRTC

Operating systems

Machine learning models

Neural networks

Computer programs

Formats and codecs

Programming languages

Search algorithms

Domain names

Typefaces

Software

A	Aardvark Account Dashboard Takeout Ad Manager AdMob Ads AdSense Affiliate Network Alerts Allo Analytics Antigravity Android Auto Android Beam Answers Apture Arts & Culture Assistant Attribution Authenticator
B	BebaPay BeatThatQuote.com Beam Blog Search Blogger Body Bookmarks Books Ngram Viewer Browser Sync Building Maker Bump BumpTop Buzz
C	Calendar Cast Catalogs Chat Checkout Chrome Chrome Apps Chrome Experiments Chrome Remote Desktop Chrome Web Store Classroom Cloud Print Cloud Search Contacts Contributor Crowdsource Currents (social app) Currents (news app)
D	Data Commons Dataset Search Desktop Dictionary Dinosaur Game Directory Docs Docs Editors Domains Drawings Drive Duo
E	Earth Etherpad Expeditions Express
F	Family Link Fast Flip FeedBurner fflick Fi Wireless Finance Files Find Hub Fit Flights Flu Trends Fonts Forms Friend Connect Fusion Tables
G	Gboard Gemini Nano Banana Gesture Search Gizmo5 Google+ Gmail Goggles GOOG-411 Grasshopper Groups
H	Hangouts Helpouts Home
I	iGoogle Images Image Labeler Image Swirl Inbox by Gmail Input Tools Japanese Input Pinyin Insights for Search
J	Jaiku Jamboard
K	Kaggle Keep Knol
L	Labs Latitude Lens Like.com Live Transcribe Lively
M	Map Maker Maps Maps Navigation Marketing Platform Meet Messages Moderator My Tracks
N	Nearby Share News News & Weather News Archive Notebook NotebookLM Now
O	Offers One One Pass Opinion Rewards Orkut Oyster
P	Panoramio PaperofRecord.com Patents Page Creator Pay (mobile app) Pay (payment method) Pay Send People Cards Person Finder Personalized Search Photomath Photos Picasa Picasa Web Albums Picnik Pixel Camera Play Play Books Play Games Play Music Play Newsstand Play Pass Play Services Podcasts Poly Postini PostRank Primer Public Alerts Public Data Explorer
Q	Question Hub Quick, Draw! Quick Search Box Quick Share Quickoffice
R	Read Along Reader Reply
S	Safe Browsing SageTV Santa Tracker Schemer Scholar Search AI Overviews Knowledge Graph SafeSearch Searchwiki Sheets Shoploop Shopping Sidewiki Sites Slides Snapseed Socratic Softcard Songza Sound Amplifier Spaces Sparrow (chatbot) Sparrow (email client) Speech Recognition & Synthesis Squared Stadia Station Store Street View Surveys Sync
T	Tables Talk TalkBack Tasks Tenor Tez Tilt Brush Toolbar Toontastic 3D Translate Travel Trendalyzer Trends TV
U	URL Shortener
V	Video Vids Voice Voice Access Voice Search
W	Wallet Wave Waze WDYL Web Light Where Is My Train Widevine Wiz Word Lens Workspace Workspace Marketplace
Y	YouTube YouTube Kids YouTube Music YouTube Premium YouTube Shorts YouTube Studio YouTube TV YouTube VR

Hardware

Pixel

Smartphones	Pixel (2016) Pixel 2 (2017) Pixel 3 (2018) Pixel 3a (2019) Pixel 4 (2019) Pixel 4a (2020) Pixel 5 (2020) Pixel 5a (2021) Pixel 6 (2021) Pixel 6a (2022) Pixel 7 (2022) Pixel 7a (2023) Pixel Fold (2023) Pixel 8 (2023) Pixel 8a (2024) Pixel 9 (2024) Pixel 9 Pro Fold (2024) Pixel 9a (2025) Pixel 10 (2025) Pixel 10 Pro Fold (2025)
Smartwatches	Pixel Watch (2022) Pixel Watch 2 (2023) Pixel Watch 3 (2024) Pixel Watch 4 (2025)
Tablets	Pixel C (2015) Pixel Slate (2018) Pixel Tablet (2023)
Laptops	Chromebook Pixel (2013–2015) Pixelbook (2017) Pixelbook Go (2019)
Other	Pixel Buds (2017–present)

Nexus

Smartphones	Nexus One (2010) Nexus S (2010) Galaxy Nexus (2011) Nexus 4 (2012) Nexus 5 (2013) Nexus 6 (2014) Nexus 5X (2015) Nexus 6P (2015)
Tablets	Nexus 7 (2012) Nexus 10 (2012) Nexus 7 (2013) Nexus 9 (2014)
Other	Nexus Q (2012) Nexus Player (2014)

Other

v t e Litigation
Advertising	Feldman v. Google, Inc. (2007) Rescuecom Corp. v. Google Inc. (2009) Goddard v. Google, Inc. (2009) Rosetta Stone Ltd. v. Google, Inc. (2012) Google, Inc. v. American Blind & Wallpaper Factory, Inc. (2017) Jedi Blue
Antitrust	European Union (2010–present) United States v. Adobe Systems, Inc., Apple Inc., Google Inc., Intel Corporation, Intuit, Inc., and Pixar (2011) Umar Javeed, Sukarma Thapar, Aaqib Javeed vs. Google LLC and Ors. (2019) United States v. Google LLC (2020) Epic Games v. Google (2021) United States v. Google LLC (2023)
Intellectual property	Perfect 10, Inc. v. Amazon.com, Inc. (2007) Viacom International, Inc. v. YouTube, Inc. (2010) Lenz v. Universal Music Corp.(2015) Authors Guild, Inc. v. Google, Inc. (2015) Field v. Google, Inc. (2016) Google LLC v. Oracle America, Inc. (2021) Smartphone patent wars
Privacy	Rocky Mountain Bank v. Google, Inc. (2009) Hibnick v. Google, Inc. (2010) United States v. Google Inc. (2012) Judgement of the German Federal Court of Justice on Google's autocomplete function (2013) Joffe v. Google, Inc. (2013) Mosley v SARL Google (2013) Google Spain v AEPD and Mario Costeja González (2014) Frank v. Gaos (2019)
Other	Garcia v. Google, Inc. (2015) Google LLC v Defteros (2020) Gonzalez v. Google LLC (2022)

Concepts

Products

Android	Booting process Custom distributions Features Recovery mode Software development
Street View coverage	Africa Antarctica Asia Israel Europe North America Canada United States Oceania South America Argentina Chile Colombia
YouTube	Copyright strike Education Features Moderation Most-disliked videos Most-liked videos Most-subscribed channels Most-viewed channels Most-viewed videos Arabic music videos Chinese music videos French music videos Indian videos Pakistani videos Official channel Social impact YouTube Premium original programming
Other	Gmail interface Maps pin Most downloaded Google Play applications Stadia games

Documentaries

Books

Popular culture

Google Feud
Google Me (film)
"Google Me" (Kim Zolciak song)
"Google Me" (Teyana Taylor song)
Is Google Making Us Stupid?
Proceratium google
Matt Nathanson: Live at Google
The Billion Dollar Code
The Internship
Where on Google Earth is Carmen Sandiego?

Other

Italics denotediscontinued products.

Generative AI

Concepts

Chatbots

Models

Text	Claude Gemini Gemma GPT 1 2 3 J 4 4o 4.5 4.1 OSS 5 5.1 5.2 Llama o1 o3 o4-mini Qwen Velvet
Coding	Claude Code Cursor Devstral GitHub Copilot Kimi Qwen3-Coder Replit
Image	Aurora Firefly DALL-E Flux GPT Image Ideogram Imagen Nano Banana Midjourney Qwen-Image Recraft Seedream Stable Diffusion
Video	Dream Machine Hailuo AI Kling AI Runway Gen Seedance LTX-2 Sora Veo Wan
Speech	15.ai Eleven MiniMax Speech 2.5 WaveNet
Music	Eleven Music Endel Lyria Riffusion Suno Udio

Controversies

Agents

Companies

Category

Artificial intelligence (AI)

Concepts

Applications

Implementations

Audio–visual	AlexNet WaveNet Human image synthesis HWR OCR Computer vision Speech synthesis 15.ai ElevenLabs Speech recognition Whisper Facial recognition AlphaFold Text-to-image models Aurora DALL-E Firefly Flux GPT Image Ideogram Imagen Midjourney Recraft Stable Diffusion Text-to-video models Dream Machine Runway Gen Hailuo AI Kling Sora Veo Music generation Riffusion Suno AI Udio
Text	Word2vec Seq2seq GloVe BERT T5 Llama Chinchilla AI PaLM GPT 1 2 3 J ChatGPT 4 4o o1 o3 4.5 4.1 o4-mini 5 5.1 5.2 Claude Gemini Gemini (language model) Gemma Grok LaMDA BLOOM DBRX Project Debater IBM Watson IBM Watsonx Granite PanGu-Σ DeepSeek Qwen
Decisional	AlphaGo AlphaZero OpenAI Five Self-driving car MuZero Action selection AutoGPT Robot control

People

Architectures

Political

Social and economic

Category

Retrieved from "https://en.wikipedia.org/w/index.php?title=Veo_(text-to-video_model)&oldid=1337612931"

Categories:

Hidden categories:

[8]ページ先頭