Veo (text-to-video model)

From Wikipedia, the free encyclopedia
Video-generating machine learning model
Veo
A video generated by Veo 3 of an owl and badger
Developer: Google DeepMind
Initial release: May 2024
Stable release: Veo 3.1 / 15 October 2025
Type: Text-to-video model
Website: deepmind.google/models/veo/

Veo, or Google Veo, is a text-to-video model developed by Google DeepMind and announced in May 2024. As a generative AI model, it creates videos based on user prompts. Veo 3, released in May 2025, can also generate accompanying audio.

Development

In May 2024, a multimodal video generation model called Veo was announced at Google I/O 2024.[1] Google claimed that it could generate 1080p videos over a minute long.[1] In December 2024, Google released Veo 2, available via VideoFX. It supports 4K resolution video generation and has an improved understanding of physics.[2] In April 2025, Google announced that Veo 2 had become available for advanced users on the Gemini app.[3]

In May 2025, Google released Veo 3, which not only generates videos but also creates synchronized audio (including dialogue, sound effects, and ambient noise) to match the visuals.[4] Google also announced Flow, a video-creation tool powered by Veo and Imagen.[5][6] Google DeepMind CEO Demis Hassabis described the release as the moment when AI video generation left the era of the silent film.[6]

Capabilities and limitations

An LGBTQ romantic thriller short film generated by Google Veo 3. This video is an example of detailed, diverse, realistic character models; continuity of characters and environments between cuts; music; voice acting; subtitles; and product placement.

Access to Veo is sold through several subscription tiers and through Google "AI credits". The model can be used through two interfaces, Google Gemini and Google Flow. Gemini, which is built around the Gemini AI chat model, is geared towards shorter, quicker projects. Flow is essentially a movie editor that lets users create longer projects and maintain continuity by reusing the same characters and actors. Each generated clip has a maximum length of eight seconds.[7]

Veo has a relatively simple interface and dashboard. However, users with little or no experience in transcribing or filmmaking may find that the software misunderstands what they intended by a prompt, no matter how detailed it was. Prompts, which are central to the software, need to be short and to the point but also very specific if the output is to match the user's vision for the project. When generating human models, Veo can produce a range of ethnicities and body types. It can also generate stand-up comedy routines and music videos, as well as animals, cartoons, and animation. Prompts must accurately describe the places, people, and things in each scene; knowledge of film and camera terminology, such as panning, zooming, and terms for camera angles, is also important.[8]

Veo enforces strict content guidelines. Before a clip is generated, the software reviews the request and refuses it if the content is deemed inappropriate, too graphically sexual, or illegal; if it shows graphic abuse, assault, or fighting (unless the prompt specifies that it is, for example, a fictitious martial arts scene) or gross behavior; if it is antisemitic, racist, or homophobic; or if it depicts reigning regimes, rioting, blood, gore, or warfare (although in some cases a clip may still be generated if the prompt specifies that it is a fictitious period drama). In addition, Veo will not generate characters that look identical to celebrities or other real-life individuals. Users have primarily complained that, regardless of how descriptive and detailed their prompts are, Veo often misunderstands the input and produces completely different outputs. Common issues include incorrect subtitles and captions, complex scenes that are left incomplete because of the maximum eight-second length, garbled and nonsensical speech, and character models that appear deformed in both appearance and movement. Users have also reported that their prompts and generated content are falsely flagged as violating guidelines. As a result, trial and error may be needed to get optimal results from Veo.[9]

Reactions

A reporter for Gizmodo reacted to the release of Veo 3 by observing that users were directing the model to generate low-quality content, such as man-on-the-street interviews or haul videos of people unboxing products.[10] Another media commentator reported that the tool tended to repeat the same joke in response to different prompts.[11]

Commentators speculated that Google had trained the service on YouTube videos[6] or Reddit posts.[11] Google itself had not stated the source of its training content.[6]

In July 2025, Media Matters for America reported that racist and antisemitic videos generated using Veo 3 were being uploaded to TikTok.[12][13] Ryan Whitwam of Ars Technica commented, "In a perfect world, Veo 3 would refuse to create these videos, but vagueness in the prompt and the AI's inability to understand the subtleties of racist tropes (i.e., the use of monkeys instead of humans in some videos) make it easy to skirt the rules."[13]

References

  1. ^ a b Wiggers, Kyle (14 May 2024). "Google Veo, a serious swing at AI-generated video, debuts at Google I/O 2024". TechCrunch.
  2. ^ "Google unveils improved AI video generator Veo 2 to rival OpenAI's Sora". The Hindu. 17 December 2024. ISSN 0971-751X. Retrieved 20 December 2024.
  3. ^ Wiggers, Kyle (15 April 2025). "Google's Veo 2 video generating model comes to Gemini". TechCrunch. Archived from the original on 16 April 2025. Retrieved 16 April 2025.
  4. ^ "Google launches Veo 3, an AI video generator that incorporates audio". CNBC. 20 May 2025. Retrieved 20 May 2025.
  5. ^ Peters, Jay (20 May 2025). "Google has a new tool just for making AI videos". The Verge. Archived from the original on 20 May 2025. Retrieved 20 May 2025.
  6. ^ a b c d Wiggers, Kyle (20 May 2025). "Veo 3 can generate videos — and soundtracks to go along with them". TechCrunch.
  7. ^ Caswell, Amanda (20 May 2025). "Google Veo 3 and Flow: The future of AI filmmaking is here, and here's how it works". Tomsguide.com.
  8. ^ Olteanu, Alex (22 May 2025). "Google's Veo 3: A Guide To Prompts, With Practical Examples". Datacamp.com.
  9. ^ "Generative AI Prohibited Use Policy". Google.com. 17 December 2024.
  10. ^ Pero, James (22 May 2025). "Google's Veo 3 Is Already Deepfaking All of YouTube's Most Smooth-Brained Content". Gizmodo. Archived from the original on 23 May 2025. Retrieved 23 May 2025.
  11. ^ a b Maiberg, Emanuel (21 May 2025). "Why Does Google's New Veo 3 AI Video Generator Love This Dad Joke?". 404 Media.
  12. ^ Richards, Abbie (1 July 2025). "Racist AI-generated videos are the newest slop garnering millions of views on TikTok". Media Matters for America. Retrieved 4 July 2025.
  13. ^ a b Whitwam, Ryan (2 July 2025). "TikTok is being flooded with racist AI videos generated by Google's Veo 3". Ars Technica. Retrieved 3 July 2025.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Veo_(text-to-video_model)&oldid=1318052487"