Google models

Featured Gemini models

Gemini 3 Pro

Designed for comprehensive multimodal understanding and complex problem solving

Features a 1 million token context window
Excels in agentic workflows and autonomous coding tasks
Designed for complex multimodal tasks and advanced reasoning
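
As a rough illustration of what a 1 million token context window means in practice, the sketch below estimates whether a prompt would plausibly fit. It is not an official tokenizer: the 4-characters-per-token ratio is an assumed average for English text, the 1,000,000 limit is the rounded figure quoted above, and the function names and the `reserved_for_output` default are hypothetical. For exact counts, use the token-counting facility of the API you are calling.

```python
# Rough pre-flight check: could a prompt plausibly fit in the context window?
CONTEXT_WINDOW_TOKENS = 1_000_000  # rounded figure quoted in the text above
CHARS_PER_TOKEN = 4                # ASSUMPTION: crude average for English text


def estimate_tokens(text: str) -> int:
    """Estimate the token count of `text` using ceiling division."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(text: str, reserved_for_output: int = 8_192) -> bool:
    """Return True if the estimated prompt size plus an output reserve fits.

    `reserved_for_output` is a hypothetical safety margin, not a documented limit.
    """
    return estimate_tokens(text) + reserved_for_output <= CONTEXT_WINDOW_TOKENS
```

A heuristic like this is only useful for cheap early rejection of oversized inputs; borderline cases still need a real token count.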

Gemini 2.5 Flash Image

Jumpstart your creative workflow with image generation and conversational editing

Generate high-quality images
Capable of turn-based conversational editing
Same balance of speed and price as Gemini 2.5 Flash

Gemini 2.5 Flash with Gemini Live API

Low-latency, real-time conversational AI

Provides rich, natural voice interactions with 30 HD voices in 24 languages
Interrupt the model more naturally and reliably, even in loud and noisy environments

Generally available Gemini models

Gemini 2.5 Pro: Our high-capability model for complex reasoning and coding. Features adaptive thinking capabilities to solve complex agentic and multimodal challenges with a 1 million token context.

Gemini 2.5 Flash: Lightning-fast and highly capable. Delivers a balance of intelligence and latency with controllable thinking budgets for versatile applications.

Gemini 2.5 Flash Image: Turn ideas into production-ready assets. Features conversational editing, multi-image fusion, and character consistency for advanced creative workflows.

Gemini 2.5 Flash-Lite: Built for massive scale. Balances cost and performance for high-throughput tasks, optimized for efficiency without sacrificing multimodal understanding.

Gemini 2.5 Flash with Gemini Live API: Designed for real-time, bidirectional streaming. Features low-latency built-in audio and affective dialogue capabilities for natural, conversational interactions.

Gemini 2.0 Flash: Multimodal performance for developers needing a cost-effective model for general-purpose tasks.

Gemini 2.0 Flash-Lite: Streamlined and ultra-efficient for simple, high-frequency tasks where speed and price are the priority.

Preview Gemini models

Gemini 3 Pro: Our latest reasoning-first model optimized for complex agentic workflows and coding. Features adaptive thinking, a 1M token context window, and integrated grounding for sophisticated multimodal problem solving.

Gemini 3 Pro Image: High-fidelity image generation with reasoning-enhanced composition. Supports legible text rendering, complex multi-turn editing, and character consistency using up to 14 reference inputs.

Gemma models

Gemma 3n: An open model designed for efficient execution on low-resource devices, supporting multimodal input (text, image, video, and audio) and text output in over 140 languages.

Gemma 3: An open model featuring text and image input, support for over 140 languages, and a 128K context window.

Gemma 2: An open model supporting text generation, summarization, and extraction.

Gemma: A small, lightweight open model supporting text generation, summarization, and extraction.

ShieldGemma 2: Instruction-tuned models for evaluating text and image safety against defined policies.

PaliGemma: An open vision-language model combining SigLIP and Gemma.

CodeGemma: A powerful, lightweight open model for coding tasks, including code completion, generation, and understanding.

TxGemma: A model that generates predictions, classifications, or text based on therapeutic-related data, for building AI models with less data and compute.

MedGemma: A collection of Gemma 3 variants trained for performance on medical text and image comprehension.

MedSigLIP: A SigLIP variant trained to encode medical images and text into a common embedding space.

T5Gemma: A family of lightweight encoder-decoder research models.

Embeddings models

Embeddings for Text: Converts text data into vector representations for semantic search, classification, and clustering.

Multimodal Embeddings: Generates vectors based on images, for tasks such as image classification and search.
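
Semantic search over embedding vectors typically comes down to ranking stored vectors by their similarity to a query vector. The sketch below shows that ranking step with cosine similarity in plain Python. The three-dimensional vectors, document IDs, and function names are hypothetical placeholders: real embedding models return vectors with far more dimensions, and the call that produces them is deliberately left out.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (doc_id, score) pairs sorted from most to least similar."""
    scored = [(doc_id, cosine_similarity(query_vec, vec))
              for doc_id, vec in doc_vecs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# HYPOTHETICAL toy vectors standing in for the output of an embedding model.
DOC_VECTORS = {
    "refund-policy": [0.9, 0.1, 0.0],
    "shipping-times": [0.1, 0.9, 0.1],
    "api-reference": [0.0, 0.1, 0.9],
}
QUERY_VECTOR = [0.8, 0.2, 0.0]

ranking = rank_documents(QUERY_VECTOR, DOC_VECTORS)
for doc_id, score in ranking:
    print(f"{doc_id}: {score:.3f}")
```

With these toy vectors, "refund-policy" ranks first because its vector points in nearly the same direction as the query. For large collections, a linear scan like this is usually replaced by a vector index.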

Imagen models

Imagen 4 for Generation: Use text prompts to generate novel images with higher quality than our previous image generation models.

Imagen 4 for Fast Generation: Use text prompts to generate novel images with higher quality and lower latency than our previous image generation models.

Imagen 4 for Ultra Generation: Use text prompts to generate novel images with higher quality and better prompt adherence than our previous image generation models.

Imagen 3 for Generation 002: Use text prompts to generate novel images.

Imagen 3 for Generation 001: Use text prompts to generate novel images.

Imagen 3 for Fast Generation: Use text prompts to generate novel images with lower latency than our other image generation models.

Imagen 3 for Editing and Customization: Edits existing images or generates new images based on text prompts and provided context.

Preview Imagen models

Virtual Try-On: Generates images of people wearing clothing products.

Imagen product recontext on Vertex AI: Edits product images to place them in different scenes or backgrounds based on text prompts.

Veo models

Veo 2 Generate: Generates videos from text prompts and images.

Veo 3 Generate: Generates videos from text prompts and images with high quality.

Veo 3 Fast: Generates videos from text prompts and images with high quality and low latency.

Veo 3.1 Generate: Generates videos from text prompts and images with high quality.

Veo 3.1 Fast: Generates videos from text prompts and images with high quality and low latency.

Preview Veo models

Veo 3 Generate preview: Generates videos from text prompts and images with high quality.

Veo 3 Fast preview: Generates videos from text prompts and images with high quality and low latency.

Veo 3.1 Generate preview: Generates videos from text prompts and images with high quality.

Veo 3.1 Fast preview: Generates videos from text prompts and images with high quality and low latency.

Veo 2 Preview: Generates videos from text prompts and images, supporting inpaint and outpaint.

Experimental Veo models

Veo 2 Experimental: An experimental model with features under test.

MedLM models

Caution: MedLM is deprecated. Access to MedLM will no longer be available on or after September 29, 2025.

MedLM-medium: A HIPAA-compliant model for medical question answering and summarization of healthcare documents.

MedLM-large-large: A HIPAA-compliant model for medical question answering and summarization of healthcare documents.

Language support

Gemini

All the Gemini models can understand and respond in the following languages:

Afrikaans (af), Albanian (sq), Amharic (am), Arabic (ar), Armenian (hy), Assamese (as), Azerbaijani (az), Basque (eu), Belarusian (be), Bengali (bn), Bosnian (bs), Bulgarian (bg), Catalan (ca), Cebuano (ceb), Chinese (Simplified and Traditional) (zh), Corsican (co), Croatian (hr), Czech (cs), Danish (da), Dhivehi (dv), Dutch (nl), English (en), Esperanto (eo), Estonian (et), Filipino (Tagalog) (fil), Finnish (fi), French (fr), Frisian (fy), Galician (gl), Georgian (ka), German (de), Greek (el), Gujarati (gu), Haitian Creole (ht), Hausa (ha), Hawaiian (haw), Hebrew (iw), Hindi (hi), Hmong (hmn), Hungarian (hu), Icelandic (is), Igbo (ig), Indonesian (id), Irish (ga), Italian (it), Japanese (ja), Javanese (jv), Kannada (kn), Kazakh (kk), Khmer (km), Korean (ko), Krio (kri), Kurdish (ku), Kyrgyz (ky), Lao (lo), Latin (la), Latvian (lv), Lithuanian (lt), Luxembourgish (lb), Macedonian (mk), Malagasy (mg), Malay (ms), Malayalam (ml), Maltese (mt), Maori (mi), Marathi (mr), Meiteilon (Manipuri) (mni-Mtei), Mongolian (mn), Myanmar (Burmese) (my), Nepali (ne), Norwegian (no), Nyanja (Chichewa) (ny), Odia (Oriya) (or), Pashto (ps), Persian (fa), Polish (pl), Portuguese (pt), Punjabi (pa), Romanian (ro), Russian (ru), Samoan (sm), Scots Gaelic (gd), Serbian (sr), Sesotho (st), Shona (sn), Sindhi (sd), Sinhala (Sinhalese) (si), Slovak (sk), Slovenian (sl), Somali (so), Spanish (es), Sundanese (su), Swahili (sw), Swedish (sv), Tajik (tg), Tamil (ta), Telugu (te), Thai (th), Turkish (tr), Ukrainian (uk), Urdu (ur), Uyghur (ug), Uzbek (uz), Vietnamese (vi), Welsh (cy), Xhosa (xh), Yiddish (yi), Yoruba (yo), and Zulu (zu).

Gemma

Gemma and Gemma 2 support only the English (en) language. Gemma 3 and Gemma 3n provide multilingual support in over 140 languages.

Embeddings

Multilingual text embedding models support the following languages:

Afrikaans (af), Albanian (sq), Amharic (am), Arabic (ar), Armenian (hy), Azerbaijani (az), Basque (eu), Belarusian (be), Bengali (bn), Bulgarian (bg), Catalan (ca), Cebuano (ceb), Chinese (Simplified and Traditional) (zh), Corsican (co), Czech (cs), Danish (da), Dutch (nl), English (en), Esperanto (eo), Estonian (et), Filipino (Tagalog) (fil), Finnish (fi), French (fr), Frisian (fy), Galician (gl), Georgian (ka), German (de), Greek (el), Gujarati (gu), Haitian Creole (ht), Hausa (ha), Hawaiian (haw), Hebrew (iw), Hindi (hi), Hmong (hmn), Hungarian (hu), Icelandic (is), Igbo (ig), Indonesian (id), Irish (ga), Italian (it), Japanese (ja), Javanese (jv), Kannada (kn), Kazakh (kk), Khmer (km), Korean (ko), Kurdish (ku), Kyrgyz (ky), Lao (lo), Latin (la), Latvian (lv), Lithuanian (lt), Luxembourgish (lb), Macedonian (mk), Malagasy (mg), Malay (ms), Malayalam (ml), Maltese (mt), Maori (mi), Marathi (mr), Mongolian (mn), Myanmar (Burmese) (my), Nepali (ne), Nyanja (Chichewa) (ny), Norwegian (no), Pashto (ps), Persian (fa), Polish (pl), Portuguese (pt), Punjabi (pa), Romanian (ro), Russian (ru), Samoan (sm), Scots Gaelic (gd), Serbian (sr), Sesotho (st), Shona (sn), Sindhi (sd), Sinhala (Sinhalese) (si), Slovak (sk), Slovenian (sl), Somali (so), Spanish (es), Sundanese (su), Swahili (sw), Swedish (sv), Tajik (tg), Tamil (ta), Telugu (te), Thai (th), Turkish (tr), Ukrainian (uk), Urdu (ur), Uzbek (uz), Vietnamese (vi), Welsh (cy), Xhosa (xh), Yiddish (yi), Yoruba (yo), and Zulu (zu).

Imagen 3

Imagen 3 supports the following languages:

English (en), Chinese (Simplified and Traditional) (zh), Hindi (hi), Japanese (ja), Korean (ko), Portuguese (pt), and Spanish (es).
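
A language list like this lends itself to a simple validation step before a prompt is sent. The sketch below is a minimal example: the codes come from the list above, while the helper name and the choice to match only on the primary subtag (so that "pt-BR" is treated as "pt") are assumptions made for illustration, not documented behavior of Imagen 3.

```python
# Language codes taken from the Imagen 3 list above.
IMAGEN_3_LANGUAGES = {
    "en": "English",
    "zh": "Chinese (Simplified and Traditional)",
    "hi": "Hindi",
    "ja": "Japanese",
    "ko": "Korean",
    "pt": "Portuguese",
    "es": "Spanish",
}


def is_supported(language_tag: str) -> bool:
    """Check a tag such as "en", "pt-BR", or "zh_TW" against the list.

    ASSUMPTION: only the primary subtag is compared; region and script
    subtags are ignored.
    """
    primary = language_tag.strip().lower().replace("_", "-").split("-")[0]
    return primary in IMAGEN_3_LANGUAGES


print(is_supported("ja"))     # True
print(is_supported("pt-BR"))  # True
print(is_supported("fr"))     # False
```

The same pattern applies to the longer Gemini and embeddings lists by swapping in the corresponding set of codes.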

MedLM

The MedLM model supports the English (en) language.

Explore all models in Model Garden

Model Garden is a platform that helps you discover, test, customize, and deploy Google proprietary and select OSS models and assets. To explore the generative AI models and APIs that are available on Vertex AI, go to Model Garden in the Google Cloud console.

To learn more about Model Garden, including available models and capabilities, see Explore AI models in Model Garden.

Model versions

To see all model versions, including legacy and retired models, see Model versions and lifecycle.

What's next

Try a quickstart tutorial using Vertex AI Studio or the Vertex AI API.
Explore pretrained models in Model Garden.
Learn how to control access to specific models in Model Garden by using a Model Garden organization policy.
Learn about pricing.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-12-13 UTC.
