Gemini 3 Flash

Preview

This product or feature is subject to the "Pre-GA Offerings Terms" in the General Service Terms section of theService Specific Terms, and theAdditional Terms for Generative AI Preview Products. You can process personal data for this product or feature as outlined in theCloud Data Processing Addendum, subject to the obligations and restrictions described in the agreement under which you access Google Cloud. Pre-GA products and features are available "as is" and might have limited support. For more information, see thelaunch stage descriptions.

Gemini 3 Flash combines Gemini 3 Pro's reasoning capabilitieswith the Flash line's levels on latency, efficiency, and cost. It not onlyenables everyday tasks with improved reasoning, but is designed to tackle themost complex agentic workflows.

Gemini 3 Flash uses several new features to improve performance,control, and multimodal fidelity:

For more information on using these features, seeGet started withGemini3.

Try inVertex AIView inModel Garden(Preview) Deploy example app

Note: To use the "Deploy example app" feature, you need a Google Cloud project with billing and Vertex AI API enabled.
Model IDgemini-3-flash-preview
Supported inputs & outputs
  • Inputs:
    Text,Code,Images,Audio,Video,PDF
  • Outputs:
    Text
Token limits
  • Maximum input tokens: 1,048,576
  • Maximum output tokens: 65,536
Capabilities
Consumption options
SeeConsumption options for more information.
Technical specifications
Images
  • Maximum images per prompt: 900
  • Maximum file size per file for inline data or direct uploads through the console: 7 MB
  • Maximum file size per file from Google Cloud Storage: 30 MB
  • Default resolution tokens: 1120
  • Supported MIME types:
    image/png,image/jpeg,image/webp,image/heic,image/heif
Documents
  • Maximum number of files per prompt: 900
  • Maximum number of pages per file: 900
  • Maximum file size per file for the API or Cloud Storage imports: 50 MB
  • Maximum file size per file for direct uploads through the console: 7 MB
  • Default resolution tokens: 560
  • OCR for scanned PDFs: Not used by default
  • Supported MIME types:
    application/pdf,text/plain
Video
  • Maximum video length (with audio): Approximately 45 minutes
  • Maximum video length (without audio): Approximately 1 hour
  • Maximum number of videos per prompt: 10
  • Default resolution tokens per frame: 70
  • Supported MIME types:
    video/x-flv,video/quicktime,video/mpeg,video/mpegs,video/mpg,video/mp4,video/webm,video/wmv,video/3gpp
Audio
  • Maximum audio length per prompt: Approximately 8.4 hours, or up to 1 million tokens
  • Maximum number of audio files per prompt: 1
  • Speech understanding for: Audio summarization, transcription, and translation
  • Supported MIME types:
    audio/x-aac,audio/flac,audio/mp3,audio/m4a,audio/mpeg,audio/mpga,audio/mp4,audio/ogg,audio/pcm,audio/wav,audio/webm
Parameter defaults
  • Temperature: 0.0-2.0 (default 1.0)
  • topP: 0.0-1.0 (default 0.95)
  • topK: 64 (fixed)
  • candidateCount: 1–8 (default 1)
Supported regions

Model availability

  • Global
    • global
SeeDeployments and endpoints for more information.
Knowledge cutoff dateJanuary 2025
Versions
  • gemini-3-flash-preview
    • Launch stage: Public preview
    • Release date: December 17, 2025
Supported languagesSeeSupported languages.
PricingSeePricing.

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-19 UTC.