Single Zone Provisioned Throughput

Single Zone Provisioned Throughput lets you reservethroughput in specific regions where only onezone isavailable. This option providespredictable performance for Gemini models in use cases where MLprocessing is required.

To view the list of supported models and regions, seeDeployments and endpoints. For the list ofregions and models that support ML processing, seeML processing.

Features of Single Zone Provisioned Throughput

This section outlines the key features of Single Zone Provisioned Throughput:

  • Pricing and units are consistent with standard Provisioned Throughput:Single Zone Provisioned Throughput uses the same measure of throughput (GSUs),pricing, and terms asstandardProvisioned Throughput.

  • Single Zone Provisioned Throughput supports in-region ML processing: All requests are processed in thepurchased region, including traffic that exceeds your purchased amount ofthroughput. This traffic is billed at thepay-as-you-go rateusing buffer capacity in the region.

  • You control the overages: You cancontrol overflow trafficusing the same headers as with standard Provisioned Throughput.

  • You can monitor your order: You can monitor your Single Zone Provisioned Throughput order using the existingProvisioned Throughput monitoring capabilities.

Limitations

Single Zone Provisioned Throughput has the following limitations:

Purchase Single Zone Provisioned Throughput

For assistance with purchasing Single Zone Provisioned Throughput,contact your Google Cloud account representative.

What's next

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-19 UTC.