Run AI inference on Cloud Run with GPUs

Use GPUs to run AI inference on Cloud Run. If you are new to AI concepts,seeGPUs for AI.GPUs are used to train and run AI models. This can give you more stableperformance with the ability to scale workloads depending on your overallutilization. See GPU support forservices,jobs, andworker poolsto learn more about GPU configurations.

Tutorials for services

Tutorials for jobs

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-18 UTC.