About Autopilot mode workloads in GKE Standard

Autopilot Standard

You can useComputeClasses to run Google Kubernetes Engine (GKE)Autopilot workloads in your GKE Standard modeclusters. This document describes the methods that you can use to run yourworkloads in Autopilot mode and helps you to decide when to run aworkload in a specific mode.

This information is intended for the following people:

Cloud architects who want to optimize operational costs inorganizations.
Platform administrators who want to reduce the overhead of manualinfrastructure management.
Site reliability engineers (SREs) who want to shift infrastructuremaintenance, upgrades, and scaling to Google Cloud when possible.

You should already be familiar with the following concepts:

About GKE Autopilot

Autopilot is amode of operation in GKE in whichGoogle manages your node infrastructure, scaling, security, and pre-configuredfeatures. Autopilot mode is optimized for running most productionworkloads in an environment that applies recommended settings for security,reliability, performance, and scalability. To decide between Autopilotmode and Standard mode based on your requirements, seeAbout GKE modes of operation.

You can use Autopilot mode in the following ways:

Create a cluster that uses Autopilotmode:Google manages the entire cluster and applies best practices for automation,reliability, security, and costs.
Run workloads in Autopilot mode in Standard clusters:you deploy Autopilot ComputeClasses and select them in workloads.Google manages the nodes that GKE creates for those specificworkloads in anAutopilot-managed nodepool. Youcontrol the cluster and can run your own Standard node poolsalongside the nodes that GKE manages.

About Autopilot mode for ComputeClasses

A ComputeClass is a Kubernetes custom resource that defines a list of nodeconfigurations, like machine types or feature settings. You can select specificComputeClasses in Kubernetes workload specifications. When a workload thatselects a ComputeClass needs a new node, GKE attempts toprovision the node with one of the configurations that the ComputeClassdeclares. GKE tries each configuration in the ComputeClass inorder and falls back to the next configuration if node creation fails. For moreinformation, seeAbout custom ComputeClasses.

To run Autopilot workloads in your GKE Standardclusters, you enable Autopilot mode in a ComputeClass and select thatComputeClass in specific workloads. Google manages any new nodes thatGKE provisions for these workloads, similar to how Google managesthe nodes in Autopilot clusters. Most of the benefits and securityfeatures of Autopilot mode apply to those workloads and the host nodes.

Autopilot mode ComputeClasses provide cluster administrators withadditional flexibility to choose the level of control that you want overspecific workloads and infrastructure in your cluster, such as in the followingways:

You can let GKE fully manage specific workloads by running themin Autopilot mode.
You retain full control over workloads and infrastructure that don't useAutopilot mode, such as manually created node pools.
You can set an Autopilot ComputeClass as thedefault for your cluster or namespace,so that workloads run in Autopilot mode unless they explicitlyrequest a different option.

These options let cluster admins decide on the level and the scope with whichthey use Autopilot.

Benefits of Autopilot ComputeClasses in Standard clusters

Running some of your workloads in Autopilot mode provides benefitslike the following:

Reduce infrastructure management costs: Google upgrades, maintains,configures, and fine-tunes specific nodes for you.
Use the Autopilot pricing model: workloads that use anAutopilot ComputeClass are billing using the Autopilotpricing model. This pricing model includes per-Pod billing for workloadsthat don't request specific hardware. For more information, see thePricing section.
Improve scaling and security posture:Autopilot workloads get benefits like access to thecontainer-optimized compute platform,improved default security constraints, and node autoscaling based on resourcerequests. The nodes for those workloads use features like node auto-upgradesand automatic repairs.
Improve reliability: the GKEservice-level agreement (SLA)includes a Pod uptime service-level objective (SLO) for Autopilot.

Many of these benefits are also provided by Autopilot clusters, whichalso provide a more managed experience than Standard clusters andinclude multiple security, networking, and resource management benefits. Formore information, seeAutopilot overview.

Hardware selection in Autopilot ComputeClasses

In Autopilot ComputeClasses, you can select specific hardware for yournodes (like GPUs or machine types), or you can let GKE place Podson a general-purpose, container-optimized compute platform. The general-purposeoption is recommended for most production workloads that don't require specifichardware to run well.

The following table describes these configuration options, how to choose one ina ComputeClass, and how this choice affects your billing model:

**Table 1.** Hardware selection in Autopilot ComputeClasses
Workload requirement	Recommended ComputeClass configuration	Billing model
General-purpose workloads	Use an Autopilot ComputeClass that has the`podFamily` priority rule to run workloads that don't require specific hardware on the Autopilot container-optimized compute platform. This platform works well for general-purpose workloads like the following: Web servers Event-driven jobs Batch processing CI/CD pipelines Thebuilt-in Autopilot ComputeClasses that are available for Standard clusters use the`podFamily` priority rule.	Pod-based billing model
Workloads that need specific hardware	Use a ComputeClass that uses any available hardware configuration rule, such as the`machineFamily` rule or the`gpus` rule.	Node-based billing model

Configuration of Autopilot in ComputeClasses

You can use Autopilot mode in a Standard cluster by using abuilt-in Autopilot ComputeClass that GKE provides,or by enabling Autopilot in any custom ComputeClass that you create.The following sections describe each option.

Built-in Autopilot ComputeClasses

GKE configures specific Autopilot ComputeClasses foryou. You canselect these built-in Autopilot classesin any eligible cluster. The built-in Autopilot ComputeClasses inStandard clusters use thepodFamily priority rule to run Pods onthe container-optimized compute platform. For more information, seeAbout built-in ComputeClasses in GKE.

Custom Autopilot ComputeClasses

You can enable Autopilot in any custom ComputeClass that you manage.This option is useful if your workloads have specific hardware requirements.Theautopilot field in theComputeClass custom resource lets you enable ordisable Autopilot in a specific ComputeClass.

To enable Autopilot in an existing ComputeClass, you must delete it,update the configuration, and then recreate the ComputeClass in your cluster.Your changes apply to any new nodes that GKE creates forworkloads that you deploy after you update the Autopilot ComputeClass.

For more information about enabling Autopilot in your customComputeClasses, seeSelect specific hardware for your Autopilot Pods.

Pricing

GKE Autopilot pricing applies to the nodes and workloadsthat GKE creates for an Autopilot ComputeClass. Thefollowing table describes the billing model that applies to differentAutopilot ComputeClass configurations in your Standard modeclusters.

Note: only one of the following billing models can apply to a ComputeClass,because you can't configure a custom ComputeClass to use thepodFamilypriority rule.

**Table 2.** Pricing for Autopilot ComputeClasses
Billing models for different ComputeClass configurations
Pod-based billing model	The Pod-based billing model applies to Autopilot compute classes that use the`podFamily` priority rule instead of selecting specific machines or hardware. Thebuilt-in Autopilot ComputeClasses, which use the`podFamily` rule, use the Pod-based billing model.
Node-based billing model	The node-based billing model applies to Autopilot compute classes that explicitly request specific node configurations, such as N2 instances or GPUs.

Autopilot pricing applies only to the workloads and nodes that use anAutopilot ComputeClass. Your Standard mode cluster and anyother node pools that you run continue to use GKE Standard mode pricing.

Pre-configured settings for nodes managed by Autopilot

Before you enable Autopilot mode in your ComputeClasses, know what toexpect from the nodes that GKE creates to run theAutopilot workloads. Google configures specific features and securityconstraints in Autopilot nodes. As a result, workloads that deploy andfunction correctly in your Standard mode nodes might be rejected byAutopilot mode if they don't meet the security requirements ofAutopilot.

The following table describes the feature configurations that override thecorresponding settings in your Standard cluster. If a configurationisn't in this table, the Autopilot nodes use the Standardcluster setting. For example, Workload Identity Federation for GKE isn't in this table, whichmeans that the Workload Identity Federation for GKE setting of the Standard clusterapplies to the Autopilot nodes that GKE creates.

**Table 3.** Pre-configured settings for Autopilot nodes
Feature	Standard cluster-level setting	Autopilot-managed node setting
Node upgrades and maintenance	Configurable: Node auto-repair Node auto-upgrade Node upgrade strategy	Pre-configured: Node auto-repair: enabled Node auto-upgrade: enabled Node upgrade strategy:surge upgrades with pre-configured parameters
Autoscaling	Configurable:Autoscaling profile	Pre-configured:`optimize-utilization` autoscaling profile
Networking	VPC-native or routes-based	Requires a VPC-native cluster
Security	Configurable: Node IAM service account Access scopes Secure boot	Pre-configured: Node IAM service account: uses theservice account for node auto-provisioning. Access scopes: uses theaccess scopes for node auto-provisioning. Secure boot: enabled Node security constraints:built-in Autopilot security configurations, enforced on a best-effort basis.
Node operating system	Configurable: Node image Node system configuration Linux cgroup mode	Pre-configured: Node image:Container-Optimized OS with containerd Node system configuration: configurable only with the ComputeClass`nodeSystemConfig` field. cgroup mode: v2
Node boot disk	Configurable: Boot disk type Boot disk size	Configurable: Boot disk type: uses the value in the ComputeClass`storage.bootDiskType` field. If this field isn't set, GKE sets the boot disk type as follows: If the ComputeClass uses`podFamily` rules, GKE uses a`pd-balanced` disk. If the ComputeClass doesn't use`podFamily` rules, GKE uses the default boot disk type for the cluster. Boot disk size: GKE uses the value in the compute class`storage.bootDiskSize` field. If this field isn't set, GKE sets the boot disk size as follows: If you set adefault boot disk size for node auto-provisioning, GKE uses that value. If you don't set a default boot disk size for node auto-provisioning, GKE uses the cluster default size.
Node metadata	Resource Manager Tags for firewall policies: updates to cluster-level tags don't affect existing nodes.	Resource Manager Tags for firewall policies: updates to cluster-level tags affect existing nodes.

Resource requests for Autopilot workloads

For Autopilot workloads to run efficiently, GKE enforcescertain minimum and maximum values for CPU, memory, and ephemeral storagerequests in your Pods. GKE also applies default requests to Pods that don't explicitly request one of these resources.The specific values for the minimum, maximum, and default resource requirementsin GKE Autopilot workloads vary based on the type ofhardware that your Pods use.

For ephemeral storage, the default value if you don't request ephemeral storageis the same for all ComputeClasses and hardware selections. For moreinformation, seeDefault resource requests.

The following table provides links to the CPU and memory requirements for yourPod requests, depending on the type of hardware:

**Table 4.** Autopilot CPU and memory requirements
Resource type	Minimum and maximum requests	Default requests
General-purpose Pods `podFamily` priority rule	See the "General-purpose" row in theMinimums and maximums for ComputeClasses table.	See the "General-purpose" row in theDefault requests for ComputeClasses table.
GPUs and TPUs	Depends on the type and quantity of hardware accelerator. For more information, seeMinimums and maximums for the Accelerator ComputeClass.	Depends on the type and quantity of hardware accelerator. For more information, seeDefault requests for accelerators.
Specific Compute Engine machine types and machine families	Minimum: no minimum values for CPU or memory. Maximum: the maximum value is the resource capacity of the Compute Engine instance.	For any Compute Engine machine type or machine family, the default requests in the "General-purpose" row in theDefault requests for ComputeClasses table.

What's next

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-18 UTC.

Movatterモバイル変換

About Autopilot mode workloads in GKE Standard Stay organized with collections Save and categorize content based on your preferences.