Scale and autoscale runtime services

You are currently viewing version 1.1 of the Apigee hybrid documentation. This version is end of life. You should upgrade to a newer version. For more information, see Supported versions.

You can scale most services running in Kubernetes from the command line or in a configuration override. You can set scaling parameters for Apigee hybrid runtime services in the overrides.yaml file.
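As a rough illustration of what such scaling parameters might look like, the sketch below writes a hypothetical overrides fragment to a temporary file and prints it. The property names (replicaCount, replicaCountMin, replicaCountMax, targetCPUUtilizationPercentage) and the object names (cassandra, mart, udca) are the ones this page describes. Every value is an assumption chosen for illustration, not a recommendation, and a real overrides.yaml file contains many other settings.

```shell
# Write a hypothetical overrides fragment to a temporary file for review.
# ASSUMPTION: all numeric values are illustrative placeholders.
snippet=$(mktemp)
cat > "$snippet" <<'EOF'
cassandra:
  replicaCount: 6                       # StatefulSet: fixed replica count
mart:
  replicaCountMin: 2                    # lower bound for the autoscaler
  replicaCountMax: 4                    # upper bound for the autoscaler
  targetCPUUtilizationPercentage: 75    # scale-up threshold
udca:
  replicaCountMin: 2
  replicaCountMax: 4
  targetCPUUtilizationPercentage: 75
EOF

# Show the fragment so it can be compared with the real overrides.yaml file.
cat "$snippet"
```

Writing to a temporary file keeps the sketch from touching a real overrides.yaml file; merge any settings by hand after review.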

How you scale and autoscale each service depends on the type of service.

TIP: You can use the -w option to continuously watch your nodes, as the following example shows:
kubectl get nodes -w ...

For more information on scaling, see the Kubernetes documentation.

The following table summarizes the hybrid runtime services and how you can scale and autoscale them in configurations or from the command line:

Service: Cassandra
Implemented as: StatefulSet
Scaling: (Scale up only)

To scale up on the command line, you can use the kubectl scale or patch commands.

To scale up via configuration, set the value of the cassandra object's replicaCount configuration property in the overrides.yaml file.

NOTE: The default replication factor for all keyspaces is three. As a result, Apigee recommends that you scale the replicas in multiples of three.

For more information on setting configuration properties, see Manage runtime plane components.

StatefulSets do not support autoscaling. Cassandra is deployed in sets of 3 replicas. Apigee recommends that you add 3 additional nodes to the Cassandra nodepool as standby worker nodes.

To scale the number of Cassandra nodes down, see Scale down Cassandra.

Service: Ingress/LoadBalancer
Implemented as: 3rd-party proxy
Scaling: Depends on the type of Ingress.

Service: Logger
Implemented as: DaemonSet
Scaling: DaemonSets manage replicas of a pod on all nodes, so they scale when you scale the nodes themselves.

Service: MART, Synchronizer, Metrics, MPs
Implemented as: ApigeeDeployment (Custom CRD)
Scaling: To scale on the command line, you can use the kubectl scale command.

To scale via configuration, increase the value of the Deployment's replicaCountMin configuration property for the mart, synchronizer, metrics, or mp object.

Deployments use a Horizontal Pod Autoscaler for autoscaling. Set the Deployment object's targetCPUUtilizationPercentage property to the threshold for scaling up; when this value is exceeded, Kubernetes adds pods up to the value of replicaCountMax.

For more information on setting configuration properties, see Manage runtime plane components.

Service: UDCA
Implemented as: ApigeeDeployment (Custom CRD)
Scaling: To scale via configuration, increase the value of the udca object's replicaCountMin property in the overrides.yaml file.

The UDCA service uses a Horizontal Pod Autoscaler for autoscaling. Set the udca object's targetCPUUtilizationPercentage property to the threshold for scaling up; when this value is exceeded, Kubernetes adds pods up to the value of replicaCountMax.

For more information on setting configuration properties, see Manage runtime plane components.
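The Cassandra guidance in the table above can be sketched as a small dry-run helper. It reads the replication-factor note as "keep the replica count a multiple of three", which is an interpretation rather than something the page states outright. The StatefulSet name apigee-cassandra and the namespace apigee are assumptions; confirm the real names in your cluster before running anything. The helper only prints the kubectl scale and kubectl patch commands and does not run them.

```shell
# Round a desired replica count up to the next multiple of three.
next_multiple_of_three() {
  echo $(( ($1 + 2) / 3 * 3 ))
}

# Print (without running) kubectl commands that would scale Cassandra up.
# ASSUMPTION: the StatefulSet is "apigee-cassandra" in namespace "apigee".
# Check the real names with: kubectl get statefulsets --all-namespaces
cassandra_scale_commands() {
  replicas=$(next_multiple_of_three "$1")
  echo "kubectl -n apigee scale statefulset apigee-cassandra --replicas=${replicas}"
  echo "kubectl -n apigee patch statefulset apigee-cassandra -p '{\"spec\":{\"replicas\":${replicas}}}'"
}

# Asking for 4 replicas is rounded up to 6.
cassandra_scale_commands 4
```

Printing the commands first gives you a chance to review the target name and replica count, which matters here because the table describes these commands as a way to scale up only.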
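To make the autoscaling behavior described above more concrete, the following sketch approximates how a Horizontal Pod Autoscaler picks a replica count from CPU utilization. It uses the formula from the Kubernetes documentation, desired = ceil(currentReplicas * currentUtilization / targetUtilization), and clamps the result between replicaCountMin and replicaCountMax. The function name and the sample numbers are hypothetical, and the sketch ignores the autoscaler's tolerance and stabilization behavior, so treat it as an approximation only.

```shell
# Approximate the replica count a Horizontal Pod Autoscaler would choose.
# Arguments: current_replicas current_cpu_percent target_cpu_percent min max
# ASSUMPTION: simplified model; tolerance and stabilization are ignored.
hpa_desired_replicas() {
  # Integer ceiling of (current_replicas * current_cpu) / target_cpu.
  desired=$(( ($1 * $2 + $3 - 1) / $3 ))
  # Clamp to the configured replicaCountMin and replicaCountMax.
  if [ "$desired" -lt "$4" ]; then desired=$4; fi
  if [ "$desired" -gt "$5" ]; then desired=$5; fi
  echo "$desired"
}

# Hypothetical case: 2 pods averaging 90% CPU against a 75% target,
# with replicaCountMin=1 and replicaCountMax=4. Prints 3.
hpa_desired_replicas 2 90 75 1 4
```

The clamp is what gives replicaCountMax its meaning in the table: however far utilization exceeds the target, the computed count never rises above that ceiling.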

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-18 UTC.