gcloud alpha ai index-endpoints deploy-index Stay organized with collections Save and categorize content based on your preferences.
- NAME
- gcloud alpha ai index-endpoints deploy-index - deploy an index to a Vertex AI index endpoint
- SYNOPSIS
gcloud alpha ai index-endpoints deploy-index(INDEX_ENDPOINT:--region=REGION)--deployed-index-id=DEPLOYED_INDEX_ID--display-name=DISPLAY_NAME--index=INDEX[--allowed-issuers=[ALLOWED_ISSUERS,…]][--audiences=[AUDIENCES,…]][--deployment-group=DEPLOYMENT_GROUP][--deployment-tier=DEPLOYMENT_TIER][--enable-access-logging][--machine-type=MACHINE_TYPE][--max-replica-count=MAX_REPLICA_COUNT][--min-replica-count=MIN_REPLICA_COUNT][--psc-automation-configs=[network=NETWORK],[project-id=PROJECT-ID]][--reserved-ip-ranges=[RESERVED_IP_RANGES,…]][GCLOUD_WIDE_FLAG …]
- DESCRIPTION
(ALPHA)Deploy an index to a Vertex AI index endpoint.- EXAMPLES
- To deploy index
to an index endpoint345with 2 min replica count and 10 maxreplica count under project456inregionexample, within reserved iprangesus-central1andvertex-ai-ip-range-1run:vertex-ai-ip-range-2gcloudalphaaiindex-endpointsdeploy-index456--project=example--region=us-central1--index=345--deployed-index-id=deployed-index-345--display-name=deployed-index-345--min-replica-count=2--max-replica-count=10--reserved-ip-ranges=vertex-ai-ip-range-1,vertex-ai-ip-range-2 - POSITIONAL ARGUMENTS
- Index endpoint resource - The index endpoint to deploy an index. The argumentsin this group can be used to specify the attributes of this resource. (NOTE)Some attributes are not given arguments in this group but can be set in otherways.
To set the
projectattribute:- provide the argument
index_endpointon the command line with afully specified name; - provide the argument
--projecton the command line; - set the property
core/project.
This must be specified.
INDEX_ENDPOINT- ID of the index_endpoint or fully qualified identifier for the index_endpoint.
To set the
nameattribute:- provide the argument
index_endpointon the command line.
This positional argument must be specified if any of the other arguments in thisgroup are specified.
- provide the argument
--region=REGION- Cloud region for the index_endpoint.
To set the
regionattribute:- provide the argument
index_endpointon the command line with afully specified name; - provide the argument
--regionon the command line; - set the property
ai/region; - choose one from the prompted list of available regions.
- provide the argument
- provide the argument
- Index endpoint resource - The index endpoint to deploy an index. The argumentsin this group can be used to specify the attributes of this resource. (NOTE)Some attributes are not given arguments in this group but can be set in otherways.
- REQUIRED FLAGS
--deployed-index-id=DEPLOYED_INDEX_ID- Id of the deployed index.
--display-name=DISPLAY_NAME- Display name of the deployed index.
--index=INDEX- ID of the index.
- OPTIONAL FLAGS
--allowed-issuers=[ALLOWED_ISSUERS,…]- List of allowed JWT issuers for a deployed index.
Each entry must be a valid Google service account, in the following format:
service-account-name@project-id.iam.gserviceaccount.com --audiences=[AUDIENCES,…]- List of JWT audiences that are allowed to access a deployed index.
JWT containing any of these audiences (https://tools.ietf.org/html/draft-ietf-oauth-json-web-token-32#section-4.1.3) will be accepted.
--deployment-group=DEPLOYMENT_GROUP- Deployment group can be no longer than 64 characters (eg:
test,prod). If not set, we will use thedefaultdeploymentgroup.Creating deployment_groups with
reserved_ip_rangesis a recommendedpractice when the peered network has multiple peering ranges.This creates yourdeployments from predictable IP spaces for easier traffic administration. --deployment-tier=DEPLOYMENT_TIER- The deployment tier that the deployed index is deployed to. If not specified, asystem-chosen default tier is used.
DEPLOYMENT_TIERmustbe (only one value is supported):storage. --enable-access-logging- If true, online prediction access logs are sent to Cloud Logging.
These logs are standard server access logs, containing information liketimestamp and latency for each prediction request.
--machine-type=MACHINE_TYPE- The machine resources to be used for each node of this deployment. For availablemachine types, seehttps://cloud.google.com/ai-platform-unified/docs/predictions/machine-types.
--max-replica-count=MAX_REPLICA_COUNT- Maximum number of machine replicas the deployed index will be always deployedon.
--min-replica-count=MIN_REPLICA_COUNT- Minimum number of machine replicas the deployed index will be always deployedon. If specified, the value must be equal to or larger than 1.
--psc-automation-configs=[network=NETWORK],[project-id=PROJECT-ID]- A pair of
project-idandnetworkthe PSC index will bedeployed to. For example:--psc-automation-configs=project-id=my-project,network=my-network.For multiple networks, this flag can be repeated:--psc-automation-configs=project-id=my-project,network=my-network--psc-automation-configs=project-id=my-project2,network=my-network2 --reserved-ip-ranges=[RESERVED_IP_RANGES,…]- List of reserved IP ranges deployed index will be deployed to.
- GCLOUD WIDE FLAGS
- These flags are available to all commands:
--access-token-file,--account,--billing-project,--configuration,--flags-file,--flatten,--format,--help,--impersonate-service-account,--log-http,--project,--quiet,--trace-token,--user-output-enabled,--verbosity.Run
$gcloud helpfor details. - NOTES
- This command is currently in alpha and might change without notice. If thiscommand fails with API permission errors despite specifying the correct project,you might be trying to access an API with an invitation-only early accessallowlist. These variants are also available:
gcloudaiindex-endpointsdeploy-indexgcloudbetaaiindex-endpointsdeploy-index
Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-12-09 UTC.