REST Resource: projects.locations.indexEndpoints

Resource: IndexEndpoint

Indexes are deployed into it. An IndexEndpoint can have multiple DeployedIndexes.

Fields
namestring

Output only. The resource name of the IndexEndpoint.

displayNamestring

Required. The display name of the IndexEndpoint. The name can be up to 128 characters long and can consist of any UTF-8 characters.

descriptionstring

The description of the IndexEndpoint.

deployedIndexes[]object (DeployedIndex)

Output only. The indexes deployed in this endpoint.

etagstring

Used to perform consistent read-modify-write updates. If not set, a blind "overwrite" update happens.

labelsmap (key: string, value: string)

The labels with user-defined metadata to organize your IndexEndpoints.

label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed.

Seehttps://goo.gl/xmQnxf for more information and examples of labels.

createTimestring (Timestamp format)

Output only. timestamp when this IndexEndpoint was created.

Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples:"2014-10-02T15:01:23Z","2014-10-02T15:01:23.045123456Z" or"2014-10-02T15:01:23+05:30".

updateTimestring (Timestamp format)

Output only. timestamp when this IndexEndpoint was last updated. This timestamp is not updated when the endpoint's DeployedIndexes are updated, e.g. due to updates of the original Indexes they are the deployments of.

Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples:"2014-10-02T15:01:23Z","2014-10-02T15:01:23.045123456Z" or"2014-10-02T15:01:23+05:30".

networkstring

Optional. The full name of the Google Compute Enginenetwork to which the IndexEndpoint should be peered.

Private services access must already be configured for the network. If left unspecified, the Endpoint is not peered with any network.

network andprivateServiceConnectConfig are mutually exclusive.

Format:projects/{project}/global/networks/{network}. Where {project} is a project number, as in '12345', and {network} is network name.

privateServiceConnectConfigobject (PrivateServiceConnectConfig)

Optional. Configuration for private service connect.

network andprivateServiceConnectConfig are mutually exclusive.

publicEndpointEnabledboolean

Optional. If true, the deployed index will be accessible through public endpoint.

publicEndpointDomainNamestring

Output only. IfpublicEndpointEnabled is true, this field will be populated with the domain name to use for this index endpoint.

encryptionSpecobject (EncryptionSpec)

Immutable. Customer-managed encryption key spec for an IndexEndpoint. If set, this IndexEndpoint and all sub-resources of this IndexEndpoint will be secured by this key.

satisfiesPzsboolean

Output only. reserved for future use.

satisfiesPziboolean

Output only. reserved for future use.

JSON representation
{"name":string,"displayName":string,"description":string,"deployedIndexes":[{object (DeployedIndex)}],"etag":string,"labels":{string:string,...},"createTime":string,"updateTime":string,"network":string,"enablePrivateServiceConnect":boolean,"privateServiceConnectConfig":{object (PrivateServiceConnectConfig)},"publicEndpointEnabled":boolean,"publicEndpointDomainName":string,"encryptionSpec":{object (EncryptionSpec)},"satisfiesPzs":boolean,"satisfiesPzi":boolean}

DeployedIndex

A deployment of an Index. IndexEndpoints contain one or more DeployedIndexes.

Fields
idstring

Required. The user specified id of the DeployedIndex. The id can be up to 128 characters long and must start with a letter and only contain letters, numbers, and underscores. The id must be unique within the project it is created in.

indexstring

Required. The name of the Index this is the deployment of. We may refer to this Index as the DeployedIndex's "original" Index.

displayNamestring

The display name of the DeployedIndex. If not provided upon creation, the Index's displayName is used.

createTimestring (Timestamp format)

Output only. timestamp when the DeployedIndex was created.

Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples:"2014-10-02T15:01:23Z","2014-10-02T15:01:23.045123456Z" or"2014-10-02T15:01:23+05:30".

privateEndpointsobject (IndexPrivateEndpoints)

Output only. Provides paths for users to send requests directly to the deployed index services running on Cloud via private services access. This field is populated ifnetwork is configured.

indexSyncTimestring (Timestamp format)

Output only. The DeployedIndex may depend on various data on its original Index. Additionally when certain changes to the original Index are being done (e.g. when what the Index contains is being changed) the DeployedIndex may be asynchronously updated in the background to reflect these changes. If this timestamp's value is at least theIndex.update_time of the original Index, it means that this DeployedIndex and the original Index are in sync. If this timestamp is older, then to see which updates this DeployedIndex already contains (and which it does not), one mustlist the operations that are running on the original Index. Only the successfully completed Operations withupdateTime equal or before this sync time are contained in this DeployedIndex.

Uses RFC 3339, where generated output will always be Z-normalized and use 0, 3, 6 or 9 fractional digits. Offsets other than "Z" are also accepted. Examples:"2014-10-02T15:01:23Z","2014-10-02T15:01:23.045123456Z" or"2014-10-02T15:01:23+05:30".

automaticResourcesobject (AutomaticResources)

Optional. A description of resources that the DeployedIndex uses, which to large degree are decided by Vertex AI, and optionally allows only a modest additional configuration. If minReplicaCount is not set, the default value is 2 (we don't provide SLA when minReplicaCount=1). If maxReplicaCount is not set, the default value is minReplicaCount. The max allowed replica count is 1000.

dedicatedResourcesobject (DedicatedResources)

Optional. A description of resources that are dedicated to the DeployedIndex, and that need a higher degree of manual configuration. The field minReplicaCount must be set to a value strictly greater than 0, or else validation will fail. We don't provide SLA when minReplicaCount=1. If maxReplicaCount is not set, the default value is minReplicaCount. The max allowed replica count is 1000.

Available machine types for SMALL shard: e2-standard-2 and all machine types available for MEDIUM and LARGE shard.

Available machine types for MEDIUM shard: e2-standard-16 and all machine types available for LARGE shard.

Available machine types for LARGE shard: e2-highmem-16, n2d-standard-32.

n1-standard-16 and n1-standard-32 are still available, but we recommend e2-standard-16 and e2-highmem-16 for cost efficiency.

enableAccessLoggingboolean

Optional. If true, private endpoint's access logs are sent to Cloud Logging.

These logs are like standard server access logs, containing information like timestamp and latency for each MatchRequest.

Note that logs may incur a cost, especially if the deployed index receives a high queries per second rate (QPS). Estimate your costs before enabling this option.

enableDatapointUpsertLoggingboolean

Optional. If true, logs to Cloud Logging errors relating to datapoint upserts.

Under normal operation conditions, these log entries should be very rare. However, if incompatible datapoint updates are being uploaded to an index, a high volume of log entries may be generated in a short period of time.

Note that logs may incur a cost, especially if the deployed index receives a high volume of datapoint upserts. Estimate your costs before enabling this option.

deployedIndexAuthConfigobject (DeployedIndexAuthConfig)

Optional. If set, the authentication is enabled for the private endpoint.

reservedIpRanges[]string

Optional. A list of reserved ip ranges under the VPC network that can be used for this DeployedIndex.

If set, we will deploy the index within the provided ip ranges. Otherwise, the index might be deployed to any ip ranges under the provided VPC network.

The value should be the name of the address (https://cloud.google.com/compute/docs/reference/rest/v1/addresses) Example: ['vertex-ai-ip-range'].

For more information about subnets and network IP ranges, please seehttps://cloud.google.com/vpc/docs/subnets#manually_created_subnet_ip_ranges.

deploymentGroupstring

Optional. The deployment group can be no longer than 64 characters (eg: 'test', 'prod'). If not set, we will use the 'default' deployment group.

Creatingdeployment_groups withreservedIpRanges is a recommended practice when the peered network has multiple peering ranges. This creates your deployments from predictable IP spaces for easier traffic administration. Also, one deploymentGroup (except 'default') can only be used with the same reservedIpRanges which means if the deploymentGroup has been used with reservedIpRanges: [a, b, c], using it with [a, b] or [d, e] is disallowed.

Note: we only support up to 5 deployment groups(not including 'default').

deploymentTierenum (DeploymentTier)

Optional. The deployment tier that the index is deployed to. DEPLOYMENT_TIER_UNSPECIFIED will use a system-chosen default tier.

pscAutomationConfigs[]object (PSCAutomationConfig)

Optional. If set for PSC deployed index, PSC connection will be automatically created after deployment is done and the endpoint information is populated in privateEndpoints.psc_automated_endpoints.

JSON representation
{"id":string,"index":string,"displayName":string,"createTime":string,"privateEndpoints":{object (IndexPrivateEndpoints)},"indexSyncTime":string,"automaticResources":{object (AutomaticResources)},"dedicatedResources":{object (DedicatedResources)},"enableAccessLogging":boolean,"enableDatapointUpsertLogging":boolean,"deployedIndexAuthConfig":{object (DeployedIndexAuthConfig)},"reservedIpRanges":[string],"deploymentGroup":string,"deploymentTier":enum (DeploymentTier),"pscAutomationConfigs":[{object (PSCAutomationConfig)}]}

IndexPrivateEndpoints

IndexPrivateEndpoints proto is used to provide paths for users to send requests via private endpoints (e.g. private service access, private service connect). To send request via private service access, use matchGrpcAddress. To send request via private service connect, use serviceAttachment.

Fields
matchGrpcAddressstring

Output only. The ip address used to send match gRPC requests.

serviceAttachmentstring

Output only. The name of the service attachment resource. Populated if private service connect is enabled.

pscAutomatedEndpoints[]object (PscAutomatedEndpoints)

Output only. PscAutomatedEndpoints is populated if private service connect is enabled if PscAutomatedConfig is set.

JSON representation
{"matchGrpcAddress":string,"serviceAttachment":string,"pscAutomatedEndpoints":[{object (PscAutomatedEndpoints)}]}

PscAutomatedEndpoints

PscAutomatedEndpoints defines the output of the forwarding rule automatically created by each PscAutomationConfig.

Fields
projectIdstring

Corresponding projectId in pscAutomationConfigs

networkstring

Corresponding network in pscAutomationConfigs.

matchAddressstring

ip Address created by the automated forwarding rule.

JSON representation
{"projectId":string,"network":string,"matchAddress":string}

DeployedIndexAuthConfig

Used to set up the auth on the DeployedIndex's private endpoint.

Fields
authProviderobject (AuthProvider)

Defines the authentication provider that the DeployedIndex uses.

JSON representation
{"authProvider":{object (AuthProvider)}}

AuthProvider

Configuration for an authentication provider, including support forJSON Web token (JWT).

Fields
audiences[]string

The list of JWTaudiences. that are allowed to access. A JWT containing any of these audiences will be accepted.

allowedIssuers[]string

A list of allowed JWT issuers. Each entry must be a valid Google service account, in the following format:

service-account-name@project-id.iam.gserviceaccount.com

JSON representation
{"audiences":[string],"allowedIssuers":[string]}

DeploymentTier

Tiers encapsulate serving time attributes like latency and throughput.

Enums
DEPLOYMENT_TIER_UNSPECIFIEDDefault deployment tier.
STORAGEOptimized for costs.

Methods

create

Creates an IndexEndpoint.

delete

Deletes an IndexEndpoint.

deployIndex

Deploys an Index into this IndexEndpoint, creating a DeployedIndex within it.

get

Gets an IndexEndpoint.

list

Lists IndexEndpoints in a Location.

mutateDeployedIndex

Update an existing DeployedIndex under an IndexEndpoint.

patch

Updates an IndexEndpoint.

undeployIndex

Undeploys an Index from an IndexEndpoint, removing a DeployedIndex from it, and freeing all resources it's using.

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-12-02 UTC.