gcloud alpha ai endpoints

NAME
gcloud alpha ai endpoints - manage Vertex AI endpoints
SYNOPSIS
gcloud alpha ai endpointsCOMMAND[GCLOUD_WIDE_FLAG]
DESCRIPTION
(ALPHA) An endpoint contains one or more deployed models, all ofwhich must have the same interface but may come from different models. Anendpoint is to obtain online prediction and explanation from one of its deployedmodels.

When you communicate with Vertex AI services, you identify a specific endpointthat is deployed in the cloud using a combination of the current project, theregion, and the endpoint.

GCLOUD WIDE FLAGS
These flags are available to all commands:--help.

Run$gcloud help for details.

COMMANDS
COMMAND is one of the following:
create
(ALPHA) Create a new Vertex AI endpoint.
delete
(ALPHA) Delete an existing Vertex AI endpoint.
deploy-model
(ALPHA) Deploy a model to an existing Vertex AI endpoint.
describe
(ALPHA) Describe an existing Vertex AI endpoint.
direct-predict
(ALPHA) Run Vertex AI online direct prediction.
direct-raw-predict
(ALPHA) Run Vertex AI online direct raw prediction.
explain
(ALPHA) Request an online explanation from an Vertex AI endpoint.
list
(ALPHA) List existing Vertex AI endpoints.
predict
(ALPHA) Run Vertex AI online prediction.
raw-predict
(ALPHA) Run Vertex AI online raw prediction.
stream-direct-predict
(ALPHA) Run Vertex AI online stream direct prediction.
stream-direct-raw-predict
(ALPHA) Run Vertex AI online stream direct raw prediction.
stream-raw-predict
(ALPHA) Run Vertex AI online stream raw prediction.
undeploy-model
(ALPHA) Undeploy a model from an existing Vertex AI endpoint.
update
(ALPHA) Update an existing Vertex AI endpoint.
NOTES
This command is currently in alpha and might change without notice. If thiscommand fails with API permission errors despite specifying the correct project,you might be trying to access an API with an invitation-only early accessallowlist. These variants are also available:
gcloudaiendpoints
gcloudbetaaiendpoints

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-09-16 UTC.