gcloud ai endpoints stream-direct-predict

NAME
gcloud ai endpoints stream-direct-predict - run Vertex AI online stream direct prediction
SYNOPSIS
gcloud ai endpoints stream-direct-predict (ENDPOINT : --region=REGION) --json-request=JSON_REQUEST [GCLOUD_WIDE_FLAG]
DESCRIPTION
gcloud ai endpoints stream-direct-predict sends a stream direct prediction request to a Vertex AI endpoint for the given inputs. The request limit is 10MB.
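Because of the request limit, it can help to check the size of a request body before sending it. The following is a minimal Python sketch of such a check, not part of gcloud. The helper name fits_request_limit is hypothetical, and the sketch assumes that 10MB means 10 * 1024 * 1024 bytes and that the limit applies to the raw JSON body; this page confirms neither.

```python
# Assumption: "10MB" is read as 10 * 1024 * 1024 bytes.
# The service may count the limit differently.
ASSUMED_LIMIT_BYTES = 10 * 1024 * 1024


def fits_request_limit(body, limit_bytes=ASSUMED_LIMIT_BYTES):
    """Return True if the request body (bytes) is no larger than limit_bytes."""
    return len(body) <= limit_bytes
```

For a request stored in a file, the body can be read in binary mode, for example open("input.json", "rb").read(), and passed to the helper.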
EXAMPLES
To stream direct predict against an endpoint 123 under project example in region us-central1, run:
gcloud ai endpoints stream-direct-predict 123 --project=example --region=us-central1 --json-request=input.json
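When the command is driven from a script, the same invocation can be assembled as an argument list. The sketch below is one possible way to do this in Python. The function name build_stream_direct_predict_command is hypothetical, and only flags documented on this page are used.

```python
import shlex


def build_stream_direct_predict_command(endpoint, region, json_request, project=None):
    """Build the argv list for gcloud ai endpoints stream-direct-predict."""
    argv = [
        "gcloud", "ai", "endpoints", "stream-direct-predict",
        str(endpoint),
        f"--region={region}",
        f"--json-request={json_request}",
    ]
    # --project is optional here; gcloud can also read the core/project property.
    if project is not None:
        argv.append(f"--project={project}")
    return argv


argv = build_stream_direct_predict_command(
    "123", "us-central1", "input.json", project="example"
)
# Print a shell-quoted form of the command for inspection.
print(shlex.join(argv))
```

The list can be passed to subprocess.run(argv, check=True) to execute the command, which requires gcloud to be installed and authenticated on the machine.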
POSITIONAL ARGUMENTS
Endpoint resource - The endpoint to do online stream direct prediction. The arguments in this group can be used to specify the attributes of this resource. (NOTE) Some attributes are not given arguments in this group but can be set in other ways.

To set the project attribute:

  • provide the argument endpoint on the command line with a fully specified name;
  • provide the argument --project on the command line;
  • set the property core/project.

This must be specified.

ENDPOINT
ID of the endpoint or fully qualified identifier for the endpoint.

To set the name attribute:

  • provide the argument endpoint on the command line.

This positional argument must be specified if any of the other arguments in this group are specified.

--region=REGION
Cloud region for the endpoint.

To set the region attribute:

  • provide the argument endpoint on the command line with a fully specified name;
  • provide the argument --region on the command line;
  • set the property ai/region;
  • choose one from the prompted list of available regions.
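A fully specified endpoint name carries the project and region along with the endpoint ID. The Python sketch below builds and parses such a name. It assumes the usual Vertex AI resource name layout, projects/PROJECT/locations/REGION/endpoints/ENDPOINT, which this page does not spell out, and both function names are hypothetical.

```python
def endpoint_resource_name(project, region, endpoint):
    """Format a fully specified endpoint name (assumed layout, see lead-in)."""
    return f"projects/{project}/locations/{region}/endpoints/{endpoint}"


def parse_endpoint_resource_name(name):
    """Split a fully specified endpoint name into its three attributes."""
    parts = name.split("/")
    # Expect exactly: projects/<p>/locations/<r>/endpoints/<e>
    if len(parts) != 6 or parts[0::2] != ["projects", "locations", "endpoints"]:
        raise ValueError(f"not a fully specified endpoint name: {name!r}")
    return {"project": parts[1], "region": parts[3], "endpoint": parts[5]}
```

With a name in this form passed as ENDPOINT, the --project and --region flags are not needed, per the attribute lists above.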
REQUIRED FLAGS
--json-request=JSON_REQUEST
Path to a local file containing the body of a JSON request.

An example of a JSON request:

{"inputs":[{"dtype":"STRING","shape":[1],"string_val":["hello world"]},{"dtype":"INT32","shape":[1],"int_val":[42]}]}

This flag accepts "-" for stdin.
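The request body can also be generated programmatically instead of being written by hand. The following Python sketch renders a body with the same fields as the example request; the function name render_request is hypothetical, and json.dumps quotes every key so the output is valid JSON.

```python
import json


def render_request(inputs):
    """Serialize a list of input tensors into a JSON request body."""
    return json.dumps({"inputs": inputs})


# Same tensors as the example request on this page.
body = render_request([
    {"dtype": "STRING", "shape": [1], "string_val": ["hello world"]},
    {"dtype": "INT32", "shape": [1], "int_val": [42]},
])
print(body)
```

The resulting string can be saved to a file such as input.json, or piped to the command on stdin with --json-request=-.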

GCLOUD WIDE FLAGS
These flags are available to all commands: --access-token-file, --account, --billing-project, --configuration, --flags-file, --flatten, --format, --help, --impersonate-service-account, --log-http, --project, --quiet, --trace-token, --user-output-enabled, --verbosity.

Run $ gcloud help for details.

NOTES
These variants are also available:
gcloud alpha ai endpoints stream-direct-predict
gcloud beta ai endpoints stream-direct-predict

Last updated 2025-11-11 UTC.