gcloud beta ai model-garden models list-deployment-config

NAME
gcloud beta ai model-garden models list-deployment-config - list the machine specifications supported by and verified for a model in Model Garden
SYNOPSIS
gcloud beta ai model-garden models list-deployment-config--model=MODEL[--hugging-face-access-token=HUGGING_FACE_ACCESS_TOKEN][--filter=EXPRESSION][--sort-by=[FIELD,…]][GCLOUD_WIDE_FLAG]
EXAMPLES
To list the supported machine specifications forgoogle/gemma2@gemma-2-9b, run:
gcloudaimodel-gardenmodelslist-deployment-config--model=google/gemma2@gemma-2-9b

To list the supported machine specifications for a Hugging Face modelmeta-llama/Meta-Llama-3-8B, run:

gcloudaimodel-gardenmodelslist-deployment-config--model=meta-llama/Meta-Llama-3-8B
REQUIRED FLAGS
--model=MODEL
The model to be deployed. If it is a Model Garden model, it should be in theformat of{publisher_name}/{model_name}@{model_version_name}, e.g.google/gemma2@gemma-2-2b. If it is a Hugging Face model, it shouldbe in the convention of Hugging Face models, e.g.meta-llama/Meta-Llama-3-8B.
FLAGS
--hugging-face-access-token=HUGGING_FACE_ACCESS_TOKEN
The access token from Hugging Face needed to read the model artifacts of gatedmodels in order to generate the deployment configs. It is only needed when theHugging Face model to deploy is gated and not verified by Model Garden. You canuse thegcloud ai alpha/beta model-garden models list command tofind out which ones are verified by Model Garden.
LIST COMMAND FLAGS
--filter=EXPRESSION
Apply a Boolean filterEXPRESSION to each resource itemto be listed. If the expression evaluatesTrue, then that item islisted. For more details and examples of filter expressions, run $gcloud topic filters. This flaginteracts with other flags that are applied in this order:--flatten,--sort-by,--filter,--limit.
--sort-by=[FIELD,…]
Comma-separated list of resource field key names to sort by. The default orderis ascending. Prefix a field with ``~´´ for descending order on thatfield. This flag interacts with other flags that are applied in this order:--flatten,--sort-by,--filter,--limit.
GCLOUD WIDE FLAGS
These flags are available to all commands:--access-token-file,--account,--billing-project,--configuration,--flags-file,--flatten,--format,--help,--impersonate-service-account,--log-http,--project,--quiet,--trace-token,--user-output-enabled,--verbosity.

Run$gcloud help for details.

NOTES
This command is currently in beta and might change without notice. Thesevariants are also available:
gcloudaimodel-gardenmodelslist-deployment-config
gcloudalphaaimodel-gardenmodelslist-deployment-config

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-10-21 UTC.