gcloud alpha container ai profiles accelerators list

NAME
gcloud alpha container ai profiles accelerators list - list compatible accelerator profiles
SYNOPSIS
gcloud alpha container ai profiles accelerators list--model=MODEL[--format=FORMAT][--max-ntpot-milliseconds=MAX_NTPOT_MILLISECONDS][--model-server=MODEL_SERVER][--model-server-version=MODEL_SERVER_VERSION][--filter=EXPRESSION][--limit=LIMIT][--page-size=PAGE_SIZE][--sort-by=[FIELD,…]][--uri][GCLOUD_WIDE_FLAG]
DESCRIPTION
(ALPHA) This command lists all supported accelerators with theirperformance details. By default, the supported accelerators are displayed in atable format with select information for each accelerator. To see all details,use --format=yaml.

To get supported model, model servers, and model server versions, rungcloud alpha container ai profiles models list,gcloud alphacontainer ai profiles model-servers list, andgcloud alphacontainer ai profiles model-server-versions list. Alternatively, rungcloud alpha container ai profiles model-and-server-combinationslist to get all supported model and server combinations.

REQUIRED FLAGS
--model=MODEL
The model.
FLAGS
--format=FORMAT
The format to use for the output. Default is table. yaml|table
--max-ntpot-milliseconds=MAX_NTPOT_MILLISECONDS
The maximum normalized time per output token (NTPOT) in milliseconds. NTPOT ismeasured as the request_latency / output_tokens. If this field is set, thecommand will only return accelerators that can meet the target ntpotmilliseconds and display their throughput performance at the target latency.Otherwise, the command will return all accelerators and display their highestthroughput performance.
--model-server=MODEL_SERVER
The model server. If not specified, this defaults to any model server.
--model-server-version=MODEL_SERVER_VERSION
The model server version. If not specified, this defaults to the latest version.
LIST COMMAND FLAGS
--filter=EXPRESSION
Apply a Boolean filterEXPRESSION to each resource itemto be listed. If the expression evaluatesTrue, then that item islisted. For more details and examples of filter expressions, run $gcloud topic filters. This flaginteracts with other flags that are applied in this order:--flatten,--sort-by,--filter,--limit.
--limit=LIMIT
Maximum number of resources to list. The default isunlimited. Thisflag interacts with other flags that are applied in this order:--flatten,--sort-by,--filter,--limit.
--page-size=PAGE_SIZE
Some services group resource list output into pages. This flag specifies themaximum number of resources per page. The default is determined by the serviceif it supports paging, otherwise it isunlimited (no paging).Paging may be applied before or after--filter and--limit depending on the service.
--sort-by=[FIELD,…]
Comma-separated list of resource field key names to sort by. The default orderis ascending. Prefix a field with ``~´´ for descending order on thatfield. This flag interacts with other flags that are applied in this order:--flatten,--sort-by,--filter,--limit.
--uri
Print a list of resource URIs instead of the default output, and change thecommand output to a list of URIs. If this flag is used with--format, the formatting is applied on this URI list. To displayURIs alongside other keys instead, use theuri() transform.
GCLOUD WIDE FLAGS
These flags are available to all commands:--access-token-file,--account,--billing-project,--configuration,--flags-file,--flatten,--format,--help,--impersonate-service-account,--log-http,--project,--quiet,--trace-token,--user-output-enabled,--verbosity.

Run$gcloud help for details.

NOTES
This command is currently in alpha and might change without notice. If thiscommand fails with API permission errors despite specifying the correct project,you might be trying to access an API with an invitation-only early accessallowlist.

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-07-15 UTC.