gcloud alpha container ai profiles accelerators list Stay organized with collections Save and categorize content based on your preferences.
- NAME
- gcloud alpha container ai profiles accelerators list - list compatible accelerator profiles
- SYNOPSIS
gcloud alpha container ai profiles accelerators list--model=MODEL[--format=FORMAT][--max-ntpot-milliseconds=MAX_NTPOT_MILLISECONDS][--model-server=MODEL_SERVER][--model-server-version=MODEL_SERVER_VERSION][--filter=EXPRESSION][--limit=LIMIT][--page-size=PAGE_SIZE][--sort-by=[FIELD,…]][--uri][GCLOUD_WIDE_FLAG …]
- DESCRIPTION
(ALPHA)This command lists all supported accelerators with theirperformance details. By default, the supported accelerators are displayed in atable format with select information for each accelerator. To see all details,use --format=yaml.To get supported model, model servers, and model server versions, run
gcloud alpha container ai profiles models list,gcloud alphacontainer ai profiles model-servers list, andgcloud alphacontainer ai profiles model-server-versions list. Alternatively, rungcloud alpha container ai profiles model-and-server-combinationslistto get all supported model and server combinations.- REQUIRED FLAGS
--model=MODEL- The model.
- FLAGS
--format=FORMAT- The format to use for the output. Default is table. yaml|table
--max-ntpot-milliseconds=MAX_NTPOT_MILLISECONDS- The maximum normalized time per output token (NTPOT) in milliseconds. NTPOT ismeasured as the request_latency / output_tokens. If this field is set, thecommand will only return accelerators that can meet the target ntpotmilliseconds and display their throughput performance at the target latency.Otherwise, the command will return all accelerators and display their highestthroughput performance.
--model-server=MODEL_SERVER- The model server. If not specified, this defaults to any model server.
--model-server-version=MODEL_SERVER_VERSION- The model server version. If not specified, this defaults to the latest version.
- LIST COMMAND FLAGS
--filter=EXPRESSION- Apply a Boolean filter
EXPRESSIONto each resource itemto be listed. If the expression evaluatesTrue, then that item islisted. For more details and examples of filter expressions, run $gcloud topic filters. This flaginteracts with other flags that are applied in this order:--flatten,--sort-by,--filter,--limit. --limit=LIMIT- Maximum number of resources to list. The default is
unlimited. Thisflag interacts with other flags that are applied in this order:--flatten,--sort-by,--filter,--limit. --page-size=PAGE_SIZE- Some services group resource list output into pages. This flag specifies themaximum number of resources per page. The default is determined by the serviceif it supports paging, otherwise it is
unlimited(no paging).Paging may be applied before or after--filterand--limitdepending on the service. --sort-by=[FIELD,…]- Comma-separated list of resource field key names to sort by. The default orderis ascending. Prefix a field with ``~´´ for descending order on thatfield. This flag interacts with other flags that are applied in this order:
--flatten,--sort-by,--filter,--limit. --uri- Print a list of resource URIs instead of the default output, and change thecommand output to a list of URIs. If this flag is used with
--format, the formatting is applied on this URI list. To displayURIs alongside other keys instead, use theuri()transform.
- GCLOUD WIDE FLAGS
- These flags are available to all commands:
--access-token-file,--account,--billing-project,--configuration,--flags-file,--flatten,--format,--help,--impersonate-service-account,--log-http,--project,--quiet,--trace-token,--user-output-enabled,--verbosity.Run
$gcloud helpfor details. - NOTES
- This command is currently in alpha and might change without notice. If thiscommand fails with API permission errors despite specifying the correct project,you might be trying to access an API with an invitation-only early accessallowlist.
Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-07-15 UTC.