gcloud ml vision detect-text-pdf

NAME
gcloud ml vision detect-text-pdf - detect and transcribe text from PDF files stored in Google Cloud Storage
SYNOPSIS
gcloud ml vision detect-text-pdfINPUT_FILEOUTPUT_PATH[--batch-size=BATCH_SIZE][GCLOUD_WIDE_FLAG]
DESCRIPTION
Detect and transcribe text from PDF files stored in Google Cloud Storage.

The Vision API accepts PDF files up to 2000 pages. Larger files will return anerror.

EXAMPLES
To detect text for input PDF file 'gs://my_bucket/input_file' and store outputin 'gs://my_bucket/out_put_prefix':
gcloudmlvisiondetect-text-pdfgs://my_bucket/input_filegs://my_bucket/out_put_prefix
POSITIONAL ARGUMENTS
INPUT_FILE
Google Cloud Storage location to read the input from. It must be in Google CloudStorage format (gs://bucket/object)
OUTPUT_PATH
Google Cloud Storage location to store the output file. It must be in GoogleCloud Storage format (gs://bucket/object)
FLAGS
--batch-size=BATCH_SIZE
Maximum number of response protos to put into each output JSON file on GoogleCloud Storage. The valid range is [1, 100]. If not specified, the default valueis 20.
GCLOUD WIDE FLAGS
These flags are available to all commands:--access-token-file,--account,--billing-project,--configuration,--flags-file,--flatten,--format,--help,--impersonate-service-account,--log-http,--project,--quiet,--trace-token,--user-output-enabled,--verbosity.

Run$gcloud help for details.

API REFERENCE
This command uses thevision/v1 API. The full documentation forthis API can be found at:https://cloud.google.com/vision/
NOTES
These variants are also available:
gcloudalphamlvisiondetect-text-pdf
gcloudbetamlvisiondetect-text-pdf

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-05-07 UTC.