gcloud dataproc

NAME
gcloud dataproc - create and manage Google Cloud Dataproc clusters and jobs
SYNOPSIS
gcloud dataproc GROUP [GCLOUD_WIDE_FLAG]
DESCRIPTION
The gcloud dataproc command group lets you create and manage Dataproc clusters and jobs.

Dataproc is an Apache Hadoop, Apache Spark, Apache Pig, and Apache Hive service. It easily processes big datasets at low cost, creating managed clusters of any size that scale down once processing is complete.

More information on Dataproc can be found here: https://cloud.google.com/dataproc and detailed documentation can be found here: https://cloud.google.com/dataproc/docs/

EXAMPLES
To see how to create and manage clusters, run:
gcloud dataproc clusters

To see how to submit and manage jobs, run:

gcloud dataproc jobs
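
As a rough illustration of how the clusters and jobs groups fit together, the sketch below creates a cluster, submits a Spark job to it, and deletes the cluster. The cluster name, region, and the `run` helper are assumptions made for this example and do not come from this page; the SparkPi class and jar path assume the example jar that ships on standard Dataproc images. By default the helper only prints each command (dry run); set DRY_RUN=0 to actually execute them against your project.

```shell
#!/bin/sh
# Hypothetical values chosen for illustration; replace with your own.
CLUSTER="example-cluster"
REGION="us-central1"

# Hypothetical helper: print the command, and execute it only when DRY_RUN=0.
run() {
  printf '%s\n' "+ $*"
  if [ "${DRY_RUN:-1}" != "1" ]; then
    "$@"
  fi
}

# Create a cluster with default settings.
run gcloud dataproc clusters create "$CLUSTER" --region="$REGION"

# Submit the SparkPi example job (jar path assumed to exist on the image).
run gcloud dataproc jobs submit spark \
  --cluster="$CLUSTER" --region="$REGION" \
  --class=org.apache.spark.examples.SparkPi \
  --jars=file:///usr/lib/spark/examples/jars/spark-examples.jar \
  -- 1000

# Delete the cluster once the job has finished.
run gcloud dataproc clusters delete "$CLUSTER" --region="$REGION" --quiet
```

Keeping the dry run as the default makes it easy to review the exact commands before anything billable is created.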
GCLOUD WIDE FLAGS
These flags are available to all commands: --help.

Run $ gcloud help for details.

GROUPS
GROUP is one of the following:
autoscaling-policies
Create and manage Dataproc autoscaling policies.
batches
Submit Dataproc batch jobs.
clusters
Create and manage Dataproc clusters.
jobs
Submit and manage Dataproc jobs.
node-groups
Manage Dataproc node groups.
operations
View and manage Dataproc operations.
workflow-templates
Create and manage Dataproc workflow templates.
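
Each group listed above fits the GROUP slot in the synopsis, and each accepts the --help flag. The short sketch below prints the help command for every group; the variable and function names are made up for this example, and the script only prints the commands rather than running gcloud.

```shell
#!/bin/sh
# Groups copied from the GROUPS list on this page.
DATAPROC_GROUPS="autoscaling-policies batches clusters jobs node-groups operations workflow-templates"

# Hypothetical helper: print one help command per group.
list_help_commands() {
  for group in $DATAPROC_GROUPS; do
    printf '%s\n' "gcloud dataproc $group --help"
  done
}

list_help_commands
```

Piping any of the printed lines to a shell (for example by copying one into a terminal) shows the subcommands and flags for that group.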
NOTES
These variants are also available:
gcloud alpha dataproc
gcloud beta dataproc

Last updated 2025-09-16 UTC.