gcloud dataproc
- NAME
- gcloud dataproc - create and manage Google Cloud Dataproc clusters and jobs
- SYNOPSIS
gcloud dataproc GROUP [GCLOUD_WIDE_FLAG …]
- DESCRIPTION
- The gcloud dataproc command group lets you create and manage Dataproc clusters and jobs.
Dataproc is an Apache Hadoop, Apache Spark, Apache Pig, and Apache Hive service. It easily processes big datasets at low cost, creating managed clusters of any size that scale down once processing is complete.
More information on Dataproc can be found here: https://cloud.google.com/dataproc and detailed documentation can be found here: https://cloud.google.com/dataproc/docs/
- EXAMPLES
- To see how to create and manage clusters, run:
$ gcloud dataproc clusters
To see how to submit and manage jobs, run:
$ gcloud dataproc jobs
- GCLOUD WIDE FLAGS
- These flags are available to all commands:
--help.
Run $ gcloud help for details.
- GROUPS
GROUP is one of the following:
autoscaling-policies - Create and manage Dataproc autoscaling policies.
batches - Submit Dataproc batch jobs.
clusters - Create and manage Dataproc clusters.
jobs - Submit and manage Dataproc jobs.
node-groups - Manage Dataproc node groups.
operations - View and manage Dataproc operations.
workflow-templates - Create and manage Dataproc workflow templates.
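As a rough illustration of how a few of the groups listed above are invoked, the following sketch prints (and optionally runs) some common read-only commands. The region value `us-central1`, the `run` helper, and the `DRY_RUN` variable are assumptions made for this example and are not part of gcloud. By default the script only echoes each command, so it is safe to execute without credentials.

```shell
#!/bin/sh
# Hypothetical placeholder; replace with the region your resources live in.
REGION="${REGION:-us-central1}"

# Illustrative helper (not part of gcloud): print each command, and
# execute it only when DRY_RUN=0 is set in the environment.
run() {
  echo "+ $*"
  if [ "${DRY_RUN:-1}" = "0" ]; then
    "$@"
  fi
}

# Show help for the two groups mentioned in EXAMPLES.
run gcloud dataproc clusters --help
run gcloud dataproc jobs --help

# List existing clusters, jobs, and operations in one region.
run gcloud dataproc clusters list --region="$REGION"
run gcloud dataproc jobs list --region="$REGION"
run gcloud dataproc operations list --region="$REGION"
```

To execute the commands for real, run the script with `DRY_RUN=0` after authenticating and selecting a project.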
- NOTES
- These variants are also available:
$ gcloud alpha dataproc
$ gcloud beta dataproc
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-09-16 UTC.