Use hierarchical namespace enabled buckets for Hadoop workloads

This page describes how to usehierarchical namespace enabled buckets for Hadoop workloads.

Overview

When using a Cloud Storage bucket with hierarchical namespace, you can configure theCloud Storage connector to use therename folder operation for workloads like Hadoop, Spark, Hive.

In a bucket without hierarchical namespace, a rename operation in Hadoop, Spark,and Hive involves multiple object copy and delete jobs, impactingperformance and consistency. Renaming a folder using the Cloud Storageconnector optimizes performance and ensures consistency, when handling folderswith a large number of objects.

Before you begin

To use features of hierarchical namespace buckets, use the following Cloud Storageconnector versions:

  • 2.2.23 or later (if you are using version 2.x.x)
  • 3.0.1 or later (if you are using version 3.x.x)

Older connector versions (3.0.0 and older than 2.2.23) have limitations. For more information about the limitations, seeCompatibility withCloud Storage connector version 3.0.0 or versions older than2.2.23.

Enable the Cloud Storage connector on a cluster

This section describes how to enable the Cloud Storage connector on a Dataproc cluster and a self-managed Hadoop cluster.

Dataproc

You can use the Google Cloud CLI to create a Dataproc cluster and enable the Cloud Storage connector to perform the folder operations.

  1. Create a Dataproc cluster using the following command:

      gcloud dataproc clusters createCLUSTER_NAME  --properties=core:fs.gs.hierarchical.namespace.folders.enable=true,  core:fs.gs.http.read-timeout=30000

    Where:

Self-managed Hadoop

You can enable the Cloud Storage connector on your self-managed Hadoop cluster to perform the folder operations.

  1. Add the following to core-site.xml configuration file:

        <property>      <name>fs.gs.hierarchical.namespace.folders.enable</name>      <value>true</value>    </property>    <property>      <name>fs.gs.http.read-timeout</name>      <value>30000</value>    </property>

    Where:

Compatibility with Cloud Storage connector version 3.0.0 or versions older than 2.2.23

Using the Cloud Storage connector version 3.0.0 or versions older than 2.2.23 or disabling folder operations for hierarchical namespace can lead to the following limitations:

What's next

Try it for yourself

If you're new to Google Cloud, create an account to evaluate how Cloud Storage performs in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.

Try Cloud Storage free

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-19 UTC.