Cloud Data Fusion overview Stay organized with collections Save and categorize content based on your preferences.
Cloud Data Fusion is a fully managed, cloud-native, enterprise dataintegration service for quickly building and managing data pipelines. TheCloud Data Fusion web interface lets you build scalable data integrationsolutions. It lets you connect to various data sources, transform the data, andthen transfer it to various destination systems, without having to manage theinfrastructure.
Cloud Data Fusion is powered by the open source projectCDAP.
Get started with Cloud Data Fusion
You can start exploring Cloud Data Fusion in minutes.
- Create a Cloud Data Fusion instance: get started bycreating aCloud Data Fusion instance.
- Cost: before you begin your journey, familiarize yourself withCloud Data Fusion costs.
- Concepts: understand the keyterminologies used in Cloud Data Fusion.
- Quickstart: experience Cloud Data Fusion bycreating your firstpipeline.
Explore Cloud Data Fusion
The main components of Cloud Data Fusion are explained in the followingsections.
Tenant project
The set of services required to build and orchestrate Cloud Data Fusionpipelines and store pipeline metadata are provisioned in atenantproject, inside a tenancyunit. A separate tenant project is created for each customer project, in whichCloud Data Fusion instances are provisioned. The tenant project inheritsall the networking and firewall configurations from the customer project.
Cloud Data Fusion: Console
The Cloud Data Fusion console, also referred to ascontrol plane, is aset ofAPI operations and a web interface that deal with the Cloud Data Fusion instance itself,such as creating, deleting, restarting, and updating it.
Note: The control plane doesn't include Cloud Data Fusion operations below the instance-level, such as creating and executing pipelines.Cloud Data Fusion: Studio
Cloud Data Fusion Studio, also referred to as thedata plane, is a set ofREST API and web interfaceoperations that deal with creation, execution, and management of pipelines andrelated artifacts.
Concepts
This section introduces some of the core concepts of Cloud Data Fusion.
| Concept | Description |
|---|---|
| Cloud Data Fusion instance |
|
| Namespace | A namespace is a logical grouping of applications, data, and the associated metadata in a Cloud Data Fusion instance. You can think of namespaces as a partitioning of the instance. In a single instance, one namespace stores the data and metadata of an entity independently from another namespace. |
| Pipeline |
|
| Pipeline node |
|
| Plugin |
|
| Hub | In the Cloud Data Fusion web interface, to browse plugins, sample pipelines, and other integrations, clickHub. When a new version of a plugin is released, it's visible in the Hub in any instance that's compatible. This applies even if the instance was created before the plugin was released. |
| Pipeline preview |
|
| Pipeline execution |
|
| Compute profile |
|
| Reusable pipeline |
|
| Trigger |
|
Cloud Data Fusion resources
Explore Cloud Data Fusion resources:
- Release notes provide changelogs of features, changes, and deprecations
- Pricing forCloud Data Fusion
- Supported regions for Cloud Data Fusion
- API and reference
What's next
- See Cloud Data Fusionuse cases.
- Create a Cloud Data Fusioninstance.
- Work through atutorial.
Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-12-15 UTC.