Agentic AI use case: Automate data science workflows Stay organized with collections Save and categorize content based on your preferences.
This document describes a high-level architecture for an application thatruns a data science workflow to automate complex data analytics and machinelearning tasks.
This architecture uses datasets that are hosted in BigQuery orAlloyDB for PostgreSQL. The architecture is a multi-agent system that lets users runactions in natural language commands and it eliminates the need to write complexSQL or Python code.
The intended audience for this document includes architects, developers, andadministrators who build and manage agentic AI applications. This architecturelets business and data teams analyze metrics across a wide range of industries,such as retail, finance, and manufacturing. The document assumes a foundationalunderstanding of agentic AI systems. For information about how agents differfrom non-agentic systems, seeWhat is the difference between AI agents, AI assistants, and bots?
Thedeployment section of this document provides links to codesamples to help you experiment with deploying an agentic AI application thatruns a data science workflow.
Architecture
The following diagram shows the architecture for a data science workflowagent.
This architecture includes the following components:
| Component | Description |
|---|---|
| Frontend | Users interact with the multi-agent system through a frontend, such as a chat interface, that runs as a serverless Cloud Run service. |
| Agents | This architecture uses the following agents:
|
| Agents runtime | The AI agents in this architecture are deployed asserverless Cloud Run services. |
| ADK | ADK provides tools and a framework to develop, test, and deploy agents. ADK abstracts the complexity of agent creation and lets AI developers focus on the agent's logic and capabilities. |
| AI model and model runtimes | For inference serving, the agents in this example architecture use the latestGemini model onVertex AI. |
Products used
This example architecture uses the following Google Cloud and open-sourceproducts and tools:
- Cloud Run: A serverless compute platform that lets you runcontainers directly on top of Google's scalable infrastructure.
- Agent Development Kit (ADK): A set of tools and libraries todevelop, test, and deploy AI agents.
- Vertex AI: An ML platform that lets you train and deploy ML modelsand AI applications, and customize LLMs for use in AI-powered applications.
- Gemini: A family of multimodal AI models developed by Google.
- BigQuery: An enterprise data warehouse that helps you manage andanalyze your data with built-in features like machine learning geospatialanalysis, and business intelligence.
- AlloyDB for PostgreSQL: A fully managed, PostgreSQL-compatible database servicethat's designed for your most demanding workloads, including hybridtransactional and analytical processing.
- MCP Toolbox for Databases: An open-sourceModel Context Protocol (MCP) server that lets AI agents securely connect to databases by managing database complexities like connection pooling, authentication, and observability.
Deployment
To deploy a sample implementation of this architecture, useData Science with Multiple Agents. The repository provides two sample datasets todemonstrate the system's flexibility, including a flight dataset for operationalanalysis and an ecommerce sales dataset for business analytics.
What's next
- (Video) Watch theAgent Factory Podcast about AI agents for data engineering and data science.
- (Notebook)Use the data science agent in Colab Enterprise.
- Learn about how tohost AI agents on Cloud Run.
- For an overview of architectural principles and recommendations that are specific to AIand ML workloads in Google Cloud, see theAI and ML perspectivein the Well-Architected Framework.
- For more reference architectures, diagrams, and best practices, explore theCloud Architecture Center.
Contributors
Author:Samantha He | Technical Writer
Other contributors:
- Amina Mansour | Head of Cloud Platform Evaluations Team
- Kumar Dhanagopal | Cross-Product Solution Developer
- Megan O'Keefe | Developer Advocate
- Rachael Deacon-Smith | Developer Advocate
- Shir Meir Lador | Developer Relations Engineering Manager
Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-12-08 UTC.