Vertex AI RAG Engine overview

TheVPC-SC security controls and CMEK are supported by Vertex AI RAG Engine. Data residency and AXT security controls aren't supported.

If you use a Vertex AI RAG Engine-managed Spanner instance as a vector database in a location that is GA, then Google Cloud will bill you for that Spanner instance. For more information, seeVertex AI RAG Engine billing.

You must be added to the allowlist to access Vertex AI RAG Engine inus-central1 andus-east4. For users with existing projects, there is no impact. For users with new projects, you can try other regions, or contactvertex-ai-rag-engine-support@google.com to onboard tous-central1.

This page describes what Vertex AI RAG Engine is and how itworks.

DescriptionConsole
To learn how to use the Vertex AI SDK to run Vertex AI RAG Engine tasks, see theRAG quickstart for Python.

Try Vertex AI RAG Engine

Overview

Vertex AI RAG Engine, a component of the Vertex AIPlatform, facilitates Retrieval-Augmented Generation (RAG).Vertex AI RAG Engine is also a data framework for developingcontext-augmented large language model (LLM) applications. Context augmentationoccurs when you apply an LLM to your data. This implements retrieval-augmentedgeneration (RAG).

A common problem with LLMs is that they don't understand private knowledge, thatis, your organization's data. With Vertex AI RAG Engine, you canenrich the LLM context with additional private information, because the modelcan reduce hallucination and answer questions more accurately.

By combining additional knowledge sources with the existing knowledge that LLMshave, a better context is provided. The improved context along with the queryenhances the quality of the LLM's response.

The following image illustrates the key concepts to understandingVertex AI RAG Engine.

Vertex AI RAG keyconcepts

These concepts are listed in the order of the retrieval-augmented generation(RAG) process.

  1. Data ingestion: Intake data from different data sources. For example,local files, Cloud Storage, and Google Drive.

  2. Data transformation:Conversion of the data in preparation for indexing. For example, data issplit into chunks.

  3. Embedding: Numericalrepresentations of words or pieces of text. These numbers capture thesemantic meaning and context of the text. Similar or related words or texttend to have similar embeddings, which means they are closer together in thehigh-dimensional vector space.

  4. Data indexing: Vertex AI RAG Engine creates an index called acorpus.The index structures the knowledge base so it's optimized for searching. Forexample, the index is like a detailed table of contents for a massivereference book.

  5. Retrieval: When a user asks a question or provides a prompt, the retrievalcomponent in Vertex AI RAG Engine searches through its knowledgebase to find information that is relevant to the query.

  6. Generation: The retrieved information becomes the context added to theoriginal user query as a guide for the generative AI model to generatefactuallygrounded and relevant responses.

Supported regions

Vertex AI RAG Engine is supported in the following regions:

RegionLocationDescriptionLaunch stage
us-central1Iowav1 andv1beta1 versions are supported.Allowlist
us-east4Virginiav1 andv1beta1 versions are supported.Allowlist
europe-west3Frankfurt, Germanyv1 andv1beta1 versions are supported.GA
europe-west4Eemshaven, Netherlandsv1 andv1beta1 versions are supported.GA
asia-east1Taiwanv1 andv1beta1 versions are supported.Preview
asia-northeast1Tokyov1 andv1beta1 versions are supported.Preview
asia-northeast3Seoulv1 andv1beta1 versions are supported.Preview
asia-south1Mumbaiv1 andv1beta1 versions are supported.Preview
asia-southeast1Singaporev1 andv1beta1 versions are supported.Preview
europe-central2Warsawv1 andv1beta1 versions are supported.Preview
europe-north1Finlandv1 andv1beta1 versions are supported.Preview
europe-southwest1Madridv1 andv1beta1 versions are supported.Preview
europe-west1Belgiumv1 andv1beta1 versions are supported.Preview
europe-west2Londonv1 andv1beta1 versions are supported.Preview
europe-west6Zürichv1 andv1beta1 versions are supported.Preview
europe-west8Milanv1 andv1beta1 versions are supported.Preview
europe-west9Parisv1 andv1beta1 versions are supported.Preview
us-east1Moncks Corner, SCv1 andv1beta1 versions are supported.Preview
us-east5Columbus, OHv1 andv1beta1 versions are supported.Preview
us-south1Dallas, TXv1 andv1beta1 versions are supported.Preview
us-west1Oregonv1 andv1beta1 versions are supported.Preview
us-west4Las Vegas, NVv1 andv1beta1 versions are supported.Preview
  • us-central1 andus-east4 are changed toAllowlist. If you'd like to experiment withVertex AI RAG Engine, try other regions. If you plan to onboardyour production traffic to these regions, contactvertex-ai-rag-engine-support@google.com.

Delete Vertex AI RAG Engine

The following code samples demonstrate how to delete aVertex AI RAG Engine for the Google Cloud console, Python, and REST:

Submit feedback

To chat with Google support, go to theVertex AI RAG Enginesupportgroup.

To send an email, use the email addressvertex-ai-rag-engine-support@google.com.

What's next

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-19 UTC.