ChatVertexAI

This page provides a quick overview for getting started with VertexAIchat models. For detailed documentation of all ChatVertexAI features and configurations head to theAPI reference.

ChatVertexAI exposes all foundational models available in Google Cloud, likegemini-1.5-pro,gemini-1.5-flash, etc. For a full and updated list of available models visitVertexAI documentation.

Google Cloud VertexAI vs Google PaLM

The Google Cloud VertexAI integration is separate from theGoogle PaLM integration. Google has chosen to offer an enterprise version of PaLM through GCP, and this supports the models made available through there.

Overview

Integration details

Class	Package	Local	Serializable	JS support	Package downloads	Package latest
ChatVertexAI	langchain-google-vertexai	❌	beta	✅

Model features

Tool calling	Structured output	JSON mode	Image input	Audio input	Video input	Token-level streaming	Native async	Token usage	Logprobs
✅	✅	❌	✅	✅	✅	✅	✅	✅	❌

Setup

To access VertexAI models you'll need to create a Google Cloud Platform account, set up credentials, and install thelangchain-google-vertexai integration package.

Credentials

To use the integration you must:

Have credentials configured for your environment (gcloud, workload identity, etc...)
Store the path to a service account JSON file as the GOOGLE_APPLICATION_CREDENTIALS environment variable

This codebase uses thegoogle.auth library which first looks for the application credentials variable mentioned above, and then looks for system-level auth.

For more information, see:

To enable automated tracing of your model calls, set yourLangSmith API key:

# os.environ["LANGSMITH_API_KEY"] = getpass.getpass("Enter your LangSmith API key: ")
# os.environ["LANGSMITH_TRACING"] = "true"

Installation

The LangChain VertexAI integration lives in thelangchain-google-vertexai package:

%pip install-qU langchain-google-vertexai

Note: you may need to restart the kernel to use updated packages.

Instantiation

Now we can instantiate our model object and generate chat completions:

from langchain_google_vertexaiimport ChatVertexAI

llm= ChatVertexAI(
    model="gemini-1.5-flash-001",
    temperature=0,
    max_tokens=None,
    max_retries=6,
    stop=None,
# other params...
)

API Reference:ChatVertexAI

Invocation

messages=[
(
"system",
"You are a helpful assistant that translates English to French. Translate the user sentence.",
),
("human","I love programming."),
]
ai_msg= llm.invoke(messages)
ai_msg

AIMessage(content="J'adore programmer. \n", response_metadata={'is_blocked': False, 'safety_ratings': [{'category': 'HARM_CATEGORY_HATE_SPEECH', 'probability_label': 'NEGLIGIBLE', 'blocked': False}, {'category': 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability_label': 'NEGLIGIBLE', 'blocked': False}, {'category': 'HARM_CATEGORY_HARASSMENT', 'probability_label': 'NEGLIGIBLE', 'blocked': False}, {'category': 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability_label': 'NEGLIGIBLE', 'blocked': False}], 'usage_metadata': {'prompt_token_count': 20, 'candidates_token_count': 7, 'total_token_count': 27}}, id='run-7032733c-d05c-4f0c-a17a-6c575fdd1ae0-0', usage_metadata={'input_tokens': 20, 'output_tokens': 7, 'total_tokens': 27})

print(ai_msg.content)

J'adore programmer.

Built-in tools

Gemini supports a range of tools that are executed server-side.

Google search

Requireslangchain-google-vertexai>=2.0.11

Gemini can execute a Google search and use the results toground its responses:

from langchain_google_vertexaiimport ChatVertexAI

llm= ChatVertexAI(model="gemini-2.0-flash-001").bind_tools([{"google_search":{}}])

response= llm.invoke("What is today's news?")

API Reference:ChatVertexAI

Code execution

Requireslangchain-google-vertexai>=2.0.25

Gemini cangenerate and execute Python code:

from langchain_google_vertexaiimport ChatVertexAI

llm= ChatVertexAI(model="gemini-2.0-flash-001").bind_tools([{"code_execution":{}}])

response= llm.invoke("What is 3^3?")

API Reference:ChatVertexAI

Chaining

We canchain our model with a prompt template like so:

from langchain_core.promptsimport ChatPromptTemplate

prompt= ChatPromptTemplate.from_messages(
[
(
"system",
"You are a helpful assistant that translates {input_language} to {output_language}.",
),
("human","{input}"),
]
)

chain= prompt| llm
chain.invoke(
{
"input_language":"English",
"output_language":"German",
"input":"I love programming.",
}
)

API Reference:ChatPromptTemplate

AIMessage(content='Ich liebe Programmieren. \n', response_metadata={'is_blocked': False, 'safety_ratings': [{'category': 'HARM_CATEGORY_HATE_SPEECH', 'probability_label': 'NEGLIGIBLE', 'blocked': False}, {'category': 'HARM_CATEGORY_DANGEROUS_CONTENT', 'probability_label': 'NEGLIGIBLE', 'blocked': False}, {'category': 'HARM_CATEGORY_HARASSMENT', 'probability_label': 'NEGLIGIBLE', 'blocked': False}, {'category': 'HARM_CATEGORY_SEXUALLY_EXPLICIT', 'probability_label': 'NEGLIGIBLE', 'blocked': False}], 'usage_metadata': {'prompt_token_count': 15, 'candidates_token_count': 8, 'total_token_count': 23}}, id='run-c71955fd-8dc1-422b-88a7-853accf4811b-0', usage_metadata={'input_tokens': 15, 'output_tokens': 8, 'total_tokens': 23})

API reference

For detailed documentation of all ChatVertexAI features and configurations, like how to send multimodal inputs and configure safety settings, head to the API reference:https://python.langchain.com/api_reference/google_vertexai/chat_models/langchain_google_vertexai.chat_models.ChatVertexAI.html

Chat modelconceptual guide
Chat modelhow-to guides

Movatterモバイル変換

ChatVertexAI

Overview

Integration details

Model features

Setup

Credentials

Installation

Instantiation

Invocation

Built-in tools

Google search

Code execution

Chaining

API reference

Related

Movatterモバイル変換

Overview​

Integration details​

Model features​

Setup​

Credentials​

Installation​

Instantiation​

Invocation​

Built-in tools​

Google search​

Code execution​

Chaining​

API reference​

Related​

Overview

Integration details

Model features

Setup

Credentials

Installation

Instantiation

Invocation

Built-in tools

Google search

Code execution

Chaining

API reference

Related