Evaluate models using Vertex AI

This page describes how to evaluate models using Vertex AI. Foran overview, seemodel evaluation in Vertex AI.

Prerequisites

  1. Follow the steps atSet up a project and a development environment.In addition, enable the following services:

  2. Vertex AI can evaluate models that are trained eitherthrough AutoML or custom training. For the Google Cloud consoleguide, you should have a trained modelimported toVertex AI Model Registry.

  3. Upload your test dataset toBigQuery orCloud Storage. The test dataset should contain the groundtruth, which is the actual result expected for an inference. Obtain the linkto the file or the dataset ID.

  4. Have abatch inference output in the form of aBigQuery table or Cloud Storage URI.

  5. Make sure yourdefault Compute Engine service account has thefollowingIAM permissions:

    • Vertex AI Administrator (aiplatform.admin)
    • Vertex AI Service Agent (aiplatform.serviceAgent)
    • Storage Object Admin (storage.objectAdmin)
    • Dataflow Worker (dataflow.worker)
    • BigQuery Data Editor (bigquery.dataEditor) (only requiredif you are providing data in the form of BigQuery tables)

Create an evaluation

Console

  1. In the Google Cloud console, go to the Vertex AI Models page.

    Go to the Models page

  2. Click the name of the model you want to evaluate.

  3. Click the version number for the model.

  4. On theEvaluate tab, clickCreate Evaluation.

  5. Enter anEvaluation name.

  6. Select anObjective, such as classification or regression.

  7. Enter theEvaluation target column name, which is the column from thetraining data that the model is trained to predict.

  8. ForSelect source, select the source for your test dataset.

    1. ForBigQuery table, enter theBigQuery path.

    2. ForFile on Cloud Storage, enter theCloud Storage path.

  9. ForBatch prediction output, select an output format.

    1. Enter the BigQuery path or Cloud Storage URI.
  10. ClickStart Evaluation.

Python

To view the Vertex AI API model evaluation workflow inVertex AI Pipelines, see the example notebooks for the following modeltypes:

Python SDK

The SDK for evaluating models with Vertex AI is inExperimental. To sign up for the Experimental, fill out theonboarding form.

Vertex AI automatically sends an email notification whena model evaluation job is complete.

View evaluation metrics

Note: ForBigQuery ML models that are registered to Model Registry, Vertex AI only shows evaluation metrics for regression and binary classification models.

Console

  1. In the Google Cloud console, go to the Vertex AI Models page.

    Go to the Models page

  2. Navigate to the model version.

  3. View metrics in theEvaluate tab.

Python

To view the Vertex AI API model evaluation workflow inVertex AI Pipelines, see the example notebooks for the following modeltypes:

Python SDK

The SDK for evaluating models with Vertex AI is inExperimental. To sign up for the Experimental, fill out theonboarding form.

Compare evaluation metrics

You can compare evaluation results across different models, model versions, andevaluation jobs. For more information about model versioning, seeVersioning inModel Registry.

You can only compare models of the same type, such as classification,regression, or forecasting. When comparing different models, all themodel versions must be the same type.

You can only compare 5 or fewer evaluations at a time.

  1. Go to the Vertex AI Model Registry in the Google Cloud console:

    Go to the Models page

  2. Navigate to your model or model version:

    • To compare across different models on theModels page, select thecheckboxes next to the names of the models you want to compare.

    • To compare across different model versions:

      1. Click the name of your model on theModels page to open the listof model versions.

      2. Select the checkboxes next to the versions you want to compare.

    • To compare across evaluation jobs for the same model version:

      1. Click the name of your model on theModels page to open the listof model versions.

      2. Click the version number.

      3. Select the checkboxes next to the evaluation jobs you want to compare.

  3. ClickCompare.

What's next

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-18 UTC.