Llama 3.3 70B

Llama 3.3 70B is a text-only 70B instruction-tuned model that providesenhanced performance relative to previous Llama models when usedfor text-only applications.

Managed API (MaaS) specifications

View model card in Model Garden

Model IDllama-3.3-70b-instruct-maas
Launch stageGA
Supported inputs & outputs
  • Inputs:
    Text,Code
  • Outputs:
    Text
Capabilities
Usage types
Knowledge cutoff dateDecember 2023
Versions
  • llama-3.3-70b-instruct-maas
    • Launch stage: GA
    • Release date: April 29, 2025
Supported regions

Model availability

  • United States
    • us-central1

ML processing

  • United States
    • Multi-region
Quota limits

us-central1:

  • Max output: 8,192
  • Context length: 128,000

PricingSeePricing.

Deploy as a self-deployed model

To self-deploy the model, navigate to theLlama 3.3 70B model card in the Model Gardenconsole and clickDeploy model. For more information about deploying andusing partner models, seeDeploy a partner model and make predictionrequests.

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-19 UTC.