Llama 3.3 70B

Llama 3.3 70B is a text-only 70B instruction-tuned model that provides enhanced performance relative to previous Llama models when used for text-only applications.

Managed API (MaaS) specifications

View model card in Model Garden

Model ID llama-3.3-70b-instruct-maas
Launch stage GA
Supported inputs & outputs
  • Inputs:
    Text, Code
  • Outputs:
    Text
Capabilities
Usage types
Knowledge cutoff date December 2023
Versions
  • llama-3.3-70b-instruct-maas
    • Launch stage: GA
    • Release date: April 29, 2025
Supported regions

Model availability

  • United States
    • us-central1

ML processing

  • United States
    • Multi-region
Quota limits

us-central1:

  • Max output: 8,192
  • Context length: 128,000

Pricing See Pricing.

Deploy as a self-deployed model

To self-deploy the model, navigate to the Llama 3.3 70B model card in the Model Garden console and click Deploy model. For more information about deploying and using partner models, see Deploy a partner model and make prediction requests.