AI Model APIs
AI Model APIsProviders
AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

contact@aimodelapis.com

Resources

  • AI Models APIs
  • Providers

About

  • About
  • Contact
  • Privacy Policy
  • Terms of Service

© 2026 AI Model APIs. All rights reserved.

Back to Models
Vertex

Llama 4 Maverick 17B 128E Instruct API

Vertex
About

Integrate multimodal vision capabilities using Llama 4 Maverick 17B 128E Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.35/1M input and $1.15/1M output tokens, native tool calling support, open weights architecture. Access Llama 4 Maverick 17B 128E Instruct via the Vertex API with up to 8K output tokens.

Capabilities

Input Modalities

textimage

Output Modalities

text
Reasoning
No
Structured Output
Yes
Tool Use
Yes
WeightsOpen
Temperature
Adjustable
Attachment
Supported
Limits
Context Window
524K

Tokens

Input Limit
524K

Tokens

Max Output
8K

Tokens

Pricing

Standard (per 1M tokens)

Input
$0.35
Output
$1.15
Frequently Asked Questions

Llama 4 Maverick 17B 128E Instruct by Vertex costs $0.35 per 1M input tokens and $1.15 per 1M output tokens.

Model Details
ID
meta/llama-4-maverick-17b-128e-instruct-maas
Provider
Vertex
Family
llama
Release Date
Apr 29, 2025
Knowledge Cutoff
Aug 1, 2024
API Integration
NPM Package
@ai-sdk/google-vertex
Environment Variables
GOOGLE_VERTEX_PROJECT
GOOGLE_VERTEX_LOCATION
GOOGLE_APPLICATION_CREDENTIALS
Documentation