AI Model APIs
AI Model APIsProviders
AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

contact@aimodelapis.com

Resources

  • AI Models APIs
  • Providers

About

  • About
  • Contact
  • Privacy Policy
  • Terms of Service

© 2026 AI Model APIs. All rights reserved.

Back to Models
Llama

Cerebras-Llama-4-Maverick-17B-128E-Instruct API

Llama
About

Process massive datasets with Cerebras-Llama-4-Maverick-17B-128E-Instruct, featuring an expansive 128K context window for long-document analysis. This model delivers available completely free of charge, native tool calling support, open weights architecture. Access Cerebras-Llama-4-Maverick-17B-128E-Instruct via the Llama API with up to 4K output tokens.

Capabilities

Input Modalities

text

Output Modalities

text
Reasoning
No
Structured Output
No
Tool Use
Yes
WeightsOpen
Temperature
Adjustable
Attachment
Supported
Limits
Context Window
128K

Tokens

Input Limit
128K

Tokens

Max Output
4K

Tokens

Pricing

Standard (per 1M tokens)

Input
Free
Output
Free
Frequently Asked Questions

Cerebras-Llama-4-Maverick-17B-128E-Instruct by Llama costs Free per 1M input tokens and Free per 1M output tokens.

Model Details
ID
cerebras-llama-4-maverick-17b-128e-instruct
Provider
Llama
Family
llama
Release Date
Apr 5, 2025
Knowledge Cutoff
Jan 1, 2025
API Integration
NPM Package
@ai-sdk/openai-compatible
Environment Variables
LLAMA_API_KEY
API Base URL
https://api.llama.com/compat/v1/
Documentation