AI Model APIs
AI Model APIsProviders
AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

contact@aimodelapis.com

Resources

  • AI Models APIs
  • Providers

About

  • About
  • Contact
  • Privacy Policy
  • Terms of Service

© 2026 AI Model APIs. All rights reserved.

Back to Models
Chutes

Llama 3.2 1B Instruct API

Chutes
About

Llama 3.2 1B Instruct is a high-performance LLM available via the Chutes API, ideal for scalable text generation and natural language processing. This model delivers cost-effective pricing at $0.01/1M input and $0.01/1M output tokens, open weights architecture. Access Llama 3.2 1B Instruct via the Chutes API with up to 8K output tokens.

Capabilities

Input Modalities

text

Output Modalities

text
Reasoning
No
Structured Output
No
Tool Use
No
WeightsOpen
Temperature
Adjustable
Attachment
Not Supported
Limits
Context Window
33K

Tokens

Input Limit
33K

Tokens

Max Output
8K

Tokens

Pricing

Standard (per 1M tokens)

Input
$0.01
Output
$0.01

Caching (per 1M tokens)

Read
$0.0050
Frequently Asked Questions

Llama 3.2 1B Instruct by Chutes costs $0.01 per 1M input tokens and $0.01 per 1M output tokens. Cached reads cost $0.0050 per 1M tokens.

Model Details
ID
unsloth/Llama-3.2-1B-Instruct
Provider
Chutes
Family
Release Date
Jan 27, 2026
Knowledge Cutoff
N/A
API Integration
NPM Package
@ai-sdk/openai-compatible
Environment Variables
CHUTES_API_KEY
API Base URL
https://llm.chutes.ai/v1
Documentation