Llama 3.2 3B Instruct is an open-weights LLM available via the Chutes API, suited to scalable text generation and natural language processing. It is priced at $0.01 per 1M input tokens and $0.01 per 1M output tokens, and supports up to 16K output tokens.
Llama 3.2 3B Instruct on Chutes costs $0.01 per 1M input tokens and $0.01 per 1M output tokens. Cached reads cost $0.005 per 1M tokens.
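To make the per-million pricing concrete, here is a minimal sketch of a cost estimate in Python. The function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes that cached tokens are a subset of the input tokens and are billed at the cached-read rate instead of the regular input rate; the listing does not confirm how cached reads are counted, so treat that as an assumption.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, taken from the listing above.
INPUT_PRICE = Decimal("0.01")
OUTPUT_PRICE = Decimal("0.01")
CACHED_READ_PRICE = Decimal("0.005")
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens counts input tokens served from cache,
    billed at the cached-read rate instead of the input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PRICE
        + cached_tokens * CACHED_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    # Prices are quoted per 1M tokens, so scale the total down.
    return total / TOKENS_PER_UNIT
```

For example, `estimate_cost(1_000_000, 1_000_000)` evaluates to two cents, and moving half of a 2M-token input to cached reads with `estimate_cost(2_000_000, 0, cached_tokens=1_000_000)` evaluates to one and a half cents under the assumption above. `Decimal` is used rather than floats so that small per-token amounts do not pick up binary rounding error.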