Process massive datasets with Cerebras-Llama-4-Scout-17B-16E-Instruct, featuring an expansive 128K context window for long-document analysis. This model is available completely free of charge and offers native tool calling support and an open-weights architecture. Access Cerebras-Llama-4-Scout-17B-16E-Instruct via the Llama API with up to 4K output tokens.
Cerebras-Llama-4-Scout-17B-16E-Instruct by Llama is free to use: there is no charge per 1M input tokens or per 1M output tokens.
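For long-document work, it can help to check roughly whether a prompt fits the 128K context window once room is reserved for the 4K output cap. The sketch below is illustrative only and rests on several assumptions not stated in the listing: "128K" and "4K" are read as 128,000 and 4,000 tokens, output tokens are assumed to count against the context window, and token counts are approximated at four characters per token rather than computed with the model's actual tokenizer. The function names are hypothetical.

```python
# Assumed limits: the listing says "128K" and "4K" without exact figures.
CONTEXT_WINDOW_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 4_000

# Rough heuristic only; the real tokenizer may differ substantially.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Approximate the token count from character length (rounded up)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(text: str, reserved_output: int = MAX_OUTPUT_TOKENS) -> bool:
    """Return True if the estimated prompt plus reserved output fits the window.

    Assumes output tokens share the same context window as the prompt.
    """
    return estimate_tokens(text) + reserved_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    document = "example text " * 1000
    print(estimate_tokens(document), fits_in_context(document))
```

Because the estimate is coarse, leaving extra headroom below the limit is a safer choice than filling the window exactly.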