Process massive datasets with @cf/meta/llama-3.1-8b-instruct-fast, featuring an expansive 128K context window for long-document analysis. This model delivers cost-effective pricing at $0.04/1M input and $0.38/1M output tokens, native tool calling support, open weights architecture. Access @cf/meta/llama-3.1-8b-instruct-fast via the Cloudflare Workers AI API with up to 128K output tokens.
Tokens
Tokens
Tokens
@cf/meta/llama-3.1-8b-instruct-fast by Cloudflare Workers AI costs $0.04 per 1M input tokens and $0.38 per 1M output tokens.