Process massive datasets with Llama 3.1 8B Instruct FP8, featuring an expansive 128K context window for long-document analysis. This model delivers cost-effective pricing at $0.15/1M input and $0.29/1M output tokens. Access Llama 3.1 8B Instruct FP8 via the Cloudflare AI Gateway API with up to 16K output tokens.
Tokens
Tokens
Tokens
Llama 3.1 8B Instruct FP8 by Cloudflare AI Gateway costs $0.15 per 1M input tokens and $0.29 per 1M output tokens.