Process massive datasets with Llama 2 7B Chat FP16, featuring an expansive 128K context window for long-document analysis. This model delivers cost-effective pricing at $0.56/1M input and $6.67/1M output tokens. Access Llama 2 7B Chat FP16 via the Cloudflare AI Gateway API with up to 16K output tokens.
Tokens
Tokens
Tokens
Llama 2 7B Chat FP16 by Cloudflare AI Gateway costs $0.56 per 1M input tokens and $6.67 per 1M output tokens.