@cf/meta/llama-2-7b-chat-int8 is a high-performance LLM available via the Cloudflare Workers AI API, ideal for scalable text generation and natural language processing. This model delivers cost-effective pricing at $0.56/1M input and $6.67/1M output tokens, native tool calling support, open weights architecture. Access @cf/meta/llama-2-7b-chat-int8 via the Cloudflare Workers AI API with up to 8K output tokens.
Tokens
Tokens
Tokens
@cf/meta/llama-2-7b-chat-int8 by Cloudflare Workers AI costs $0.56 per 1M input tokens and $6.67 per 1M output tokens.