Integrate multimodal vision capabilities using Llama 3.2 11B Vision Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.05/1M input and $0.68/1M output tokens, open weights architecture. Access Llama 3.2 11B Vision Instruct via the Cloudflare Workers AI API with up to 128K output tokens.
Tokens
Tokens
Tokens
Llama 3.2 11B Vision Instruct by Cloudflare Workers AI costs $0.05 per 1M input tokens and $0.68 per 1M output tokens.