Integrate multimodal vision capabilities using Llama 3.2 11B Vision Instruct, designed to process both text and images seamlessly. This model delivers available completely free of charge, open weights architecture. Access Llama 3.2 11B Vision Instruct via the OpenRouter API with up to 8K output tokens.
Tokens
Tokens
Tokens
Llama 3.2 11B Vision Instruct by OpenRouter costs Free per 1M input tokens and Free per 1M output tokens.