Integrate multimodal vision capabilities using Meta: Llama 3.2 11B Vision Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.05/1M input and $0.05/1M output tokens, open weights architecture. Access Meta: Llama 3.2 11B Vision Instruct via the Kilo Gateway API with up to 16K output tokens.
Tokens
Tokens
Tokens
Meta: Llama 3.2 11B Vision Instruct by Kilo Gateway costs $0.05 per 1M input tokens and $0.05 per 1M output tokens.