Integrate multimodal vision capabilities using Llama 4 Scout 17B 16E Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.20/1M input and $0.78/1M output tokens, native tool calling support, open weights architecture. Access Llama 4 Scout 17B 16E Instruct via the Azure Cognitive Services API with up to 8K output tokens.
Tokens
Tokens
Tokens
Llama 4 Scout 17B 16E Instruct by Azure Cognitive Services costs $0.20 per 1M input tokens and $0.78 per 1M output tokens.