Integrate multimodal vision capabilities using Llama-4-Scout-17B-16E-Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.15/1M input and $0.60/1M output tokens, native tool calling support, open weights architecture. Access Llama-4-Scout-17B-16E-Instruct via the Synthetic API with up to 4K output tokens.
Tokens
Tokens
Tokens
Llama-4-Scout-17B-16E-Instruct by Synthetic costs $0.15 per 1M input tokens and $0.60 per 1M output tokens.