Integrate multimodal vision capabilities using Llama-4-Scout-17B-16E-Instruct-FP8, designed to process both text and images seamlessly. This model delivers available completely free of charge, native tool calling support, open weights architecture. Access Llama-4-Scout-17B-16E-Instruct-FP8 via the Llama API with up to 4K output tokens.
Tokens
Tokens
Tokens
Llama-4-Scout-17B-16E-Instruct-FP8 by Llama costs Free per 1M input tokens and Free per 1M output tokens.