Integrate multimodal vision capabilities using Qwen 2.5 VL 32B Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.05/1M input and $0.22/1M output tokens, native tool calling support, open weights architecture. Access Qwen 2.5 VL 32B Instruct via the IO.NET API with up to 4K output tokens.
Tokens
Tokens
Tokens
Qwen 2.5 VL 32B Instruct by IO.NET costs $0.05 per 1M input tokens and $0.22 per 1M output tokens. Cached reads cost $0.03 per 1M tokens. Cache writes cost $0.10 per 1M tokens.