Integrate multimodal vision capabilities using Qwen2.5 VL 32B Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.05/1M input and $0.22/1M output tokens, open weights architecture. Access Qwen2.5 VL 32B Instruct via the Chutes API with up to 16K output tokens.
Tokens
Tokens
Tokens
Qwen2.5 VL 32B Instruct by Chutes costs $0.05 per 1M input tokens and $0.22 per 1M output tokens.