Integrate multimodal vision capabilities using Qwen3 VL Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.70/1M input and $2.80/1M output tokens, native tool calling support, open weights architecture. Access Qwen3 VL Instruct via the Vercel AI Gateway API with up to 129K output tokens.
Tokens
Tokens
Tokens
Qwen3 VL Instruct by Vercel AI Gateway costs $0.70 per 1M input tokens and $2.80 per 1M output tokens.