Integrate multimodal vision capabilities using Qwen3-VL 235B, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $1.64/1M input and $1.91/1M output tokens, native tool calling support, open weights architecture. Access Qwen3-VL 235B via the STACKIT API with up to 8K output tokens.
Tokens
Tokens
Tokens
Qwen3-VL 235B by STACKIT costs $1.64 per 1M input tokens and $1.91 per 1M output tokens.