Integrate multimodal vision capabilities using Qwen/Qwen3-VL-30B-A3B-Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.29/1M input and $1.00/1M output tokens, native tool calling support. Access Qwen/Qwen3-VL-30B-A3B-Instruct via the SiliconFlow (China) API with up to 262K output tokens.
Tokens
Tokens
Tokens
Qwen/Qwen3-VL-30B-A3B-Instruct by SiliconFlow (China) costs $0.29 per 1M input tokens and $1.00 per 1M output tokens.