Integrate multimodal vision capabilities using Qwen/Qwen2.5-VL-7B-Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.05/1M input and $0.05/1M output tokens, native tool calling support. Access Qwen/Qwen2.5-VL-7B-Instruct via the SiliconFlow API with up to 4K output tokens.
Tokens
Tokens
Tokens
Qwen/Qwen2.5-VL-7B-Instruct by SiliconFlow costs $0.05 per 1M input tokens and $0.05 per 1M output tokens.