Integrate multimodal vision capabilities using Qwen3-Omni Flash Realtime, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.23/1M input and $0.92/1M output tokens, native tool calling support. Access Qwen3-Omni Flash Realtime via the Alibaba (China) API with up to 16K output tokens.
Tokens
Tokens
Tokens
Qwen3-Omni Flash Realtime by Alibaba (China) costs $0.23 per 1M input tokens and $0.92 per 1M output tokens.