Integrate multimodal vision capabilities using qwen/qwen3-vl-30b-a3b-instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.20/1M input and $0.70/1M output tokens, native tool calling support, open weights architecture. Access qwen/qwen3-vl-30b-a3b-instruct via the NovitaAI API with up to 33K output tokens.
Tokens
Tokens
Tokens
qwen/qwen3-vl-30b-a3b-instruct by NovitaAI costs $0.20 per 1M input tokens and $0.70 per 1M output tokens.