Integrate multimodal vision capabilities using Qwen2.5-VL-72B-Instruct, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.25/1M input and $0.75/1M output tokens, native tool calling support, open weights architecture. Access Qwen2.5-VL-72B-Instruct via the Nebius Token Factory API with up to 8K output tokens.
Tokens
Tokens
Tokens
Qwen2.5-VL-72B-Instruct by Nebius Token Factory costs $0.25 per 1M input tokens and $0.75 per 1M output tokens. Cached reads cost $0.03 per 1M tokens. Cache writes cost $0.31 per 1M tokens.