Integrate multimodal vision capabilities using UI-TARS 7B , designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.10/1M input and $0.20/1M output tokens, open weights architecture. Access UI-TARS 7B via the OpenRouter API with up to 2K output tokens.
Tokens
Tokens
Tokens
UI-TARS 7B by OpenRouter costs $0.10 per 1M input tokens and $0.20 per 1M output tokens. Cached reads cost $0.10 per 1M tokens.