Integrate multimodal vision capabilities using Grok Vision Beta, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $5.00/1M input and $15.00/1M output tokens, native tool calling support. Access Grok Vision Beta via the xAI API with up to 4K output tokens.
Tokens
Tokens
Tokens
Grok Vision Beta by xAI costs $5.00 per 1M input tokens and $15.00 per 1M output tokens. Cached reads cost $5.00 per 1M tokens.