Integrate multimodal vision capabilities using Phi-4-multimodal, designed to process both text and images seamlessly. This model delivers cost-effective pricing at $0.08/1M input and $0.32/1M output tokens, open weights architecture. Access Phi-4-multimodal via the Azure Cognitive Services API with up to 4K output tokens.
Tokens
Tokens
Tokens
Phi-4-multimodal by Azure Cognitive Services costs $0.08 per 1M input tokens and $0.32 per 1M output tokens.