Integrate multimodal vision capabilities using paligemma, designed to process both text and images seamlessly. This model delivers available completely free of charge, open weights architecture. Access paligemma via the Nvidia API with up to 8K output tokens.
Tokens
Tokens
Tokens
paligemma by Nvidia costs Free per 1M input tokens and Free per 1M output tokens.