AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

contact@aimodelapis.com


© 2026 AI Model APIs. All rights reserved.

Ollama Cloud

deepseek-v4-flash API

About

Build advanced reasoning agents with deepseek-v4-flash, a model optimized for complex logic and chain-of-thought tasks. It is available free of charge, supports native tool calling, and ships with open weights. Access deepseek-v4-flash via the Ollama Cloud API with up to 1.0M output tokens.

Capabilities

Input Modalities

text

Output Modalities

text
Reasoning
Yes
Structured Output
No
Tool Use
Yes
Weights
Open
Temperature
Fixed
Attachment
Not Supported
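
The capability flags above can be read as constraints on a request body. The following is a minimal sketch, assuming an OpenAI-style chat payload (the `tools` field layout is an assumption, not something this listing confirms) and a hypothetical `get_weather` tool invented purely for illustration. It forwards tool definitions (Tool Use: Yes) and leaves out `temperature` and any structured-output setting (Temperature: Fixed, Structured Output: No).

```typescript
// Assumed OpenAI-style tool definition; the exact schema is not confirmed by the listing.
interface ToolDefinition {
  type: "function";
  function: {
    name: string;
    description: string;
    parameters: Record<string, unknown>; // JSON Schema describing the arguments
  };
}

interface RequestOptions {
  prompt: string;
  tools?: ToolDefinition[];
  temperature?: number; // accepted by the caller, but never sent
  responseFormat?: Record<string, unknown>; // accepted by the caller, but never sent
}

function buildRequestBody(options: RequestOptions): Record<string, unknown> {
  const body: Record<string, unknown> = {
    model: "deepseek-v4-flash",
    messages: [{ role: "user", content: options.prompt }],
  };
  // Tool Use: Yes -> forward tool definitions when any are provided.
  if (options.tools !== undefined && options.tools.length > 0) {
    body["tools"] = options.tools;
  }
  // Temperature: Fixed and Structured Output: No -> `temperature` and
  // `responseFormat` are deliberately not copied into the body.
  return body;
}

// Hypothetical tool, for illustration only.
const weatherTool: ToolDefinition = {
  type: "function",
  function: {
    name: "get_weather",
    description: "Look up the current weather for a city.",
    parameters: {
      type: "object",
      properties: { city: { type: "string" } },
      required: ["city"],
    },
  },
};

const body = buildRequestBody({
  prompt: "What is the weather in Lisbon?",
  tools: [weatherTool],
  temperature: 0.2, // dropped by buildRequestBody
});
```

Dropping the unsupported fields on the client side is one possible design choice; rejecting them with an error would be an equally reasonable alternative.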
Limits
Context Window
1.0M tokens
Input Limit
1.0M tokens
Max Output
1.0M tokens
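
A request can be checked against these limits before it is sent. The sketch below makes two assumptions that the listing does not state: that "1.0M" means exactly 1,000,000 tokens (the displayed figure may be rounded), and that input and output tokens share the context window, as is common for chat models. The function name `checkTokenBudget` is hypothetical.

```typescript
// Assumption: "1.0M" is treated as exactly 1,000,000 tokens.
const LIMITS = {
  contextWindow: 1_000_000,
  inputLimit: 1_000_000,
  maxOutput: 1_000_000,
};

// Returns a list of problems; an empty list means the request fits.
function checkTokenBudget(inputTokens: number, requestedOutputTokens: number): string[] {
  const problems: string[] = [];
  if (inputTokens > LIMITS.inputLimit) {
    problems.push(`input of ${inputTokens} tokens exceeds the input limit`);
  }
  if (requestedOutputTokens > LIMITS.maxOutput) {
    problems.push(`requested output of ${requestedOutputTokens} tokens exceeds max output`);
  }
  // Assumption: input and output together must fit inside the context window.
  if (inputTokens + requestedOutputTokens > LIMITS.contextWindow) {
    problems.push("input plus requested output exceeds the context window");
  }
  return problems;
}
```

Because all three limits are listed with the same value, the combined check is the one most likely to trigger in practice under these assumptions.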

Pricing

Standard (per 1M tokens)

Input
Free
Output
Free
Frequently Asked Questions

deepseek-v4-flash by Ollama Cloud is free to use: there is no charge per 1M input tokens or per 1M output tokens.

Model Details
ID
deepseek-v4-flash
Provider
Ollama Cloud
Family
deepseek-flash
Release Date
Apr 24, 2026
Knowledge Cutoff
N/A
API Integration
NPM Package
@ai-sdk/openai-compatible
Environment Variables
OLLAMA_API_KEY
API Base URL
https://ollama.com/v1
Documentation
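
The integration details above can be combined into a request as follows. This is a minimal, dependency-free sketch rather than the provider's documented client: it assumes the base URL exposes the OpenAI-style `/chat/completions` route and accepts the key as a Bearer token, neither of which is confirmed by this listing. `buildChatRequest` and the interface names are hypothetical.

```typescript
// Values taken from the listing above.
const BASE_URL = "https://ollama.com/v1";
const MODEL_ID = "deepseek-v4-flash";

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Assumptions: the `/chat/completions` path and Bearer authentication follow
// the OpenAI-compatible convention; check the provider documentation.
function buildChatRequest(apiKey: string, messages: ChatMessage[]): PreparedRequest {
  if (apiKey.trim() === "") {
    throw new Error("OLLAMA_API_KEY is empty");
  }
  return {
    url: `${BASE_URL}/chat/completions`,
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify({ model: MODEL_ID, messages }),
  };
}
```

In a Node.js runtime with a global `fetch`, the result could be used as `const req = buildChatRequest(process.env.OLLAMA_API_KEY ?? "", messages)` followed by `await fetch(req.url, req)`. The listed `@ai-sdk/openai-compatible` package is an alternative to building requests by hand.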