AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

© 2026 AI Model APIs. All rights reserved.

Nebius Token Factory

Qwen2.5-Coder-7B (Fast) API

About

Qwen2.5-Coder-7B (Fast) is an open-weights, text-only model served through the Nebius Token Factory API. It has a 128K-token context window, accepting up to 120K input tokens and producing up to 8K output tokens per request, which suits long-document and large-codebase analysis. It supports native tool calling and structured output, and is priced at $0.03 per 1M input tokens and $0.09 per 1M output tokens.

Capabilities

Input Modalities: text
Output Modalities: text
Reasoning: No
Structured Output: Yes
Tool Use: Yes
Weights: Open
Temperature: Adjustable
Attachment: Not Supported
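
Since the listing reports tool use as supported, a tool would normally be described to the model as a JSON Schema function definition. The sketch below assumes the endpoint accepts the OpenAI-style `tools` array, which the page does not state explicitly; `defineTool` and the `run_tests` tool are hypothetical names used only for illustration.

```typescript
// Shape of one entry in an OpenAI-style `tools` array (assumed format).
interface ToolDefinition {
  type: "function";
  function: {
    name: string;
    description: string;
    parameters: {
      type: "object";
      properties: Record<string, { type: string; description: string }>;
      required: string[];
    };
  };
}

// Hypothetical helper: wraps a name, description, and parameter map
// into a tool definition.
function defineTool(
  name: string,
  description: string,
  properties: Record<string, { type: string; description: string }>,
  required: string[],
): ToolDefinition {
  return {
    type: "function",
    function: {
      name,
      description,
      parameters: { type: "object", properties, required },
    },
  };
}

// Hypothetical example tool for a coding assistant.
const runTestsTool = defineTool(
  "run_tests",
  "Run the project's test suite and return the failures.",
  { path: { type: "string", description: "Directory containing the tests" } },
  ["path"],
);
```

The resulting object would be passed as one element of the `tools` field in a chat request body, alongside `model` and `messages`.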

Limits

Context Window: 128K tokens
Input Limit: 120K tokens
Max Output: 8K tokens
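
The three limits above can be checked before sending a request. This is a minimal sketch, assuming "K" means 1,000 tokens (the conservative reading; the page does not say whether it means 1,000 or 1,024) and that token counts come from a tokenizer you already have. `checkBudget` is a hypothetical helper name.

```typescript
// Limits from the listing, reading "K" as 1,000 tokens (assumption).
const LIMITS = {
  contextWindow: 128000,
  maxInput: 120000,
  maxOutput: 8000,
};

// Returns a list of human-readable problems; an empty list means the
// request fits within all three limits.
function checkBudget(inputTokens: number, maxOutputTokens: number): string[] {
  const problems: string[] = [];
  if (inputTokens > LIMITS.maxInput) {
    problems.push(`input of ${inputTokens} tokens exceeds the ${LIMITS.maxInput} input limit`);
  }
  if (maxOutputTokens > LIMITS.maxOutput) {
    problems.push(`requested output of ${maxOutputTokens} tokens exceeds the ${LIMITS.maxOutput} output limit`);
  }
  if (inputTokens + maxOutputTokens > LIMITS.contextWindow) {
    problems.push(`input plus output exceeds the ${LIMITS.contextWindow} context window`);
  }
  return problems;
}
```

A request that uses the full 120K input and 8K output allowance sits exactly at the 128K context window, so it passes all three checks.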

Pricing

Standard (per 1M tokens)

Input: $0.03
Output: $0.09

Caching (per 1M tokens)

Read: $0.0030
Write: $0.03

Frequently Asked Questions

Qwen2.5-Coder-7B (Fast) by Nebius Token Factory costs $0.03 per 1M input tokens and $0.09 per 1M output tokens. Cached reads cost $0.0030 per 1M tokens. Cache writes cost $0.03 per 1M tokens.
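
To turn these per-1M-token rates into a per-request figure, the arithmetic can be sketched as follows. It assumes cached reads and cache writes are billed at their own rates in place of the standard input rate for those tokens, which the page does not spell out; `estimateCostUsd` and the `Usage` shape are hypothetical.

```typescript
// Prices in USD per 1M tokens, as quoted on this page.
const PRICE_PER_MILLION = {
  input: 0.03,
  output: 0.09,
  cacheRead: 0.003,
  cacheWrite: 0.03,
};

interface Usage {
  inputTokens: number; // prompt tokens billed at the standard input rate
  outputTokens: number;
  cacheReadTokens?: number; // assumed to be billed at the cache-read rate only
  cacheWriteTokens?: number; // assumed to be billed at the cache-write rate only
}

// Estimated request cost in USD.
function estimateCostUsd(usage: Usage): number {
  const perToken = (pricePerMillion: number): number => pricePerMillion / 1e6;
  return (
    usage.inputTokens * perToken(PRICE_PER_MILLION.input) +
    usage.outputTokens * perToken(PRICE_PER_MILLION.output) +
    (usage.cacheReadTokens ?? 0) * perToken(PRICE_PER_MILLION.cacheRead) +
    (usage.cacheWriteTokens ?? 0) * perToken(PRICE_PER_MILLION.cacheWrite)
  );
}
```

For example, a request with 100,000 input tokens and 8,000 output tokens comes to 100,000 × $0.03/1M + 8,000 × $0.09/1M = $0.003 + $0.00072, or about $0.0037.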

Model Details

ID: Qwen/Qwen2.5-Coder-7B-fast
Provider: Nebius Token Factory
Family: (not listed)
Release Date: Sep 19, 2024
Knowledge Cutoff: Sep 1, 2024

API Integration

NPM Package: @ai-sdk/openai-compatible
Environment Variables: NEBIUS_API_KEY
API Base URL: https://api.tokenfactory.nebius.com/v1
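
The details above are enough to sketch a request by hand. Because the listed package targets OpenAI-compatible APIs, this example assumes the base URL exposes a `/chat/completions` route with Bearer-token authentication; neither is confirmed on this page. `buildChatRequest` is a hypothetical helper that only prepares the request, so it runs without network access.

```typescript
const BASE_URL = "https://api.tokenfactory.nebius.com/v1";
const MODEL_ID = "Qwen/Qwen2.5-Coder-7B-fast";

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
}

// Builds the URL, headers, and JSON body for a chat completion call.
// The route and payload follow the OpenAI-compatible convention (assumed).
function buildChatRequest(
  apiKey: string,
  messages: ChatMessage[],
  maxTokens: number = 1024,
): PreparedRequest {
  if (!apiKey) {
    throw new Error("Missing API key (expected in NEBIUS_API_KEY)");
  }
  return {
    url: `${BASE_URL}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({ model: MODEL_ID, messages, max_tokens: maxTokens }),
    },
  };
}
```

In a Node.js script, the key would typically be read from `process.env.NEBIUS_API_KEY` and the result sent with `fetch(request.url, request.init)`. Keeping `maxTokens` at or below the 8K output limit avoids requests the provider would likely reject.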