AI Model APIs
AI Model APIsProviders
AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

contact@aimodelapis.com

Resources

  • AI Models APIs
  • Providers

About

  • About
  • Contact
  • Privacy Policy
  • Terms of Service

© 2026 AI Model APIs. All rights reserved.

Back to Models
Nvidia

Nemotron 3 Ultra 550B A55B API

Nvidia
About

Build advanced reasoning agents with Nemotron 3 Ultra 550B A55B, a specialized AI model optimized for complex logic and chain-of-thought tasks. This model delivers cost-effective pricing at $0.50/1M input and $2.50/1M output tokens, native tool calling support, open weights architecture. Access Nemotron 3 Ultra 550B A55B via the Nvidia API with up to 66K output tokens.

Capabilities

Input Modalities

text

Output Modalities

text
Reasoning
Yes
Structured Output
Yes
Tool Use
Yes
WeightsOpen
Temperature
Adjustable
Attachment
Not Supported
Limits
Context Window
1.0M

Tokens

Input Limit
1.0M

Tokens

Max Output
66K

Tokens

Pricing

Standard (per 1M tokens)

Input
$0.50
Output
$2.50

Caching (per 1M tokens)

Read
$0.15
Frequently Asked Questions

Nemotron 3 Ultra 550B A55B by Nvidia costs $0.50 per 1M input tokens and $2.50 per 1M output tokens. Cached reads cost $0.15 per 1M tokens.

Model Details
ID
nvidia/nemotron-3-ultra-550b-a55b
Provider
Nvidia
Family
nemotron
Release Date
Jun 4, 2026
Knowledge Cutoff
N/A
API Integration
NPM Package
@ai-sdk/openai-compatible
Environment Variables
NVIDIA_API_KEY
API Base URL
https://integrate.api.nvidia.com/v1
Documentation