Build advanced reasoning agents with NVIDIA: Llama 3.1 Nemotron Ultra 253B v1, a specialized AI model optimized for complex logic and chain-of-thought tasks. This model delivers cost-effective pricing at $0.60/1M input and $1.80/1M output tokens, open weights architecture. Access NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 via the Kilo Gateway API with up to 26K output tokens.
Tokens
Tokens
Tokens
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 by Kilo Gateway costs $0.60 per 1M input tokens and $1.80 per 1M output tokens.