Analyze long documents with NVIDIA: Llama 3.1 Nemotron 70B Instruct, which offers a 131K-token context window. The model is priced at $1.20 per 1M input tokens and $1.20 per 1M output tokens, and it supports native tool calling. Access NVIDIA: Llama 3.1 Nemotron 70B Instruct via the Kilo Gateway API with up to 16K output tokens.
On Kilo Gateway, NVIDIA: Llama 3.1 Nemotron 70B Instruct costs $1.20 per 1M input tokens and $1.20 per 1M output tokens.
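To make the per-million pricing concrete, here is a minimal sketch that estimates the cost of a single request from its token counts. The rates come from the figures quoted above; the helper name `estimate_cost_usd` and the example token counts are illustrative assumptions, not part of any Kilo Gateway SDK, and actual billing may differ (for example, through rounding or other charges).

```python
from decimal import Decimal

# Rates quoted on this page, in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("1.20")
OUTPUT_USD_PER_MILLION = Decimal("1.20")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed as tokens * rate / 1,000,000.
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative request: a 100,000-token document and a 2,000-token reply.
    print(estimate_cost_usd(100_000, 2_000))
```

Using `Decimal` rather than floats keeps the arithmetic exact for currency-style values. Because the input and output rates are identical here, the estimate depends only on the total token count.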