Process long documents with Llama-3.1-405B-Instruct, which offers a 131K-token context window for long-document analysis. The model is priced at $1.00 per 1M input tokens and $3.00 per 1M output tokens, supports native tool calling, and has open weights. Access Llama-3.1-405B-Instruct via the Nebius Token Factory API with up to 8K output tokens.
Llama-3.1-405B-Instruct on Nebius Token Factory costs $1.00 per 1M input tokens and $3.00 per 1M output tokens. Cached reads cost $0.10 per 1M tokens, and cache writes cost $1.25 per 1M tokens.
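To make the per-token rates concrete, here is a minimal sketch of a cost estimator in Python. The four rates come from the pricing listed above. The function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes each token category is billed independently at its own rate. Check your actual invoice for how cached tokens are counted against regular input tokens.

```python
from decimal import Decimal

# USD per 1M tokens, taken from the pricing listed above.
PRICE_PER_MILLION = {
    "input": Decimal("1.00"),
    "output": Decimal("3.00"),
    "cache_read": Decimal("0.10"),
    "cache_write": Decimal("1.25"),
}


def estimate_cost(input_tokens=0, output_tokens=0,
                  cache_read_tokens=0, cache_write_tokens=0):
    """Estimate the USD cost of a request (hypothetical helper).

    Assumption: each category is billed separately at its own rate,
    so cached tokens should not also be counted in input_tokens.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
        "cache_write": cache_write_tokens,
    }
    for name, count in counts.items():
        if count < 0:
            raise ValueError(f"{name} token count must be non-negative")
    # Sum rate * tokens per category, then scale from "per 1M" to per token.
    total = sum(PRICE_PER_MILLION[name] * Decimal(count)
                for name, count in counts.items())
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # A 100K-token document summarized into the full 8K-token output.
    print(estimate_cost(input_tokens=100_000, output_tokens=8_000))
```

Under these assumptions, a 100K-token prompt with an 8K-token response comes to roughly $0.12: $0.10 for input plus $0.024 for output. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.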