Process massive datasets with Meta-Llama-3.1-8B-Instruct (Fast), featuring an expansive 128K context window for long-document analysis. This model delivers cost-effective pricing at $0.03/1M input and $0.09/1M output tokens, native tool calling support, open weights architecture. Access Meta-Llama-3.1-8B-Instruct (Fast) via the Nebius Token Factory API with up to 4K output tokens.
Tokens
Tokens
Tokens
Meta-Llama-3.1-8B-Instruct (Fast) by Nebius Token Factory costs $0.03 per 1M input tokens and $0.09 per 1M output tokens. Cached reads cost $0.0030 per 1M tokens. Cache writes cost $0.03 per 1M tokens.