Process massive datasets with Llama 3.1 405b Instruct, featuring an expansive 128K context window for long-document analysis. This model delivers available completely free of charge, native tool calling support, open weights architecture. Access Llama 3.1 405b Instruct via the Nvidia API with up to 4K output tokens.
Tokens
Tokens
Tokens
Llama 3.1 405b Instruct by Nvidia costs Free per 1M input tokens and Free per 1M output tokens.