Question 1

What is the cost of Llama 3.1 405b Instruct by Nvidia?

Accepted Answer

Llama 3.1 405b Instruct by Nvidia is free to use: it costs nothing per 1M input tokens and nothing per 1M output tokens.

Question 2

What is the context window of Llama 3.1 405b Instruct by Nvidia?

Accepted Answer

Llama 3.1 405b Instruct by Nvidia has a context window of 128K tokens. It supports up to 128K input tokens and can generate up to 4K output tokens.
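As a rough illustration of how these limits might be checked before sending a request, here is a minimal sketch. It assumes that "128K" and "4K" mean 128,000 and 4,000 tokens (they could equally mean 131,072 and 4,096), and that input and output tokens share the same context window. The function name `fits_budget` is hypothetical, and token counts are taken as already computed by whatever tokenizer you use.

```python
# Assumption: "128K" and "4K" are read as round decimal numbers.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT_TOKENS = 4_000


def fits_budget(input_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the stated token limits.

    Assumption: input and output tokens count against one shared window.
    """
    if input_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        input_tokens <= CONTEXT_WINDOW
        and requested_output_tokens <= MAX_OUTPUT_TOKENS
        and input_tokens + requested_output_tokens <= CONTEXT_WINDOW
    )
```

A caller could use this to trim a prompt or lower the requested output length before the request is rejected by the provider.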

Question 3

What are the capabilities of Llama 3.1 405b Instruct by Nvidia?

Accepted Answer

Llama 3.1 405b Instruct by Nvidia supports tool calling (function calling), structured output, and adjustable temperature. It has open weights.

Question 4

What input and output types does Llama 3.1 405b Instruct support?

Accepted Answer

Llama 3.1 405b Instruct by Nvidia accepts text as input and can generate text as output.

Question 5

Is Llama 3.1 405b Instruct by Nvidia open source?

Accepted Answer

Yes, Llama 3.1 405b Instruct by Nvidia has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 6

Does Llama 3.1 405b Instruct support function calling?

Accepted Answer

Yes, Llama 3.1 405b Instruct by Nvidia supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.
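To make the idea concrete, the sketch below shows one way the client side of tool calling could look. It assumes the provider uses the widely adopted OpenAI-style tool schema and tool-call message shape, which this page does not confirm. The `get_weather` function, the `TOOLS` list, and `dispatch_tool_call` are all hypothetical names for illustration, and no network request is made.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical local tool; a real one would query a weather service."""
    return f"Sunny in {city}"


# Tool description sent along with the chat request (assumed OpenAI-style schema).
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

# Maps tool names the model may request to local Python callables.
REGISTRY = {"get_weather": get_weather}


def dispatch_tool_call(tool_call: dict) -> dict:
    """Run the tool the model asked for and build the reply message."""
    name = tool_call["function"]["name"]
    # Arguments arrive as a JSON-encoded string in the assumed format.
    args = json.loads(tool_call["function"]["arguments"])
    result = REGISTRY[name](**args)
    return {"role": "tool", "tool_call_id": tool_call["id"], "content": result}
```

In this pattern the model never runs code itself: it returns a tool call, the client executes it with `dispatch_tool_call`, and the resulting message is appended to the conversation for the next model turn.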

Llama 3.1 405b Instruct API

Input Modalities

Output Modalities

Standard (per 1M tokens)
