Question 1

What is the cost of Llama 3.2 3B Instruct by Inference?

Accepted Answer

Llama 3.2 3B Instruct by Inference costs $0.02 per 1M input tokens and $0.02 per 1M output tokens.

Question 2

What is the context window of Llama 3.2 3B Instruct by Inference?

Accepted Answer

Llama 3.2 3B Instruct by Inference has a context window of 16K tokens. It supports up to 16K input tokens and can generate up to 4K output tokens.

Question 3

What are the capabilities of Llama 3.2 3B Instruct by Inference?

Accepted Answer

Llama 3.2 3B Instruct by Inference supports tool calling/function calling, adjustable temperature. It has open weights.

Question 4

What input and output types does Llama 3.2 3B Instruct support?

Accepted Answer

Llama 3.2 3B Instruct by Inference accepts text as input and can generate text as output.

Question 5

Is Llama 3.2 3B Instruct by Inference open source?

Accepted Answer

Yes, Llama 3.2 3B Instruct by Inference has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 6

Does Llama 3.2 3B Instruct support function calling?

Accepted Answer

Yes, Llama 3.2 3B Instruct by Inference supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.

Question 7

What is the knowledge cutoff date for Llama 3.2 3B Instruct?

Accepted Answer

Llama 3.2 3B Instruct by Inference has a knowledge cutoff date of 2023-12. This means the model was trained on data available up until that date.

Llama 3.2 3B Instruct API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Llama 3.2 3B Instruct API

Input Modalities

Output Modalities

Standard (per 1M tokens)