Question 1

What is the cost of Llama 3.3 70B by Venice AI?

Accepted Answer

Llama 3.3 70B by Venice AI costs $0.70 per 1M input tokens and $2.80 per 1M output tokens.

Question 2

What is the context window of Llama 3.3 70B by Venice AI?

Accepted Answer

Llama 3.3 70B by Venice AI has a context window of 128K tokens. It supports up to 128K input tokens and can generate up to 4K output tokens.

Question 3

What are the capabilities of Llama 3.3 70B by Venice AI?

Accepted Answer

Llama 3.3 70B by Venice AI supports tool calling/function calling, adjustable temperature. It has open weights.

Question 4

What input and output types does Llama 3.3 70B support?

Accepted Answer

Llama 3.3 70B by Venice AI accepts text as input and can generate text as output.

Question 5

Is Llama 3.3 70B by Venice AI open source?

Accepted Answer

Yes, Llama 3.3 70B by Venice AI has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 6

Does Llama 3.3 70B support function calling?

Accepted Answer

Yes, Llama 3.3 70B by Venice AI supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.

Question 7

What is the knowledge cutoff date for Llama 3.3 70B?

Accepted Answer

Llama 3.3 70B by Venice AI has a knowledge cutoff date of 2023-12. This means the model was trained on data available up until that date.

Llama 3.3 70B API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Llama 3.3 70B API

Input Modalities

Output Modalities

Standard (per 1M tokens)