Question 1

What is the cost of Llama 4 Maverick 17B 128E Instruct FP8 by Azure?

Accepted Answer

Llama 4 Maverick 17B 128E Instruct FP8 by Azure costs $0.25 per 1M input tokens and $1.00 per 1M output tokens.

Question 2

What is the context window of Llama 4 Maverick 17B 128E Instruct FP8 by Azure?

Accepted Answer

Llama 4 Maverick 17B 128E Instruct FP8 by Azure has a context window of 128K tokens. It supports up to 128K input tokens and can generate up to 8K output tokens.

Question 3

What are the capabilities of Llama 4 Maverick 17B 128E Instruct FP8 by Azure?

Accepted Answer

Llama 4 Maverick 17B 128E Instruct FP8 by Azure supports tool calling/function calling, adjustable temperature, file attachments. It has open weights.

Question 4

What input and output types does Llama 4 Maverick 17B 128E Instruct FP8 support?

Accepted Answer

Llama 4 Maverick 17B 128E Instruct FP8 by Azure accepts text, image as input and can generate text as output.

Question 5

Is Llama 4 Maverick 17B 128E Instruct FP8 by Azure open source?

Accepted Answer

Yes, Llama 4 Maverick 17B 128E Instruct FP8 by Azure has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 6

Does Llama 4 Maverick 17B 128E Instruct FP8 support function calling?

Accepted Answer

Yes, Llama 4 Maverick 17B 128E Instruct FP8 by Azure supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.

Question 7

What is the knowledge cutoff date for Llama 4 Maverick 17B 128E Instruct FP8?

Accepted Answer

Llama 4 Maverick 17B 128E Instruct FP8 by Azure has a knowledge cutoff date of 2024-08. This means the model was trained on data available up until that date.

Llama 4 Maverick 17B 128E Instruct FP8 API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Llama 4 Maverick 17B 128E Instruct FP8 API

Input Modalities

Output Modalities

Standard (per 1M tokens)