Question 1

What is the cost of GLM 4.6 by Venice AI?

Accepted Answer

GLM 4.6 by Venice AI costs $0.85 per 1M input tokens and $2.75 per 1M output tokens. Cached reads cost $0.30 per 1M tokens.

Question 2

What is the context window of GLM 4.6 by Venice AI?

Accepted Answer

GLM 4.6 by Venice AI has a context window of 198K tokens. It supports up to 198K input tokens and can generate up to 16K output tokens.

Question 3

What are the capabilities of GLM 4.6 by Venice AI?

Accepted Answer

GLM 4.6 by Venice AI supports advanced reasoning, tool calling/function calling, structured output, adjustable temperature. It has open weights.

Question 4

What input and output types does GLM 4.6 support?

Accepted Answer

GLM 4.6 by Venice AI accepts text as input and can generate text as output.

Question 5

Does GLM 4.6 by Venice AI support reasoning?

Accepted Answer

Yes, GLM 4.6 by Venice AI supports advanced reasoning capabilities.

Question 6

Is GLM 4.6 by Venice AI open source?

Accepted Answer

Yes, GLM 4.6 by Venice AI has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 7

Does GLM 4.6 support function calling?

Accepted Answer

Yes, GLM 4.6 by Venice AI supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.

Question 8

What is the knowledge cutoff date for GLM 4.6?

Accepted Answer

GLM 4.6 by Venice AI has a knowledge cutoff date of 2025-04. This means the model was trained on data available up until that date.

GLM 4.6 API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)

GLM 4.6 API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)