Question 1

What is the cost of GLM-4.5-Flash by Zhipu AI?

Accepted Answer

GLM-4.5-Flash by Zhipu AI is free to use: there is no charge per 1M input tokens or per 1M output tokens. Cached reads and cache writes are also free.

Question 2

What is the context window of GLM-4.5-Flash by Zhipu AI?

Accepted Answer

GLM-4.5-Flash by Zhipu AI has a context window of 131K tokens. It supports up to 131K input tokens and can generate up to 98K output tokens.
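To make the arithmetic concrete, here is a minimal sketch of a budget check built on the limits quoted above. It assumes the rounded figures (131K and 98K) are exact, and that input and output tokens share the same context window; neither assumption is confirmed by the answer itself. The function name `max_output_budget` is hypothetical.

```python
# Limits as quoted in the answer, rounded to thousands.
# Assumption: the exact limits may differ slightly from these rounded values.
CONTEXT_WINDOW = 131_000  # total tokens the model can attend to
MAX_INPUT = 131_000       # maximum input tokens
MAX_OUTPUT = 98_000       # maximum output tokens


def max_output_budget(prompt_tokens: int) -> int:
    """Return how many output tokens can still be requested for a prompt."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens > MAX_INPUT:
        raise ValueError("prompt exceeds the input limit")
    # Assumption: input and output share the same context window.
    remaining = CONTEXT_WINDOW - prompt_tokens
    return min(MAX_OUTPUT, remaining)


print(max_output_budget(10_000))   # short prompt: capped by the output limit
print(max_output_budget(100_000))  # long prompt: capped by the remaining window
```

Under these assumptions, a short prompt is limited by the 98K output cap, while a long prompt is limited by whatever space is left in the window.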

Question 3

What are the capabilities of GLM-4.5-Flash by Zhipu AI?

Accepted Answer

GLM-4.5-Flash by Zhipu AI supports advanced reasoning, tool calling (function calling), and an adjustable temperature. It also has open weights.

Question 4

What input and output types does GLM-4.5-Flash support?

Accepted Answer

GLM-4.5-Flash by Zhipu AI accepts text as input and can generate text as output.

Question 5

Does GLM-4.5-Flash by Zhipu AI support reasoning?

Accepted Answer

Yes, GLM-4.5-Flash by Zhipu AI supports advanced reasoning capabilities.

Question 6

Is GLM-4.5-Flash by Zhipu AI open source?

Accepted Answer

Yes, GLM-4.5-Flash by Zhipu AI has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 7

Does GLM-4.5-Flash support function calling?

Accepted Answer

Yes, GLM-4.5-Flash by Zhipu AI supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.
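As a rough illustration of what tool calling involves on the client side, the sketch below describes one tool and dispatches a tool call to a local function. It assumes the JSON-schema tool format used by OpenAI-compatible chat APIs; whether the GLM-4.5-Flash API uses exactly this shape is not stated here. The tool `get_weather`, the `REGISTRY` table, and the `dispatch` helper are all hypothetical, and the example call is hand-written rather than real model output. No network request is made.

```python
import json


# Hypothetical tool; the name and parameter are illustrative only.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"


# Tool description in the JSON-schema style of OpenAI-compatible chat APIs
# (assumed format, not confirmed for this model's API).
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

# Maps tool names the model may request to local Python callables.
REGISTRY = {"get_weather": get_weather}


def dispatch(tool_call: dict) -> str:
    """Run the local function named in a model-issued tool call."""
    name = tool_call["function"]["name"]
    # Arguments are assumed to arrive as a JSON-encoded string.
    args = json.loads(tool_call["function"]["arguments"])
    if name not in REGISTRY:
        raise KeyError(f"unknown tool: {name}")
    return REGISTRY[name](**args)


# A tool call shaped like what a model might return (hand-written here).
example_call = {
    "function": {"name": "get_weather", "arguments": '{"city": "Beijing"}'}
}
print(dispatch(example_call))
```

In a real conversation, `TOOLS` would be sent along with the messages, and the string returned by `dispatch` would be passed back to the model as the tool result.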

Question 8

What is the knowledge cutoff date for GLM-4.5-Flash?

Accepted Answer

GLM-4.5-Flash by Zhipu AI has a knowledge cutoff of April 2025 (2025-04). This means the model was trained on data available up to that month.

GLM-4.5-Flash API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)
