Question 1

What is the cost of GLM 4.7 Flash by routing.run?

Accepted Answer

GLM 4.7 Flash by routing.run costs $1.32 per 1M input tokens and $4.40 per 1M output tokens. Cached reads cost Free per 1M tokens. Cache writes cost Free per 1M tokens.

Question 2

What is the context window of GLM 4.7 Flash by routing.run?

Accepted Answer

GLM 4.7 Flash by routing.run has a context window of 128K tokens. It supports up to 128K input tokens and can generate up to 128K output tokens.

Question 3

What are the capabilities of GLM 4.7 Flash by routing.run?

Accepted Answer

GLM 4.7 Flash by routing.run supports advanced reasoning, tool calling/function calling, adjustable temperature. It has open weights.

Question 4

What input and output types does GLM 4.7 Flash support?

Accepted Answer

GLM 4.7 Flash by routing.run accepts text as input and can generate text as output.

Question 5

Does GLM 4.7 Flash by routing.run support reasoning?

Accepted Answer

Yes, GLM 4.7 Flash by routing.run supports advanced reasoning capabilities.

Question 6

Is GLM 4.7 Flash by routing.run open source?

Accepted Answer

Yes, GLM 4.7 Flash by routing.run has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 7

Does GLM 4.7 Flash support function calling?

Accepted Answer

Yes, GLM 4.7 Flash by routing.run supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.

Question 8

What is the knowledge cutoff date for GLM 4.7 Flash?

Accepted Answer

GLM 4.7 Flash by routing.run has a knowledge cutoff date of 2025-04. This means the model was trained on data available up until that date.

GLM 4.7 Flash API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)

GLM 4.7 Flash API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)