Question 1

What is the cost of GLM 5.2 Short Fast Flex by Neuralwatt?

Accepted Answer

GLM 5.2 Short Fast Flex by Neuralwatt costs $0.72 per 1M input tokens and $2.25 per 1M output tokens. Cached reads cost $0.18 per 1M tokens.

Question 2

What is the context window of GLM 5.2 Short Fast Flex by Neuralwatt?

Accepted Answer

GLM 5.2 Short Fast Flex by Neuralwatt has a context window of 200K tokens. It supports up to 200K input tokens and can generate up to 200K output tokens.

Question 3

What are the capabilities of GLM 5.2 Short Fast Flex by Neuralwatt?

Accepted Answer

GLM 5.2 Short Fast Flex by Neuralwatt supports tool calling/function calling, adjustable temperature. It has open weights.

Question 4

What input and output types does GLM 5.2 Short Fast Flex support?

Accepted Answer

GLM 5.2 Short Fast Flex by Neuralwatt accepts text as input and can generate text as output.

Question 5

Is GLM 5.2 Short Fast Flex by Neuralwatt open source?

Accepted Answer

Yes, GLM 5.2 Short Fast Flex by Neuralwatt has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 6

Does GLM 5.2 Short Fast Flex support function calling?

Accepted Answer

Yes, GLM 5.2 Short Fast Flex by Neuralwatt supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.

GLM 5.2 Short Fast Flex API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)

GLM 5.2 Short Fast Flex API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)