Question 1

What is the cost of GLM-4.7-Flash by NovitaAI?

Accepted Answer

GLM-4.7-Flash by NovitaAI costs $0.07 per 1M input tokens and $0.40 per 1M output tokens. Cached reads cost $0.01 per 1M tokens.

Question 2

What is the context window of GLM-4.7-Flash by NovitaAI?

Accepted Answer

GLM-4.7-Flash by NovitaAI has a context window of 200K tokens. It supports up to 200K input tokens and can generate up to 128K output tokens.

Question 3

What are the capabilities of GLM-4.7-Flash by NovitaAI?

Accepted Answer

GLM-4.7-Flash by NovitaAI supports advanced reasoning, tool calling/function calling, structured output, adjustable temperature. It has open weights.

Question 4

What input and output types does GLM-4.7-Flash support?

Accepted Answer

GLM-4.7-Flash by NovitaAI accepts text as input and can generate text as output.

Question 5

Does GLM-4.7-Flash by NovitaAI support reasoning?

Accepted Answer

Yes, GLM-4.7-Flash by NovitaAI supports advanced reasoning capabilities.

Question 6

Is GLM-4.7-Flash by NovitaAI open source?

Accepted Answer

Yes, GLM-4.7-Flash by NovitaAI has open weights, meaning the model weights are publicly available for download and self-hosting.

Question 7

Does GLM-4.7-Flash support function calling?

Accepted Answer

Yes, GLM-4.7-Flash by NovitaAI supports tool calling (also known as function calling), allowing it to interact with external tools and APIs during conversations.

Question 8

What is the knowledge cutoff date for GLM-4.7-Flash?

Accepted Answer

GLM-4.7-Flash by NovitaAI has a knowledge cutoff date of 2025-04. This means the model was trained on data available up until that date.

GLM-4.7-Flash API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)

GLM-4.7-Flash API

Input Modalities

Output Modalities

Standard (per 1M tokens)

Caching (per 1M tokens)