Process massive datasets with GLM 5.2 Short Fast Flex, featuring an expansive 200K context window for long-document analysis. This model delivers cost-effective pricing at $0.72/1M input and $2.25/1M output tokens, native tool calling support, open weights architecture. Access GLM 5.2 Short Fast Flex via the Neuralwatt API with up to 200K output tokens.
Tokens
Tokens
Tokens
GLM 5.2 Short Fast Flex by Neuralwatt costs $0.72 per 1M input tokens and $2.25 per 1M output tokens. Cached reads cost $0.18 per 1M tokens.