Build advanced reasoning agents with GLM 4.7 Flash, a specialized AI model optimized for complex logic and chain-of-thought tasks. The model is priced at $1.32 per 1M input tokens and $4.40 per 1M output tokens, supports native tool calling, and has an open-weights architecture. Access GLM 4.7 Flash via the routing.run API with up to 128K output tokens.
GLM 4.7 Flash by routing.run costs $1.32 per 1M input tokens and $4.40 per 1M output tokens. Cached reads and cache writes are free.
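To make the per-token pricing concrete, here is a minimal sketch that estimates the cost of a single request from the listed rates. The function name `estimate_cost_usd` and the constant names are hypothetical, chosen for illustration; they are not part of the routing.run API. The sketch also assumes billing is strictly linear in token count and ignores cached reads and writes, since those are listed as free.

```python
from decimal import Decimal

# Listed rates for GLM 4.7 Flash, in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("1.32")
OUTPUT_USD_PER_MILLION = Decimal("4.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate request cost in USD, assuming linear per-token billing.

    Hypothetical helper: cached reads and cache writes are listed as free,
    so they are not counted here.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION
    return (input_cost + output_cost) / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Example: 200,000 input tokens and 50,000 output tokens.
    cost = estimate_cost_usd(200_000, 50_000)
    print(f"Estimated cost: ${cost:.4f}")  # prints "Estimated cost: $0.4840"
```

`Decimal` is used instead of `float` so that small per-request amounts add up without binary rounding drift when many requests are summed.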