Build advanced reasoning agents with GLM-4.7-Flash, a specialized AI model optimized for complex logic and chain-of-thought tasks. This model delivers available completely free of charge, native tool calling support, open weights architecture. Access GLM-4.7-Flash via the Z.AI API with up to 131K output tokens.
Tokens
Tokens
Tokens
GLM-4.7-Flash by Z.AI costs Free per 1M input tokens and Free per 1M output tokens. Cached reads cost Free per 1M tokens. Cache writes cost Free per 1M tokens.