Build advanced reasoning agents with GLM-4.5-Flash, a specialized AI model optimized for complex logic and chain-of-thought tasks. This model delivers available completely free of charge, native tool calling support, open weights architecture. Access GLM-4.5-Flash via the Zhipu AI API with up to 98K output tokens.
Tokens
Tokens
Tokens
GLM-4.5-Flash by Zhipu AI costs Free per 1M input tokens and Free per 1M output tokens. Cached reads cost Free per 1M tokens. Cache writes cost Free per 1M tokens.