Process massive datasets with GLM 4.6, featuring an expansive 198K context window for long-document analysis. This model delivers cost-effective pricing at $0.85/1M input and $2.75/1M output tokens, native tool calling support, open weights architecture. Access GLM 4.6 via the Venice AI API with up to 16K output tokens.
Tokens
Tokens
Tokens
GLM 4.6 by Venice AI costs $0.85 per 1M input tokens and $2.75 per 1M output tokens. Cached reads cost $0.30 per 1M tokens.