AI Model APIs
AI Model APIsProviders
AI Model APIs

The complete platform for comparing AI models. Find pricing, capabilities, and the perfect model for your use case.

contact@aimodelapis.com

Resources

  • AI Models APIs
  • Providers

About

  • About
  • Contact
  • Privacy Policy
  • Terms of Service

© 2026 AI Model APIs. All rights reserved.

Back to Models
Deep Infra

GLM-4.7-Flash API

Deep Infra
About

Build advanced reasoning agents with GLM-4.7-Flash, a specialized AI model optimized for complex logic and chain-of-thought tasks. This model delivers cost-effective pricing at $0.06/1M input and $0.40/1M output tokens, native tool calling support, open weights architecture. Access GLM-4.7-Flash via the Deep Infra API with up to 16K output tokens.

Capabilities

Input Modalities

text

Output Modalities

text
Reasoning
Yes
Structured Output
No
Tool Use
Yes
WeightsOpen
Temperature
Adjustable
Attachment
Not Supported
Limits
Context Window
203K

Tokens

Input Limit
203K

Tokens

Max Output
16K

Tokens

Pricing

Standard (per 1M tokens)

Input
$0.06
Output
$0.40
Frequently Asked Questions

GLM-4.7-Flash by Deep Infra costs $0.06 per 1M input tokens and $0.40 per 1M output tokens.

Model Details
ID
zai-org/GLM-4.7-Flash
Provider
Deep Infra
Family
glm-flash
Release Date
Jan 19, 2026
Knowledge Cutoff
Apr 1, 2025
API Integration
NPM Package
@ai-sdk/deepinfra
Environment Variables
DEEPINFRA_API_KEY
Documentation