Z.ai: GLM 4.7 Flash

z-ai/glm-4.7-flash

Released Jan 19, 2026202,752 context$0.06/M input tokens$0.40/M output tokens

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

OpenRouter

Product

Chat
Rankings
Apps
Models
Providers
Pricing
Enterprise
Labs

Company

About
Announcements
CareersHiring
Privacy
Terms of Service
Support
State of AI
Works With OR
Data

Developer

Documentation
API Reference
SDK
Status

Connect

Discord
GitHub
LinkedIn
X
YouTube

Z.ai: GLM 4.7 Flash

z-ai/glm-4.7-flash

Released Jan 19, 2026202,752 context$0.06/M input tokens$0.40/M output tokens

Recent activity on GLM 4.7 Flash

Total usage per day on OpenRouter

Prompt

2.7B

Reasoning

56.5M

Completion

15.4M

Prompt tokens measure input size. Reasoning tokens show internal thinking before a response. Completion tokens reflect total output length.