GLM-5.2 API via TokenMix

Use GLM-5.2 from Zhipu as a chat model through the TokenMix AI API relay and multi-model gateway.

GLM-5.2 is a new generation of open source flagship model designed by Zhipu AI for Long Horizon Task, which supports 1M lossless ultra-long context. With excellent programming and engineering capabilities, it can independently complete the complete link of task disassembly, architecture design, front-end and back-end development, joint testing to multi-end deployment, which is suitable for complex engineering tasks, long-range interaction, code generation, enterprise applications and other scenarios.

API access

Base URL: https://api.tokenmix.ai/v1
Model ID: glm-5.2
OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $1.117647/M tokens, output $3.911765/M tokens

Capabilities

Function calling, JSON mode, Streaming, Reasoning

Model specs

Context: 1000K tokens
Max output: 131K tokens

Availability

1/1 available API endpoints are healthy right now.

Recent performance

TTFT 1201ms, latency 5144ms, throughput 62.3 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart