GLM-5 API via TokenMix
Use GLM-5 from Zhipu as a chat model through the TokenMix AI API relay and multi-model gateway.
Zhipu's latest flagship 744B MoE model (40B active) with coding capabilities aligned to Claude Opus 4.5. Excels at long-horizon agentic planning and execution, 200K context. MIT license.
API access
- Base URL:
https://api.tokenmix.ai/v1 - Model ID:
glm-5 - OpenAI SDK compatible. Change the base URL and use your TokenMix API key.
Pricing
Input $0.525547/M tokens, output $2.364964/M tokens
Capabilities
Function calling, JSON mode, Streaming, Reasoning
Model specs
- Context: 200K tokens
- Max output: 128K tokens
Availability
3/3 available API endpoints are healthy right now.
Recent performance
TTFT 548ms, latency 3655ms, throughput 12.9 tok/s.
Start using this model
Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.