GLM-5 API via TokenMix

Use GLM-5 from Zhipu as a chat model through the TokenMix AI API relay and multi-model gateway.

Zhipu's latest flagship 744B MoE model (40B active) with coding capabilities aligned to Claude Opus 4.5. Excels at long-horizon agentic planning and execution, 200K context. MIT license.

API access

  • Base URL: https://api.tokenmix.ai/v1
  • Model ID: glm-5
  • OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $0.525547/M tokens, output $2.364964/M tokens

Capabilities

Function calling, JSON mode, Streaming, Reasoning

Model specs

  • Context: 200K tokens
  • Max output: 128K tokens

Availability

3/3 available API endpoints are healthy right now.

Recent performance

TTFT 548ms, latency 3655ms, throughput 12.9 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart