GLM-5

by Zhipu · chat

智谱最新旗舰744B MoE模型,编程对齐Claude Opus 4.5,擅长长程智能体规划与执行

Pricing

Input: $0.95/M tokens · Output: $3.04/M tokens

Capabilities

Function Calling, JSON Mode, Streaming, Reasoning

Context: 200K tokens

Max output: 128K tokens

Routes: 2/2 healthy

Performance

TTFT: 13529ms · Latency: 22373ms · Throughput: 30.4 tok/s