GLM-5
by Zhipu · chat
智谱最新旗舰744B MoE模型,编程对齐Claude Opus 4.5,擅长长程智能体规划与执行
Pricing
Input: $0.95/M tokens · Output: $3.04/M tokens
Capabilities
Function Calling, JSON Mode, Streaming, Reasoning
Context: 200K tokens
Max output: 128K tokens
Routes: 2/2 healthy
Performance
TTFT: 13529ms · Latency: 22373ms · Throughput: 30.4 tok/s