Kimi K2.5

by Moonshot · chat

Kimi K2.5 is a 256K-context multimodal model supporting text/image/video input, reasoning mode, tool calling, JSON and structured output, and automatic context caching.

Pricing

Input: $0.57/M tokens · Output: $2.375/M tokens

Capabilities

Vision, Function Calling, JSON Mode, Streaming, Reasoning

Context: 262K tokens

Max output: 66K tokens

Routes: 4/4 healthy

Performance

TTFT: 2607ms · Latency: 7916ms · Throughput: 18.6 tok/s