Kimi K2.7 Code HighSpeed API via TokenMix

Use Kimi K2.7 Code HighSpeed from Moonshot as a chat model through the TokenMix AI API relay and multi-model gateway.

Kimi K2.7 Code HighSpeed has the same capabilities as Kimi K2.7 Code, with output speed about 5-6x faster than the standard version. In common coding scenarios, using median input length, it reaches about 180 tokens/s, and can reach up to 260 tokens/s in short-context scenarios. It supports a 256K context window for a more extreme coding experience.

API access

Base URL: https://api.tokenmix.ai/v1
Model ID: kimi-k2.7-code-highspeed
OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $1.816176/M tokens, output $7.544118/M tokens

Capabilities

Vision, Function calling, JSON mode, Streaming, Reasoning

Model specs

Context: 262K tokens
Max output: 33K tokens

Availability

1/1 available API endpoints are healthy right now.

Recent performance

TTFT 5618ms, latency 5647ms, throughput 34.5 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart