Kimi K2.7 Code HighSpeed API via TokenMix
Use Kimi K2.7 Code HighSpeed from Moonshot as a chat model through the TokenMix AI API relay and multi-model gateway.
Kimi K2.7 Code HighSpeed has the same capabilities as Kimi K2.7 Code, with output speed about 5-6x faster than the standard version. In common coding scenarios, using median input length, it reaches about 180 tokens/s, and can reach up to 260 tokens/s in short-context scenarios. It supports a 256K context window for a more extreme coding experience.
API access
- Base URL:
https://api.tokenmix.ai/v1 - Model ID:
kimi-k2.7-code-highspeed - OpenAI SDK compatible. Change the base URL and use your TokenMix API key.
Pricing
Input $1.816176/M tokens, output $7.544118/M tokens
Capabilities
Vision, Function calling, JSON mode, Streaming, Reasoning
Model specs
- Context: 262K tokens
- Max output: 33K tokens
Availability
1/1 available API endpoints are healthy right now.
Recent performance
TTFT 5618ms, latency 5647ms, throughput 34.5 tok/s.
Start using this model
Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.