Qwen3.5 Flash API via TokenMix
Use Qwen3.5 Flash from Qwen as a chat model through the TokenMix AI API relay and multi-model gateway.
Qwen3.5 generation speed-optimized model with 1M context window and multimodal support. Cost-effective for high-throughput tasks including text, image, and video understanding.
API access
- Base URL:
https://api.tokenmix.ai/v1 - Model ID:
qwen3.5-flash - OpenAI SDK compatible. Change the base URL and use your TokenMix API key.
Pricing
Input $0.026277/M tokens, output $0.262774/M tokens
Capabilities
Vision, Function calling, JSON mode, Streaming
Model specs
- Context: 1000K tokens
- Max output: 66K tokens
Availability
1/1 available API endpoints are healthy right now.
Recent performance
TTFT 1788ms, latency 3421ms, throughput 113.0 tok/s.
Start using this model
Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.