Qwen3.5 Flash API via TokenMix

Use Qwen3.5 Flash from Qwen as a chat model through the TokenMix AI API relay and multi-model gateway.

Qwen3.5 generation speed-optimized model with 1M context window and multimodal support. Cost-effective for high-throughput tasks including text, image, and video understanding.

API access

Base URL: https://api.tokenmix.ai/v1
Model ID: qwen3.5-flash
OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $0.026277/M tokens, output $0.262774/M tokens

Capabilities

Vision, Function calling, JSON mode, Streaming

Model specs

Context: 1000K tokens
Max output: 66K tokens

Availability

1/1 available API endpoints are healthy right now.

Recent performance

TTFT 13941ms, latency 38750ms, throughput 99.3 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart