Qwen3 VL Flash API via TokenMix

Use Qwen3 VL Flash from Qwen as a chat model through the TokenMix AI API relay and multi-model gateway.

Fast and affordable vision-language model for image and video understanding tasks

API access

Input $0.020357/M tokens, output $0.203571/M tokens

Vision, Function calling, JSON mode, Streaming, Reasoning

1/1 available API endpoints are healthy right now.

TTFT 982ms, latency 7358ms, throughput 133.8 tok/s.

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.