Qwen VL Max API via TokenMix

Use Qwen VL Max from Qwen as a chat model through the TokenMix AI API relay and multi-model gateway.

Qwen2.5-VL series flagship vision-language model with advanced image understanding, OCR, and visual reasoning. Supports up to 131K tokens with enhanced high-resolution image processing.

API access

  • Base URL: https://api.tokenmix.ai/v1
  • Model ID: qwen-vl-max
  • OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $0.211765/M tokens, output $0.529412/M tokens

Capabilities

Vision, Function calling, JSON mode, Streaming

Model specs

  • Context: 131K tokens
  • Max output: 8K tokens

Availability

1/1 available API endpoints are healthy right now.

Recent performance

TTFT 1023ms, latency 11683ms, throughput 27.7 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart