Qwen Max

by Qwen · chat

Top-tier flagship model with strong reasoning and generation capabilities; supports thinking mode.

Pricing

Input: $0.322286/M tokens · Output: .289143/M tokens
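
Per-million-token pricing converts to a per-request cost with a single multiplication. A minimal sketch, assuming prices are in USD and billing is linear in token count; `token_cost` and `INPUT_PRICE_PER_M` are hypothetical names used only for illustration, and the output price is left as an argument for the caller to supply:

```python
# Input price copied from the listing above (assumed USD per million tokens).
INPUT_PRICE_PER_M = 0.322286


def token_cost(tokens: int, price_per_million: float) -> float:
    """Return the cost of `tokens` tokens at a per-million-token price."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return tokens * price_per_million / 1_000_000


if __name__ == "__main__":
    # Cost of a 10,000-token prompt at the listed input price.
    print(f"{token_cost(10_000, INPUT_PRICE_PER_M):.6f}")
```

The output side uses the same function: `token_cost(output_tokens, output_price_per_million)`.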

Capabilities

Function Calling, JSON Mode, Streaming, Reasoning

Context: 33K tokens

Max output: 8K tokens

Routes: 1/1 healthy

Performance

TTFT: 2475ms · Latency: 31020ms · Throughput: 4.5 tok/s
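
The listing does not say how these figures are measured. As a rough sketch only, assume total response time is the time to first token plus generation time at a constant throughput; the function name `estimate_latency_ms` and this model are illustrative assumptions, not the provider's definition:

```python
def estimate_latency_ms(
    output_tokens: int,
    ttft_ms: float = 2475.0,      # TTFT from the listing
    tokens_per_s: float = 4.5,    # throughput from the listing
) -> float:
    """Estimate total response time in ms (assumed model: TTFT + tokens / throughput)."""
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    return ttft_ms + output_tokens / tokens_per_s * 1000.0


if __name__ == "__main__":
    # Estimated time for a 100-token reply under the assumptions above.
    print(f"{estimate_latency_ms(100):.0f} ms")
```

Under this model, the listed 31020 ms latency would correspond to a reply of roughly 128 tokens, since (31020 - 2475) / 1000 × 4.5 ≈ 128.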