Qwen3.5 Flash

by Qwen · chat

通义千问3.5代速度优化模型,支持1M上下文和多模态。适合高吞吐量文本、图像和视频理解任务。

Pricing

Input: $0.026277/M tokens · Output: $0.262774/M tokens

Capabilities

Vision, Function Calling, JSON Mode, Streaming

Context: 1000K tokens

Max output: 66K tokens

Routes: 1/1 healthy

Performance

TTFT: 942ms · Latency: 2260ms · Throughput: 149.6 tok/s