Gemini 2.5 Flash Lite

by Google · chat

Google's fastest and most economical multimodal model. Optimized for low latency and high-volume use cases. Supports adjustable thinking budgets. Deprecated March 31, 2026.

Pricing

Input: $0.095/M tokens · Output: $0.38/M tokens

Capabilities

Vision, Function Calling, JSON Mode, Streaming, Reasoning

Context: 1049K tokens

Max output: 66K tokens

Routes: 1/1 healthy

Performance

TTFT: 583ms · Latency: 2373ms · Throughput: 5.3 tok/s