Gemini 3.1 Flash-Lite API via TokenMix

Use Gemini 3.1 Flash-Lite from Google as a chat model through the TokenMix AI API relay and multi-model gateway.

Google's most cost-efficient Gemini model for high-volume workloads. Based on Gemini 3 Pro architecture. 45% faster output generation than Gemini 2.5 Flash with matching quality.

API access

Base URL: https://api.tokenmix.ai/v1
Model ID: gemini-3.1-flash-lite
OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $0.2425/M tokens, output $1.455/M tokens

Capabilities

Vision, Function calling, JSON mode, Streaming, Reasoning

Model specs

Context: 1049K tokens
Max output: 66K tokens

Availability

1/1 available API endpoints are healthy right now.

Recent performance

TTFT 2381ms, latency 6883ms, throughput 178.4 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart