Gemini 3.1 Flash-Lite API via TokenMix
Use Gemini 3.1 Flash-Lite from Google as a chat model through the TokenMix AI API relay and multi-model gateway.
Google's most cost-efficient Gemini model for high-volume workloads. Based on Gemini 3 Pro architecture. 45% faster output generation than Gemini 2.5 Flash with matching quality.
API access
- Base URL:
https://api.tokenmix.ai/v1 - Model ID:
gemini-3.1-flash-lite - OpenAI SDK compatible. Change the base URL and use your TokenMix API key.
Pricing
Input $0.2425/M tokens, output $1.455/M tokens
Capabilities
Vision, Function calling, JSON mode, Streaming, Reasoning
Model specs
- Context: 1049K tokens
- Max output: 66K tokens
Availability
1/1 available API endpoints are healthy right now.
Recent performance
TTFT 1159ms, latency 1574ms, throughput 252.2 tok/s.
Start using this model
Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.