Gemini 3 Flash API via TokenMix

Use Gemini 3 Flash from Google as a chat model through the TokenMix AI API relay and multi-model gateway.

High-speed thinking model with 1M context for agentic workflows, multi-turn chat, and coding with configurable reasoning effort.

API access

  • Base URL: https://api.tokenmix.ai/v1
  • Model ID: gemini-3-flash-preview
  • OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $0.485/M tokens, output $2.91/M tokens

Capabilities

Vision, Function calling, JSON mode, Streaming, Reasoning

Model specs

  • Context: 1049K tokens
  • Max output: 66K tokens

Availability

1/1 available API endpoints are healthy right now.

Recent performance

TTFT 3294ms, latency 3674ms, throughput 1439.5 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart