GPT-4o Mini API via TokenMix

Use GPT-4o Mini from OpenAI as a chat model through the TokenMix AI API relay and multi-model gateway.

GPT-4o Mini is a smaller, cost-efficient version of GPT-4o, replacing GPT-3.5 Turbo. It offers strong performance on text and vision tasks at 60% lower cost than its predecessor, with 128K context and 16K max output.

API access

  • Base URL: https://api.tokenmix.ai/v1
  • Model ID: gpt-4o-mini
  • OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $0.1455/M tokens, output $0.582/M tokens

Capabilities

Vision, Function calling, JSON mode, Streaming

Model specs

  • Context: 128K tokens
  • Max output: 16K tokens

Availability

3/3 available API endpoints are healthy right now.

Recent performance

TTFT 878ms, latency 4105ms, throughput 82.8 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart