GPT-OSS 120B API via TokenMix

Use GPT-OSS 120B from OpenAI as a chat model through the TokenMix AI API relay and multi-model gateway.

OpenAI's open-weight mixture-of-experts model with 120B total parameters (5.1B active per token), released under Apache 2.0. Features configurable chain-of-thought reasoning and runs on a single 80GB GPU via MXFP4 quantization.

API access

Base URL: https://api.tokenmix.ai/v1
Model ID: gpt-oss-120b
OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $0.1455/M tokens, output $0.582/M tokens

Capabilities

Function calling, JSON mode, Streaming, Reasoning

Model specs

Context: 131K tokens
Max output: 131K tokens

Availability

2/2 available API endpoints are healthy right now.

Recent performance

TTFT 1313ms, latency 8887ms, throughput 419.6 tok/s.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart