Llama 4 Maverick API via TokenMix

Use Llama 4 Maverick from Meta as a chat model through the TokenMix AI API relay and multi-model gateway.

Meta's first natively multimodal open-weight model using MoE architecture (17B active / 400B total, 128 experts). Beats GPT-4o and Gemini 2.0 Flash across broad benchmarks.

API access

  • Base URL: https://api.tokenmix.ai/v1
  • Model ID: llama-4-maverick
  • OpenAI SDK compatible. Change the base URL and use your TokenMix API key.

Pricing

Input $0.372/M tokens, output $1.581/M tokens

Capabilities

Vision, Function calling, JSON mode, Streaming

Model specs

  • Context: 1000K tokens
  • Max output: 16K tokens

Availability

2/2 available API endpoints are healthy right now.

Start using this model

Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.

Create API key · View pricing · Quickstart