Llama 4 Maverick API via TokenMix
Use Llama 4 Maverick from Meta as a chat model through the TokenMix AI API relay and multi-model gateway.
Meta's first natively multimodal open-weight model using MoE architecture (17B active / 400B total, 128 experts). Beats GPT-4o and Gemini 2.0 Flash across broad benchmarks.
API access
- Base URL:
https://api.tokenmix.ai/v1 - Model ID:
llama-4-maverick - OpenAI SDK compatible. Change the base URL and use your TokenMix API key.
Pricing
Input $0.372/M tokens, output $1.581/M tokens
Capabilities
Vision, Function calling, JSON mode, Streaming
Model specs
- Context: 1000K tokens
- Max output: 16K tokens
Availability
2/2 available API endpoints are healthy right now.
Start using this model
Create an API key, top up from $1 when needed, and call this model through the TokenMix OpenAI-compatible endpoint.