TokenMix Research Lab · 2026-06-08

Tavily AI API Pricing 2026: 1K Free Credits, Agent Math
Last Updated: 2026-06-08 Author: TokenMix Research Lab Data verified: 2026-06-08 - Tavily credits and pricing docs, Search API reference, pricing page, and Tavily FAQ
Tavily pricing is simple until agents start searching repeatedly. The free tier is 1,000 credits/month, not unlimited web access.
Tavily docs say Researcher includes 1,000 free API credits per month with no credit card, pay-as-you-go is $0.008 per credit, and monthly plans range from 4,000 to 100,000 credits. Search API docs show basic, fast, and ultra-fast search cost 1 credit, while auto-parameters may set advanced search and use 2 credits. Agent builders should budget per successful task, not per search call.
Table of Contents
- Quick Verdict
- Pricing Table
- Search Depth Costs
- Agent Cost Math
- Tavily vs Alternatives
- Rate Limit and Credit Risks
- Setup Pattern
- Search Intent Map
- Cost Per Task Calculator
- Decision Matrix
- Monitoring Checklist
- Non-Claims and Caveats
- Final Recommendation
- FAQ
- Sources
- Related Articles
Quick Verdict
| Claim | Status | Source |
|---|---|---|
| Tavily offers 1,000 free API credits per month | Confirmed | Tavily credits docs |
| Tavily pay-as-you-go is $0.008 per credit | Confirmed | Tavily credits docs |
| Tavily Project plan lists 4,000 credits for $30 | Confirmed | Tavily credits docs |
| Tavily Growth plan lists 100,000 credits for $500 | Confirmed | Tavily credits docs |
| Basic/fast/ultra-fast search costs 1 credit | Confirmed | Tavily Search API |
| Tavily free tier is unlimited for production agents | False | Free tier is credit-limited |
| Advanced search is better for every query | False | Tavily help says basic is suited for concise queries |
| Agent search cost will grow with tool loops | Likely | Each repeated search consumes credits |
Pricing Table
| Plan | Credits/month | Monthly price | Effective credit price | Status |
|---|---|---|---|---|
| Researcher | 1,000 | Free | N/A | Confirmed |
| Project | 4,000 | $30 | $0.0075 | Confirmed |
| Bootstrap | 15,000 | $100 | $0.0067 | Confirmed |
| Startup | 38,000 | $220 | $0.0058 | Confirmed |
| Growth | 100,000 | $500 | $0.005 | Confirmed |
| Pay as you go | Per usage | $0.008/credit | $0.008 | Confirmed |
If Tavily is being used to ground model answers, compare the search cost with OpenAI Realtime Voice, AI Agent Architecture, and AI API Gateway.
Search Depth Costs
| API behavior | Credit cost | When to use | Status |
|---|---|---|---|
| Basic search | 1 credit | Simple/direct queries | Confirmed |
| Fast search | 1 credit | Speed-first search | Confirmed |
| Ultra-fast search | 1 credit | Low-latency discovery | Confirmed |
| Advanced search | 2 credits | Multi-step/deeper queries | Confirmed |
| Auto parameters | Can become 2 credits | Let Tavily infer depth | Confirmed |
| Include raw content | May increase latency | Need extracted content | Confirmed |
Credit cost is only half the story. Latency and result quality determine whether a search reduces or increases total agent cost.
Agent Cost Math
Scenario 1: simple RAG lookup. 1 search per user question, 10,000 questions/month, basic search = 10,000 credits. That is above the free tier and near a paid plan decision.
Scenario 2: research agent. 5 searches per task, 5,000 tasks/month, advanced search = 50,000 credits. Growth-style pricing becomes relevant.
Scenario 3: runaway loop. 12 searches per failed task, 2,000 failures/month, advanced search = 48,000 wasted credits. The fix is max tool calls per task.
| Workload | Searches/task | Credit depth | Monthly tasks | Credits/month |
|---|---|---|---|---|
| FAQ grounding | 1 | 1 | 1,000 | 1,000 |
| Support bot | 2 | 1 | 10,000 | 20,000 |
| Research agent | 5 | 2 | 5,000 | 50,000 |
| News monitor | 3 | 1 | 50,000 | 150,000 |
| Failed loop | 12 | 2 | 2,000 | 48,000 |
Tavily vs Alternatives
| Need | Tavily fit | Alternative | Status |
|---|---|---|---|
| Agent-ready search snippets | Strong | Exa, Brave, SerpAPI | Likely |
| High-volume scraping | Medium | Firecrawl/crawler stack | Likely |
| Low-cost simple search | Medium | Brave Search API | Likely |
| Deep extraction | Good with raw content/extract | Firecrawl | Likely |
| RAG grounding | Strong | Provider web search | Confirmed |
Tavily is strongest when your LLM needs curated search context. It is not automatically the cheapest crawler.
Rate Limit and Credit Risks
| Risk | Symptom | Fix | Status |
|---|---|---|---|
| Credit exhaustion | Search starts failing | Per-user search budget | Confirmed |
| Auto advanced search | Cost doubles unexpectedly | Pin search_depth |
Confirmed |
| Raw content latency | Agent feels slow | Use only when needed | Confirmed |
| Duplicate searches | Same query repeated | Cache by normalized query | Likely |
| No-card false assumption | Free means unlimited | Track credits | Confirmed |
The first production guardrail is a max_searches_per_task setting.
Setup Pattern
def tavily_search_policy(task_type, remaining_credits):
if remaining_credits < 100:
return {"enabled": False, "reason": "credit_guard"}
if task_type in {"quick_faq", "docs_lookup"}:
return {"search_depth": "basic", "max_results": 5}
if task_type == "deep_research":
return {"search_depth": "advanced", "max_results": 8, "max_calls": 3}
return {"search_depth": "basic", "max_calls": 1}
curl -X POST https://api.tavily.com/search \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $TAVILY_API_KEY" \
-d '{"query":"latest OpenAI Realtime API pricing","search_depth":"basic"}'
Search Intent Map
| Search query | What the user really needs | Best answer | Status |
|---|---|---|---|
tavily ai api pricing |
A current, non-marketing answer | Compare official limits and cost controls | Confirmed |
tavily ai api pricing pricing |
Whether this becomes a monthly bill | Use per-task math, not sticker price | Confirmed |
tavily ai api pricing free |
Whether a no-cost path exists | Treat free quota as testing capacity | Likely |
tavily ai api pricing error |
Why setup fails | Check auth, quota, region, and model access | Likely |
tavily ai api pricing alternative |
Whether another route is safer | Compare direct API, gateway, and self-hosting | Likely |
This is the reason the article is structured around tables instead of a narrative review. Search traffic for these terms usually comes from blocked developers, not readers browsing AI news.
Cost Per Task Calculator
| Cost component | Formula | Why it matters | Status |
|---|---|---|---|
| Input tokens | input MTok x input price | Long prompts dominate retrieval and agents | Confirmed |
| Output tokens | output MTok x output price | Reasoning and verbose answers compound cost | Confirmed |
| Retry waste | failed calls x average cost | 429 and timeout loops become real spend | Likely |
| Human review | minutes saved or added x hourly rate | Tooling can shift, not remove, labor cost | Likely |
| Infrastructure | storage, runners, or hosted platform cost | Non-token cost often appears later | Confirmed |
Use this minimum calculator before choosing a provider: 30 days x calls per day x average input tokens x input price, plus 30 days x calls per day x average output tokens x output price. Then add retries. If the retry rate is 10%, your apparent price is already 1.1x before latency or support cost.
| Monthly calls | Avg input | Avg output | Token volume | Operational reading |
|---|---|---|---|---|
| 1,000 | 1K | 300 | 1M in / 0.3M out | Prototype |
| 10,000 | 2K | 600 | 20M in / 6M out | Small app |
| 100,000 | 4K | 1K | 400M in / 100M out | Production workload |
| 1,000,000 | 2K | 500 | 2B in / 500M out | Procurement problem |
Decision Matrix
| If your situation is... | Default move | Why | Confidence |
|---|---|---|---|
| You are still prototyping | Use the lowest-friction official route | Learning speed beats premature optimization | Likely |
| You have user-facing traffic | Add fallback and spend caps before launch | Users feel quota failures immediately | Confirmed |
| You have compliance constraints | Prefer direct vendor, cloud marketplace, or audited gateway | Procurement trail matters | Likely |
| You have high volume but flexible latency | Test batch or async processing | Batch discounts can beat realtime routes | Confirmed where documented |
| You have unknown token shape | Run a 7-day sample before committing | Average prompts hide tail risk | Likely |
| You need newest model features | Check direct provider docs first | Gateways and clouds may lag direct release | Likely |
The durable rule: do not optimize for the cheapest successful demo. Optimize for the cheapest successful month with logs, retries, fallback, and support.
def pick_route(stage, traffic, compliance, latency_flexible):
if stage == "prototype" and traffic < 1000:
return "official_free_or_low_cost_route"
if compliance == "strict":
return "direct_vendor_or_cloud_marketplace"
if latency_flexible and traffic > 100000:
return "batch_or_async_route"
if traffic > 10000:
return "gateway_with_budget_caps"
return "direct_api_with_monitoring"
Monitoring Checklist
| Metric | Alert threshold | Why | Status |
|---|---|---|---|
| 429 rate | >2% sustained | Quota is now user-visible | Confirmed |
| Retry multiplier | >1.1x | Hidden cost leak | Likely |
| Fallback rate | >10% | Primary route is unstable | Likely |
| Output/input ratio | Sudden 2x jump | Prompt or model behavior changed | Likely |
| Cost per successful task | Week-over-week increase | Real business KPI | Confirmed |
| Error by model | Any model-specific spike | Route or provider issue | Confirmed |
| User-level spend | Outlier user >5x median | Abuse or runaway workflow | Likely |
The operational test is simple: if you cannot answer which model, user, route, or retry loop created the cost, you are not ready to scale that workflow.
Non-Claims and Caveats
| Not claimed | Reason | Label |
|---|---|---|
| Universal benchmark superiority | No single benchmark covers every workload and provider route | False as a broad claim |
| Permanent free availability | Free tiers and previews can change | Speculation |
| Guaranteed model access in every region | Providers gate by region, tier, quota, or account status | False as a broad claim |
| Refund availability without official text | Refund terms must come from provider policy or support | Speculation |
| Identical pricing across direct API, cloud, and gateway | Routing layer, region, priority, and batch mode can change cost | False as a broad claim |
| Production safety from docs alone | Real workloads need logs and failure drills | Confirmed |
This article uses official docs for hard numbers and marks forward-looking guidance as Likely or Speculation. If a provider changes a price, model name, rate limit, or credit rule after the data verification date, the conclusion should be rechecked before procurement.
Final Recommendation
Use Tavily when search quality matters more than raw scraping volume. Pin search depth, cap searches per task, cache repeated queries, and treat the 1,000-credit free tier as a prototype budget.
FAQ
How many free Tavily credits do I get?
Tavily documents 1,000 free API credits per month with no credit card required.
How much is Tavily pay-as-you-go?
Tavily lists pay-as-you-go at $0.008 per credit.
How many credits does Tavily search use?
Basic, fast, and ultra-fast search cost 1 credit. Advanced search costs more and auto-parameters can move a request to advanced.
Is Tavily good for AI agents?
Yes for search grounding and research-style agents. It needs credit caps because agents can loop.
Can Tavily replace a crawler?
Sometimes, but not always. For high-volume structured crawling, a crawler-first tool may be cheaper or more controllable.
What is the main cost trap?
Letting an agent search repeatedly without caching, max call limits, or search depth controls.
Should I use advanced search by default?
No. Use basic for simple queries and advanced for multi-step, deeper research needs.
Sources
- Tavily Credits and Pricing
- Tavily Search API
- Tavily Pricing Page
- Tavily Basic vs Advanced Search
- Tavily FAQ
- Tavily CLI Docs
- TokenMix AI API Gateway
- TokenMix Agent Architecture