TokenMix Research Lab · 2026-06-04

GitHub Copilot AI Credits 2026: Prices, Limits, Cost Math
Last Updated: 2026-06-04 Author: TokenMix Research Lab Data verified: 2026-06-04 - GitHub Copilot usage-based billing announcement, GitHub Docs billing tables, model pricing reference, budget controls, June 1 changelog
GitHub Copilot's June 1 billing change is simple in wording and sharp in practice: agentic usage now burns AI Credits by token. Completions stay included. Long coding-agent sessions do not.
GitHub announced that all Copilot plans transitioned to usage-based billing on June 1, 2026, replacing premium request units with GitHub AI Credits calculated from input, output, and cached tokens (GitHub Blog). One AI Credit equals $0.01, and paid individual plans now include 1,500 credits for Pro, 7,000 for Pro+, and 20,000 for Max (GitHub Docs). Business and Enterprise receive pooled monthly allowances of 1,900 and 3,900 credits per user, with temporary promotional allowances of 3,000 and 7,000 from June 1 to September 1, 2026 (GitHub Docs). The real cost swing comes from model choice: GPT-5.5 is $5/$30 per million input/output tokens, while MAI-Code-1-Flash is $0.75/$4.50 (GitHub Docs).
Table of Contents
- Quick Answer
- Confirmed Facts vs Misreads
- Plan Credit Allowances
- Model Pricing Table
- $10 Buys How Many Tokens
- Cost Per Coding Task
- Business and Enterprise Pool Math
- Budget Controls That Matter
- How To Cut Copilot Costs
- Hidden Cost Factors
- Final Recommendation
- FAQ
- Sources
- Related Articles
Quick Answer
| Question | Direct answer | Status |
|---|---|---|
| Did Copilot switch to AI Credits? | Yes, usage-based billing is live as of June 1, 2026 | Confirmed |
| Did base plan prices change? | No. Pro $10, Pro+ $39, Business $19/user, Enterprise $39/user stayed the same in the launch announcement | Confirmed |
| Do completions consume AI Credits? | No. Code completions and next edit suggestions remain included for paid plans | Confirmed |
| What consumes AI Credits? | Chat, CLI, cloud agent, Spaces, Spark, third-party coding agents, and other AI model features | Confirmed |
| Does code review have extra billing? | Yes. It consumes AI Credits and GitHub Actions minutes | Confirmed |
| Is there still a low-cost fallback after exhausting quota? | No automatic fallback under the new model | Confirmed |
| Can admins cap spending? | Yes, through user-level, cost-center, enterprise, and organization budgets | Confirmed |
| Is every task now more expensive? | No. Lightweight model usage can be cheap; long frontier-agent sessions can be expensive | Likely |
Confirmed Facts vs Misreads
| Claim | Status | Correct reading |
|---|---|---|
| "Copilot now charges for every autocomplete" | False | Completions and next edit suggestions stay included in paid plans (GitHub Docs) |
| "Copilot Pro is still $10/month" | Confirmed | Base plan price did not change in the announcement (GitHub Blog) |
| "Pro includes 1,500 monthly AI Credits" | Confirmed | 1,000 base plus 500 flex credits (GitHub Docs) |
| "Pro+ includes 7,000 monthly AI Credits" | Confirmed | 3,900 base plus 3,100 flex credits |
| "Max includes 20,000 monthly AI Credits" | Confirmed | 10,000 base plus 10,000 flex credits |
| "Business gets 1,900 credits per user" | Confirmed | Pooled at billing entity level |
| "Enterprise gets 3,900 credits per user" | Confirmed | Pooled at billing entity level |
| "Business / Enterprise promo credits last forever" | False | Promo period is June 1 to September 1, 2026 |
| "A quick chat and a multi-hour agent run still cost the same" | False | GitHub explicitly changed billing to reflect token usage |
| "Model choice now matters" | Confirmed | GitHub publishes per-model input, cached input, cache write, and output rates |
Plan Credit Allowances
| Plan | Monthly price | Base credits | Flex credits | Total monthly AI Credits | Effective included dollar value |
|---|---|---|---|---|---|
| Copilot Pro | $10 | 1,000 | 500 | 1,500 | $15 |
| Copilot Pro+ | $39 | 3,900 | 3,100 | 7,000 | $70 |
| Copilot Max | $100 | 10,000 | 10,000 | 20,000 | $200 |
| Copilot Business | $19/user | n/a | n/a | 1,900/user pooled | $19/user |
| Copilot Enterprise | $39/user | n/a | n/a | 3,900/user pooled | $39/user |
| Business promo | $19/user | n/a | n/a | 3,000/user pooled through 2026-09-01 | $30/user |
| Enterprise promo | $39/user | n/a | n/a | 7,000/user pooled through 2026-09-01 | $70/user |
The oddity is intentional: individual Pro gets $15 in monthly credits for a $10 subscription, Pro+ gets $70 for $39, and Max gets $200 for $100. GitHub calls part of that a flex allotment, a variable component that can change as AI economics change (GitHub Docs). That means the base credit amount is more stable than the flex amount.
Model Pricing Table
All prices below are GitHub's published Copilot per-1M-token rates as of June 4, 2026 (GitHub Docs).
| Provider | Model | Category | Input / 1M | Cached input / 1M | Output / 1M | Status |
|---|---|---|---|---|---|---|
| Microsoft | MAI-Code-1-Flash | Lightweight | $0.75 | $0.075 | $4.50 | Confirmed |
| OpenAI | GPT-5 mini | Lightweight | $0.25 | $0.025 | $2.00 | Confirmed |
| OpenAI | GPT-5.4 nano | Lightweight | $0.20 | $0.02 | $1.25 | Confirmed |
| OpenAI | GPT-5.4 mini | Lightweight | $0.75 | $0.075 | $4.50 | Confirmed |
| OpenAI | GPT-5.4 | Versatile | $2.50 | $0.25 | $15.00 | Confirmed |
| OpenAI | GPT-5.5 | Powerful | $5.00 | $0.50 | $30.00 | Confirmed |
| Anthropic | Claude Haiku 4.5 | Versatile | $1.00 | $0.10 | $5.00 | Confirmed |
| Anthropic | Claude Sonnet 4.6 | Versatile | $3.00 | $0.30 | $15.00 | Confirmed |
| Anthropic | Claude Opus 4.8 | Powerful | $5.00 | $0.50 | $25.00 | Confirmed |
| Gemini 3.1 Pro | Powerful | $2.00 | $0.20 | $12.00 | Confirmed | |
| Gemini 3.5 Flash | Lightweight | $1.50 | $0.15 | $9.00 | Confirmed |
The model-price spread is huge. GPT-5.5 output is 24x GPT-5.4 nano output. A careless model picker is now a billing bug.
$10 Buys How Many Tokens
| Model | $10 buys input tokens | $10 buys output tokens | Best use |
|---|---|---|---|
| GPT-5.4 nano | 50.0M | 8.0M | Simple chat, rewrite, classification |
| GPT-5 mini | 40.0M | 5.0M | Cheap coding support |
| MAI-Code-1-Flash | 13.3M | 2.22M | Lightweight coding inside Copilot |
| Claude Haiku 4.5 | 10.0M | 2.0M | Cheap Claude-style assistant work |
| Gemini 3.1 Pro | 5.0M | 0.83M | Long-ish powerful tasks |
| Claude Sonnet 4.6 | 3.33M | 0.67M | Serious coding/reasoning |
| GPT-5.5 | 2.0M | 0.33M | Frontier tasks only |
| Claude Opus 4.8 | 2.0M | 0.40M | High-value reasoning and coding |
Cost calculation: Copilot Pro has 1,500 included AI Credits, equal to $15 in credit value. On GPT-5.5 output, that is 500,000 output tokens. On GPT-5.4 nano output, it is 12,000,000 output tokens. Same plan. Very different runway.
Cost Per Coding Task
These are math examples using GitHub's model rates. They are not GitHub-provided workload benchmarks.
| Workload | Token shape | MAI-Code-1-Flash | Claude Sonnet 4.6 | GPT-5.5 |
|---|---|---|---|---|
| Small bug fix prompt | 3K in / 1K out | $0.0068 / 0.68 credits | $0.024 / 2.4 credits | $0.045 / 4.5 credits |
| Medium coding agent step | 10K in / 2K out | $0.0165 / 1.65 credits | $0.060 / 6 credits | $0.110 / 11 credits |
| Large repo context pass | 80K in / 5K out | $0.0825 / 8.25 credits | $0.315 / 31.5 credits | $0.550 / 55 credits |
| Heavy agent iteration | 250K in / 20K out | $0.2775 / 27.75 credits | $1.05 / 105 credits | $1.85 / 185 credits |
| Review-heavy task | 100K in / 40K out | $0.255 / 25.5 credits | $0.900 / 90 credits | $1.700 / 170 credits |
Cost calculation 1: a Pro user with 1,500 credits can run about 136 medium GPT-5.5 agent steps at 11 credits each, before additional usage. The same credit pool can run about 909 MAI-Code-1-Flash medium steps at 1.65 credits each.
Cost calculation 2: a Pro+ user with 7,000 credits can run about 63 heavy GPT-5.5 agent iterations at 110 credits each if using a 120K/15K shape. Switch that same workflow to Claude Sonnet 4.6 at $3/$15 and it becomes about 111 iterations at 63 credits each.
Cost calculation 3: output dominates. A 5K input / 50K output generation on GPT-5.5 costs $1.525, or 152.5 credits. On MAI-Code-1-Flash, it costs $0.22875, or 22.875 credits. Long answers are where credit surprises show up.
Business and Enterprise Pool Math
| Organization | Plan | Seat fee | Standard credit pool | Promo pool through Sep 1 | Credit value delta |
|---|---|---|---|---|---|
| 25 users | Business | $475/mo | 47,500 | 75,000 | +$275 |
| 25 users | Enterprise | $975/mo | 97,500 | 175,000 | +$775 |
| 100 users | Business | $1,900/mo | 190,000 | 300,000 | +$1,100 |
| 100 users | Enterprise | $3,900/mo | 390,000 | 700,000 | +$3,100 |
| 400 users | Business | $7,600/mo | 760,000 | 1,200,000 | +$4,400 |
| 400 users | Enterprise | $15,600/mo | 1,560,000 | 2,800,000 | +$12,400 |
Pooled credits are good for teams because light users subsidize heavy users. They are bad for surprise costs if admins do not set user-level budgets. GitHub states there is no automatic lower-cost fallback when budgets are exhausted; usage is either blocked or continues into metered spend depending on policy (GitHub Docs).
Budget Controls That Matter
| Control | What it caps | When active | Hard stop? | Probe verdict |
|---|---|---|---|---|
| Universal user-level budget | Each user's total AI Credit consumption | Pool and metered phases | Yes | Turn on first |
| Individual user-level budget | One user's total consumption | Pool and metered phases | Yes | Use for power users |
| Cost-center budget | Metered charges after pool is exhausted | Metered phase only | Optional | Useful but not enough |
| Enterprise budget | Total enterprise metered charges after pool exhaustion | Metered phase only | Optional | Must enable stop-at-limit |
| Organization-level budget | Repository/org spend tracking | Org context | Depends | Good for visibility |
GitHub's budget docs contain one easy-to-miss line: "Stop usage when budget limit is reached" is off by default for enterprise spending limits and cost-center budgets (GitHub Docs). If you do not enable it, charges can continue past the limit.
How To Cut Copilot Costs
| Lever | Effort | Expected saving | Status | Why |
|---|---|---|---|---|
| Use lightweight models for simple tasks | Low | 50-90% | Confirmed math | Model price spread is large |
Cap max_tokens / output length |
Low | 20-70% | Confirmed math | Output is often the expensive side |
| Use cached context when available | Medium | 50-90% on reused context | Confirmed pricing | Cached input is much cheaper |
| Set universal user-level budgets | Low | Prevents runaway spend | Confirmed | Hard stop during pool and metered phases |
| Route agent tasks by complexity | Medium | 30-80% | Likely | Use frontier only when needed |
| Avoid Copilot code review on huge PRs without policy | Medium | Prevents dual billing | Confirmed risk | Code review uses AI Credits plus Actions minutes |
| Monitor credit burn by workflow | Medium | Improves decisions | Likely | Request count no longer reflects cost |
| Use external model gateways for non-GitHub workflows | Medium | Variable | Likely | TokenMix/OpenRouter/LiteLLM can route by cost |
For non-GitHub application calls, compare this with TokenMix's AI API gateway, OpenRouter alternatives, and Claude API pricing. Copilot Credits are great inside Copilot. They are not a universal model-routing layer for your product.
Hidden Cost Factors
| Hidden factor | Real impact | Status |
|---|---|---|
| Agentic loops | Multiple model calls per task | Confirmed |
| Large repository context | Input tokens can spike quickly | Confirmed |
| Long generated diffs | Output token burn dominates | Confirmed |
| Code review Actions minutes | Dual billing path | Confirmed |
| Model auto-selection | May choose higher-cost model if not visible | Likely |
| Prompt scheduling | Background prompts can burn credits while unattended | Likely |
| Third-party coding agents | GitHub says they consume AI Credits | Confirmed |
| Budget stop setting off by default | Spending caps may not stop spend | Confirmed |
The practical metric is no longer "how many Copilot requests did we use?" It is "how many credits did one merged PR, one resolved bug, or one review cycle consume?"
Final Recommendation
Keep Copilot for completions and normal IDE flow. Put strict budgets on agentic features. Route simple work to cheap models. Reserve GPT-5.5, Claude Opus, and long cloud-agent sessions for tasks where the value per merged change beats the credit burn.
FAQ
What changed in GitHub Copilot billing on June 1, 2026?
GitHub Copilot moved from premium request units to usage-based GitHub AI Credits. Usage is calculated from input, output, and cached tokens at the model's published rates.
Does GitHub Copilot autocomplete cost AI Credits now?
No. Code completions and next edit suggestions remain included for paid Copilot plans and do not consume AI Credits.
How many AI Credits does Copilot Pro include?
Copilot Pro includes 1,500 monthly AI Credits: 1,000 base credits and 500 flex credits. One AI Credit equals $0.01.
How many AI Credits do Business and Enterprise include?
Copilot Business includes 1,900 credits per user per month, pooled at the billing entity level. Copilot Enterprise includes 3,900. Existing customers temporarily receive 3,000 and 7,000 from June 1 to September 1, 2026.
What is the cheapest Copilot model?
Among GitHub's listed rates, GPT-5.4 nano is cheaper than MAI-Code-1-Flash on both input and output. MAI-Code-1-Flash is Microsoft's lightweight coding model inside Copilot, but not the absolute cheapest listed model.
Why did my Copilot bill increase?
Likely causes are agentic sessions, frontier model selection, long output, code review dual billing, or missing budget controls. Under the new system, a quick chat and a multi-step coding agent run no longer cost the same.
Can admins stop Copilot from overspending?
Yes. Use universal user-level budgets first because they are hard stops. Also set enterprise or cost-center spending limits and enable "Stop usage when budget limit is reached."
Should I replace Copilot with an API gateway?
No for IDE completions. Yes for product workloads that need multi-provider routing, fallback, and cost-per-task optimization. Copilot is an IDE and GitHub workflow product; TokenMix is a model-routing gateway.
Sources
- GitHub Blog - GitHub Copilot is moving to usage-based billing
- GitHub Changelog - Updates to Copilot billing and plans
- GitHub Docs - Usage-based billing for individuals
- GitHub Docs - Usage-based billing for organizations and enterprises
- GitHub Docs - Models and pricing for GitHub Copilot
- GitHub Docs - Budgets for usage-based billing
- GitHub Docs - Plans for GitHub Copilot
- GitHub Blog - Copilot App agent-native desktop experience
- GitHub Changelog - Cloud and local sandboxes for Copilot
- GitHub Changelog - MAI-Code-1-Flash for Copilot