Free • No sign-up • 12 models • Caching included
How much does that prompt actually cost?
Compare per-request and at-scale costs across every major AI model in 2026 — including the prompt-caching discounts most calculators ignore. Math runs in your browser.
Monthly: 30,000 requests
How often your input prefix is reused (RAG context, system prompts, agent loops). Only applies to models with caching support.
Cost across 12 models
Verified May 28, 2026. At 1,000 requests/day, Gemini 3.5 Flash costs $2.34/mo vs $585.00/mo for Claude Opus 4.7. That's a 250× difference.
Estimates based on published API pricing as of May 28, 2026. Tokens are approximated client-side (~80% accurate). Actual bills depend on exact tokenization, volume discounts, and provider-specific rules. Re-check pricing with the provider before signing off on budgets.
How it works
Honest math, not vendor sales pitches
Token estimation
We count characters and divide by 4, a heuristic that is roughly 80% accurate. For exact numbers, paste actual token counts from a tokenizer such as tiktoken (for example, its cl100k_base encoding).
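The character-count heuristic above can be sketched in a few lines of Python. This is an illustration, not the page's actual client-side code: the function name `estimate_tokens` is hypothetical, and rounding up to a whole token is an assumption.

```python
import math

CHARS_PER_TOKEN = 4  # the divide-by-4 heuristic described above


def estimate_tokens(text: str) -> int:
    """Approximate a token count as characters / 4.

    Rounding up is an assumption made for this sketch; the page does not
    say how fractional tokens are handled.
    """
    return math.ceil(len(text) / CHARS_PER_TOKEN)


if __name__ == "__main__":
    prompt = "Summarize the attached report in three bullet points."
    print(estimate_tokens(prompt))
```

Because the estimate depends only on length, two prompts with the same character count get the same estimate even if a real tokenizer would split them differently, which is where the approximation error comes from.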
Caching discount
Anthropic offers 90% off cached input. OpenAI offers 50%. Google offers 75%. Slide the cache-hit rate up to see what your real bill looks like for repeat-context workflows.
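As a rough sketch of how a cache-hit rate could feed into input cost, the snippet below bills the cached share of input tokens at the discounted rate and the rest at full price. The discount figures come from the paragraph above. The function name, the per-million-token price argument, and the choice to ignore cache-write surcharges or minimum cacheable prefix sizes are all simplifying assumptions.

```python
# Discounts on cached input, as stated above (fraction taken off the input price).
CACHE_DISCOUNT = {"anthropic": 0.90, "openai": 0.50, "google": 0.75}


def input_cost(input_tokens: int, price_per_mtok: float,
               provider: str, cache_hit_rate: float) -> float:
    """Estimated input cost in dollars for one request.

    cache_hit_rate is the fraction (0.0 to 1.0) of input tokens served
    from cache. Assumption: cache writes cost the same as normal input.
    """
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    discount = CACHE_DISCOUNT[provider]
    cached = input_tokens * cache_hit_rate
    fresh = input_tokens - cached
    billable = fresh + cached * (1.0 - discount)
    return billable * price_per_mtok / 1_000_000


if __name__ == "__main__":
    # Hypothetical price of $10 per million input tokens, 50% cache hits.
    print(round(input_cost(1_000_000, 10.0, "anthropic", 0.5), 4))
```

With no cache hits the function reduces to plain token count times price, so models without caching support can be handled by passing a hit rate of 0.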
At-scale math
The slider scales from 1 request to 1M requests per day. We multiply per-request cost by requests per day and then by 30 days for a monthly view, the number you actually need to pitch to finance.
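The monthly projection can be sketched as below. The function name is hypothetical, and the per-request costs in the usage example are not published prices: they are back-calculated from the monthly figures quoted on this page ($2.34 and $585.00 at 1,000 requests/day).

```python
DAYS_PER_MONTH = 30          # the page uses a flat 30-day month
MIN_REQUESTS_PER_DAY = 1     # slider lower bound
MAX_REQUESTS_PER_DAY = 1_000_000  # slider upper bound


def monthly_cost(per_request_cost: float, requests_per_day: int) -> float:
    """Monthly cost = per-request cost x requests per day x 30 days."""
    if not MIN_REQUESTS_PER_DAY <= requests_per_day <= MAX_REQUESTS_PER_DAY:
        raise ValueError("requests_per_day is outside the slider range")
    return per_request_cost * requests_per_day * DAYS_PER_MONTH


if __name__ == "__main__":
    # Per-request costs implied by the page's example, not vendor list prices.
    cheap = monthly_cost(0.000078, 1_000)
    pricey = monthly_cost(0.0195, 1_000)
    print(f"${cheap:.2f}/mo vs ${pricey:.2f}/mo, ratio {pricey / cheap:.0f}x")
```

Since the projection is linear in requests per day, the ratio between two models stays the same at any volume; only the absolute dollar gap grows.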
More free tools
AI Prompt Improver
Cheaper bills start with shorter prompts.
Rewrite verbose prompts into tighter ones. Same quality, fewer tokens, lower bill.
Open prompt improver
ChatGPT Detector
Is that text AI-written?
Free AI writing detector for ChatGPT, Claude, and Gemini. Three-signal score with an honest accuracy disclaimer.
Open ChatGPT detector