LLM API Pricing Calculator: Compare OpenAI, Claude, Gemini, Mistral & More

Estimate real LLM API costs from your expected input and output tokens. Compare current models from OpenAI, Claude, Gemini, Mistral, Cohere, DeepSeek, Grok and more — with pricing, context windows and total cost in one place.

Input Tokens

Output Tokens

Minimum Context Window in Tokens

tokens

Use-case presets

Choose a realistic starting point, then fine-tune token values.

Include legacy, deprecated and retired models

Quick result for 1,000 input tokens + 1,000 output tokens

Cheapest overall

Gemini 2.5 Flash-Lite

$0.0005

Best pure price

Cheapest Gemini

Gemini 2.5 Flash-Lite

$0.0005

Low-cost Gemini option

Best long context

Gemini 2.5 Flash-Lite

$0.0005

Large context window

Model	Provider	Status	Context Window	Input / 1M	Output / 1M	Total Cost
Gemini 2.5 Flash-Lite	Google	CURRENT	1m tokens	$0.1000	$0.4000	$0.0005
Gemini 3.1 Flash-Lite	Google	CURRENT	1m tokens	$0.2500	$1.5000	$0.0018
Gemini 2.5 Flash	Google	CURRENT	1m tokens	$0.3000	$2.5000	$0.0028
Gemini 3.5 Flash	Google	CURRENT	1m tokens	$1.5000	$9.0000	$0.0105
Gemini 2.5 Pro	Google	CURRENT	1m tokens	$1.2500	$10.0000	$0.0113
Gemini 3.1 Pro	Google	CURRENT	1m tokens	$2.0000	$12.0000	$0.0140

CURRENT

Gemini 2.5 Flash-Lite

Google

Input Cost (per 1M) $0.1000

Output Cost (per 1M) $0.4000

Context Window 1m tokens

Input Cost (for 1.000 tokens) $0.0001

Output Cost (for 1.000 tokens) $0.0004

Total Cost $0.0005

View API Documentation →

CURRENT

Gemini 3.1 Flash-Lite

Google

Input Cost (per 1M) $0.2500

Output Cost (per 1M) $1.5000

Context Window 1m tokens

Input Cost (for 1.000 tokens) $0.0003

Output Cost (for 1.000 tokens) $0.0015

Total Cost $0.0018

View API Documentation →

CURRENT

Gemini 2.5 Flash

Google

Input Cost (per 1M) $0.3000

Output Cost (per 1M) $2.5000

Context Window 1m tokens

Input Cost (for 1.000 tokens) $0.0003

Output Cost (for 1.000 tokens) $0.0025

Total Cost $0.0028

View API Documentation →

CURRENT

Gemini 3.5 Flash

Google

Input Cost (per 1M) $1.5000

Output Cost (per 1M) $9.0000

Context Window 1m tokens

Input Cost (for 1.000 tokens) $0.0015

Output Cost (for 1.000 tokens) $0.0090

Total Cost $0.0105

View API Documentation →

CURRENT

Gemini 2.5 Pro

Google

Input Cost (per 1M) $1.2500

Output Cost (per 1M) $10.0000

Context Window 1m tokens

Input Cost (for 1.000 tokens) $0.0013

Output Cost (for 1.000 tokens) $0.0100

Total Cost $0.0113

View API Documentation →

CURRENT

Gemini 3.1 Pro

Google

Input Cost (per 1M) $2.0000

Output Cost (per 1M) $12.0000

Context Window 1m tokens

Input Cost (for 1.000 tokens) $0.0020

Output Cost (for 1.000 tokens) $0.0120

Total Cost $0.0140

View API Documentation →

Building with these APIs?

You compared model costs. DigitalOcean can help host the rest of your stack — API backend, Laravel app, queues, workers, database, or small AI prototype.

Explore DigitalOcean →

Frequently Asked Questions

Text generation API costs are calculated based on token usage - the fundamental unit of text processing. Providers charge for:

Input tokens: Text sent to the model (prompts, instructions, context)
Output tokens: Text generated by the model (completions, responses)

Each provider (OpenAI, Anthropic, Google Gemini, etc.) sets unique pricing tiers per 1,000,000 tokens, with premium models typically costing more than base models.

Input tokens represent the text you send to the LLM API (your prompt or context), while output tokens are what the model generates in response. For example:

Input: "Write a summary about Paris." (6 tokens)
Output: "Paris is the capital of France and a global center for art, fashion, and culture." (18 tokens)

Most providers charge different rates for input versus output tokens, with output tokens typically costing 2-5x more than input tokens.

Our Text generation API pricing database is monitored and updated regularly. We track official pricing pages, API documentation, and company announcements to try to ensure accuracy across all models from OpenAI, Anthropic, Google, Mistral, Cohere, and DeepSeek. If you notice any discrepancies, please feel free to send us a message to test@test.de.

The most cost-effective LLM depends on your specific requirements. OpenAI's GPT-4o-mini offers competitive pricing for general applications, while Anthropic's models excel at processing lengthy documents. Mistral and DeepSeek provide affordable alternatives for certain tasks. Our comparison tool helps you calculate exact costs based on your expected token usage and performance needs.

Yes, several strategies can optimize API costs:

Prompt engineering: Craft concise, effective prompts to reduce input tokens
Response parameters: Set maximum token limits for outputs
Caching: Store common responses to avoid redundant API calls
Model selection: Choose the most affordable model that meets your quality requirements
Batch processing: Combine multiple requests where possible

Each LLM has a maximum context window (the total tokens it can process at once). Context window sizes vary dramatically across providers, from Google Gemini's expansive 2M token capacity to more modest windows in other models. While OpenAI's GPT-4o and GPT-4o-mini share the same context window size, the mini version offers a more economical option. Similarly, Claude models offer large windows at different price points. Our calculator helps you determine if using a larger context model is more economical than breaking your task into multiple calls with a smaller-context, less expensive model.

While we strive to maintain accurate pricing information across all LLM providers, the rapid evolution of AI services means occasional discrepancies may occur. If you spot any errors in our pricing data or calculations, please feel free to contact us at test@test.de. We appreciate user feedback as it helps us maintain the most reliable comparison tool possible. However, we recommend that all users conduct their own due diligence and verify current pricing with the official provider documentation before making final decisions for production systems or budget-critical applications.