LLM Model Comparison

Compare pricing, capabilities, and ratings of 14 LLM APIs. Use "Pick For Me" for an instant recommendation, or select up to 3 for a detailed side-by-side.

Data as of April 2026 · Read the full guide

Pick For Me

Answer 3 questions and get an instant recommendation.

Llama 4 Maverick

Meta (Open)mediumVisionTools
Input
Free/1M tok
Output
Free/1M tok
Context
1.0M
Max output
33K
Coding
Reasoning
Creative
Instructions
Multilingual

Self-hosting, data-sensitive workloads, and avoiding vendor lock-in

GPT-4.1 nano

OpenAIfastVisionTools
Input
$0.10/1M tok
Output
$0.40/1M tok
Context
1.0M
Max output
16K
Coding
Reasoning
Creative
Instructions
Multilingual

Classification, routing, extraction, and ultra-budget workloads

GPT-4o mini

OpenAIfastVisionTools
Input
$0.15/1M tok
Output
$0.60/1M tok
Context
128K
Max output
16K
Coding
Reasoning
Creative
Instructions
Multilingual

Budget-friendly chatbots, classification, and high-volume simple tasks

Gemini 2.5 Flash

GooglefastVisionTools
Input
$0.15/1M tok
Output
$0.60/1M tok
Context
1.0M
Max output
66K
Coding
Reasoning
Creative
Instructions
Multilingual

High-speed, budget workloads that benefit from large context windows

DeepSeek V3

DeepSeekmediumTools
Input
$0.27/1M tok
Output
$1.10/1M tok
Context
131K
Max output
8K
Coding
Reasoning
Creative
Instructions
Multilingual

Budget coding tasks and open-weight enthusiasts

GPT-4.1 mini

OpenAIfastVisionTools
Input
$0.40/1M tok
Output
$1.60/1M tok
Context
1.0M
Max output
33K
Coding
Reasoning
Creative
Instructions
Multilingual

Budget-friendly long-context tasks and coding assistance

DeepSeek R1

DeepSeekslow
Input
$0.55/1M tok
Output
$2.19/1M tok
Context
131K
Max output
8K
Coding
Reasoning
Creative
Instructions
Multilingual

Math, logic, and tasks requiring step-by-step reasoning

Claude Haiku 3.5

AnthropicfastVisionTools
Input
$0.80/1M tok
Output
$4/1M tok
Context
200K
Max output
8K
Coding
Reasoning
Creative
Instructions
Multilingual

High-volume, cost-sensitive workloads: classification, extraction, simple chat

Gemini 2.5 Pro

GooglemediumVisionTools
Input
$1.25/1M tok
Output
$10/1M tok
Context
1.0M
Max output
66K
Coding
Reasoning
Creative
Instructions
Multilingual

Long document processing, research, and tasks needing massive context

GPT-4.1

OpenAImediumVisionTools
Input
$2/1M tok
Output
$8/1M tok
Context
1.0M
Max output
33K
Coding
Reasoning
Creative
Instructions
Multilingual

Coding tasks, long-context processing, and instruction-heavy workflows

Mistral Large

MistralmediumTools
Input
$2/1M tok
Output
$6/1M tok
Context
128K
Max output
8K
Coding
Reasoning
Creative
Instructions
Multilingual

European teams needing data sovereignty and strong multilingual support

GPT-4o

OpenAImediumVisionTools
Input
$2.50/1M tok
Output
$10/1M tok
Context
128K
Max output
16K
Coding
Reasoning
Creative
Instructions
Multilingual

General-purpose tasks, multimodal apps, and teams in the OpenAI ecosystem

Claude Sonnet 4

AnthropicmediumVisionTools
Input
$3/1M tok
Output
$15/1M tok
Context
200K
Max output
16K
Coding
Reasoning
Creative
Instructions
Multilingual

Best all-rounder for coding, chat, and most production workloads

Claude Opus 4

AnthropicslowVisionTools
Input
$15/1M tok
Output
$75/1M tok
Context
200K
Max output
32K
Coding
Reasoning
Creative
Instructions
Multilingual

Complex reasoning, agentic workflows, and tasks requiring deep analysis

💰 Cost Estimator
3,000 requests/month · 6M input tokens · 1.5M output tokens
ModelPer requestMonthly
Llama 4 Maverickcheapest<$0.01<$0.01/mo
GPT-4.1 nano<$0.01$1.2/mo
GPT-4o mini<$0.01$1.8/mo
Gemini 2.5 Flash<$0.01$1.8/mo
DeepSeek V3<$0.01$3.3/mo
GPT-4.1 mini<$0.01$4.8/mo
DeepSeek R1<$0.01$6.6/mo
Claude Haiku 3.5<$0.01$10.8/mo
Mistral Large<$0.01$21.0/mo
Gemini 2.5 Pro<$0.01$22.5/mo
GPT-4.1<$0.01$24.0/mo
GPT-4o$0.01$30.0/mo
Claude Sonnet 4$0.01$40.5/mo
Claude Opus 4$0.07$203/mo

Not sure which model fits your app?

Read the full comparison guide or get a personalized recommendation.