Compare

AI model comparison

Compare the major AI models side by side — Claude, GPT, Gemini, Llama, and more — by context window, cost, modalities, and what each is genuinely best for. Calm, sourced, and free.

Cost is shown as a tier, not a live price — model pricing moves often, so always confirm current rates on the provider's page before you commit. Last updated 2026-07-01.

Model	Type	Context	Cost	Modalities	Open	Best for
Claude OpusAnthropic	Frontier	200K	$$$$	text, vision	—	The hardest reasoning, long agentic tasks, and code that has to be right
Claude SonnetAnthropic	Balanced	200K	$$	text, vision	—	The default for most work — reads code like a senior engineer at a fair price
Claude HaikuAnthropic	Fast & cheap	200K	$	text, vision	—	High-volume, low-latency work where speed and cost matter most
GPT-4oOpenAI	Frontier	128K	$$$	text, vision, audio	—	A strong all-rounder with native voice and vision in one model
GPT-4o miniOpenAI	Fast & cheap	128K	$	text, vision	—	Cheap, fast, high-volume tasks like extraction and classification
OpenAI o-seriesOpenAI	Reasoning	128K	$$$$	text, vision	—	Multi-step reasoning, math, and problems that reward slow thinking
Gemini 2.5 ProGoogle	Frontier	1M	$$$	text, vision, audio	—	Huge-context work — whole codebases, long documents, video reasoning
Gemini 2.5 FlashGoogle	Fast & cheap	1M	Free tier	text, vision, audio	—	Prototyping and high-volume work — capable, fast, with a usable free tier
LlamaMeta	Open weights	128K	Free tier	text, vision		Running locally or self-hosting — private, free to run, no vendor lock-in
QwenAlibaba	Open weights	128K	Free tier	text, vision		A strong open model for local use, especially good at coding for its size
DeepSeekDeepSeek	Reasoning	128K	$	text		Open reasoning at a fraction of the cost of closed reasoning models
MistralMistral AI	Open weights	128K	$	text, vision		Efficient European open models for building without heavy lock-in

Claude OpusFrontier

The hardest reasoning, long agentic tasks, and code that has to be right

Context 200KCost $$$$text · vision

Claude SonnetBalanced

The default for most work — reads code like a senior engineer at a fair price

Context 200KCost $$text · vision

Claude HaikuFast & cheap

High-volume, low-latency work where speed and cost matter most

Context 200KCost $text · vision

GPT-4oFrontier

A strong all-rounder with native voice and vision in one model

Context 128KCost $$$text · vision · audio

GPT-4o miniFast & cheap

Cheap, fast, high-volume tasks like extraction and classification

Context 128KCost $text · vision

OpenAI o-seriesReasoning

Multi-step reasoning, math, and problems that reward slow thinking

Context 128KCost $$$$text · vision

Gemini 2.5 ProFrontier

Huge-context work — whole codebases, long documents, video reasoning

Context 1MCost $$$text · vision · audio

Gemini 2.5 FlashFast & cheap

Prototyping and high-volume work — capable, fast, with a usable free tier

Context 1MFree tiertext · vision · audio

LlamaOpen weights

Running locally or self-hosting — private, free to run, no vendor lock-in

Context 128KFree tiertext · visionOpen weights

QwenOpen weights

A strong open model for local use, especially good at coding for its size

Context 128KFree tiertext · visionOpen weights

DeepSeekReasoning

Open reasoning at a fraction of the cost of closed reasoning models

Context 128KCost $textOpen weights

MistralOpen weights

Efficient European open models for building without heavy lock-in

Context 128KCost $text · visionOpen weights

Go deeper

The matrix is the quick answer. For the reasoning behind a pick, read Claude vs GPT-4o, Claude vs Gemini, or running open models locally. Every category above — Frontier, Balanced, Fast & cheap, Reasoning, Open weights — maps to a use case, not a leaderboard rank.

Embed this comparison

Free to use on your own site — paste this snippet where you want the live, auto-updating matrix to appear.

<iframe src="https://kapyn.app/embed/compare" width="100%" height="640" style="border:1px solid #222;border-radius:14px" title="AI Model Comparison by Kapyn" loading="lazy"></iframe>