AI model comparison
Compare the major AI models side by side — Claude, GPT, Gemini, Llama, and more — by context window, cost, modalities, and what each is genuinely best for. Calm, sourced, and free.
Cost is shown as a tier, not a live price — model pricing moves often, so always confirm current rates on the provider's page before you commit. Last updated 2026-07-01.
| Model | Type | Context | Cost | Modalities | Open | Best for |
|---|---|---|---|---|---|---|
| Claude OpusAnthropic | Frontier | 200K | $$$$ | text, vision | — | The hardest reasoning, long agentic tasks, and code that has to be right |
| Claude SonnetAnthropic | Balanced | 200K | $$ | text, vision | — | The default for most work — reads code like a senior engineer at a fair price |
| Claude HaikuAnthropic | Fast & cheap | 200K | $ | text, vision | — | High-volume, low-latency work where speed and cost matter most |
| GPT-4oOpenAI | Frontier | 128K | $$$ | text, vision, audio | — | A strong all-rounder with native voice and vision in one model |
| GPT-4o miniOpenAI | Fast & cheap | 128K | $ | text, vision | — | Cheap, fast, high-volume tasks like extraction and classification |
| OpenAI o-seriesOpenAI | Reasoning | 128K | $$$$ | text, vision | — | Multi-step reasoning, math, and problems that reward slow thinking |
| Gemini 2.5 ProGoogle | Frontier | 1M | $$$ | text, vision, audio | — | Huge-context work — whole codebases, long documents, video reasoning |
| Gemini 2.5 FlashGoogle | Fast & cheap | 1M | Free tier | text, vision, audio | — | Prototyping and high-volume work — capable, fast, with a usable free tier |
| LlamaMeta | Open weights | 128K | Free tier | text, vision | Running locally or self-hosting — private, free to run, no vendor lock-in | |
| QwenAlibaba | Open weights | 128K | Free tier | text, vision | A strong open model for local use, especially good at coding for its size | |
| DeepSeekDeepSeek | Reasoning | 128K | $ | text | Open reasoning at a fraction of the cost of closed reasoning models | |
| MistralMistral AI | Open weights | 128K | $ | text, vision | Efficient European open models for building without heavy lock-in |
The hardest reasoning, long agentic tasks, and code that has to be right
The default for most work — reads code like a senior engineer at a fair price
High-volume, low-latency work where speed and cost matter most
A strong all-rounder with native voice and vision in one model
Cheap, fast, high-volume tasks like extraction and classification
Multi-step reasoning, math, and problems that reward slow thinking
Huge-context work — whole codebases, long documents, video reasoning
Prototyping and high-volume work — capable, fast, with a usable free tier
Running locally or self-hosting — private, free to run, no vendor lock-in
A strong open model for local use, especially good at coding for its size
Open reasoning at a fraction of the cost of closed reasoning models
Efficient European open models for building without heavy lock-in
Go deeper
The matrix is the quick answer. For the reasoning behind a pick, read Claude vs GPT-4o, Claude vs Gemini, or running open models locally. Every category above — Frontier, Balanced, Fast & cheap, Reasoning, Open weights — maps to a use case, not a leaderboard rank.
Embed this comparison
Free to use on your own site — paste this snippet where you want the live, auto-updating matrix to appear.
<iframe src="https://kapyn.app/embed/compare" width="100%" height="640" style="border:1px solid #222;border-radius:14px" title="AI Model Comparison by Kapyn" loading="lazy"></iframe>