
Claude Opus 4.8: "a modest but tangible improvement"

Claude Opus 4.8 is a modest but tangible improvement over its predecessor. It is more honest: four times less likely to overlook flaws in generated code, and better at flagging uncertainty. It also prefers to abstain on questions it is unsure about, which gives it the lowest incorrect-answer rate on benchmarks.

Simon Willison·May 28, 2026
