Claude Opus 4.8 is a modest but tangible improvement among LLMs. It is more honest: it is four times less likely to overlook flaws in generated code and better at flagging its own uncertainty. This version also prioritizes abstaining on questions it is unsure about, which results in the lowest incorrect-answer rate on benchmarks.