kapynResearch

What happened after 2,000 people tried to hack my AI assistant

An AI assistant withstood 6,000 prompt injection attempts in a public challenge, highlighting advancements in model security. The experiment tested defenses against malicious email inputs, with the frontier model Opus proving remarkably resilient. While successful, the results suggest continued vigilance is necessary for production systems.

Simon Willison·Jun 26, 2026

Opening Kapyn…