kapynOpen Source

Evaluate AI agents systematically with Agent-EvalKit

Agent-EvalKit is an open-source toolkit for systematic AI agent evaluation. It integrates with popular AI coding assistants, offering six distinct evaluation phases to help developers benchmark agent performance.

AWS ML Blog·Jun 11, 2026

Opening Kapyn…