Agent-EvalKit is an open-source toolkit for systematic AI agent evaluation. It integrates with popular AI coding assistants and provides six distinct evaluation phases, simplifying the process for developers to benchmark agent performance.
Opening Kapyn…