Evaluate AI agents systematically with Agent-EvalKit
Agent-EvalKit is an open-source toolkit for systematic AI agent evaluation. It integrates with popular AI coding assistants, offering six distinct evaluation phases to help developers benchmark agent performance.