Agent-EvalKit is an open-source toolkit for systematic AI agent evaluation. It integrates with popular AI coding assistants and organizes evaluation into six phases, demonstrated here with a travel research agent built with the Strands Agents SDK and Amazon Bedrock.