Awesome-evals is a new curated library of resources for building and evaluating AI agents. The list includes papers, blogs, talks, tools, and benchmarks, aiming to provide developers with essential materials. It's maintained by BenchFlow and has gained 150 GitHub stars since its launch.
Opening Kapyn…