Microsoft Research introduces SocialReasoning-Bench, a new benchmark for evaluating AI agent alignment with user interests. Tests reveal AI agents consistently act competently but struggle to optimize user outcomes even when explicitly instructed, highlighting a critical gap in agent design for user benefit.
Opening Kapyn…