Microsoft Research clarifies findings on AI delegation and long-horizon reliability. The post addresses the implications of their recent paper on how LLMs can impact document integrity in delegated workflows, emphasizing the ongoing development of robust evaluation methods.
Opening Kapyn…