A shared playbook for trustworthy third party evaluations
OpenAI offers a shared playbook for third-party AI evaluations. The guidance details how to assess model capabilities, safeguards, and validity for frontier AI systems, aiming to foster more robust and trustworthy AI development.