A shared playbook for trustworthy third party evaluations
OpenAI publishes a shared playbook for trustworthy third-party AI evaluations. This guidance details how to assess frontier AI model capabilities, safeguards, and validity, offering developers a framework for robust assessment.