A Grading Rubric for AI Safety Frameworks

Over the past year, artificial intelligence (AI) companies have increasingly adopted AI safety frameworks. These frameworks outline how companies intend to keep the potential risks associated with developing and deploying frontier AI systems at an acceptable level. Major players like Anthropic, OpenAI, and Google DeepMind have already published their frameworks, while another 13 companies have signaled their intent to release similar frameworks by February 2025. Given their central role in AI companies' efforts to identify and address unacceptable risks from their systems, AI safety frameworks warrant significant scrutiny. To enable governments, academia, and civil society to pass judgment on these frameworks, this paper proposes a grading rubric. The rubric consists of seven evaluation criteria and 21 indicators that concretize the criteria. Each criterion can be graded on a scale from A (gold standard) to F (substandard). The paper also suggests three methods for applying the rubric: surveys, Delphi studies, and audits. The purpose of the grading rubric is to enable nuanced comparisons between frameworks, identify potential areas for improvement, and promote a race to the top in responsible AI development.
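The rubric's basic structure, as described above, is a set of seven criteria, each concretized by indicators and graded on an A-to-F scale. A minimal sketch of how such an assessment could be represented is shown below; this is an illustrative assumption, not the paper's own tooling, and the criterion name, indicator strings, and class names are hypothetical placeholders.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from enum import Enum


class Grade(Enum):
    # The paper specifies A as "gold standard" and F as "substandard";
    # the intermediate letters are listed here without descriptions.
    A = "A"
    B = "B"
    C = "C"
    D = "D"
    E = "E"
    F = "F"


@dataclass
class Criterion:
    """One of the seven evaluation criteria, concretized by its indicators."""
    name: str
    indicators: list[str] = field(default_factory=list)
    grade: Grade | None = None


@dataclass
class FrameworkAssessment:
    """A graded assessment of a single company's AI safety framework."""
    company: str
    criteria: list[Criterion]

    def summary(self) -> dict[str, str]:
        # Map each criterion to its assigned letter grade (or "ungraded").
        return {c.name: (c.grade.name if c.grade else "ungraded") for c in self.criteria}


# Hypothetical usage: the criterion, indicators, and grade below are illustrative only.
assessment = FrameworkAssessment(
    company="Example frontier AI developer",
    criteria=[
        Criterion(
            name="Example criterion",
            indicators=["indicator 1", "indicator 2"],
            grade=Grade.B,
        )
    ],
)
print(assessment.summary())  # {'Example criterion': 'B'}
```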
