Safety Case Template for Frontier AI: A Cyber Inability Argument

Frontier artificial intelligence (AI) systems pose increasing risks to society, making it essential for developers to provide assurances about their safety. One approach to offering such assurances is through a safety case: a structured, evidence-based argument aimed at demonstrating why the risk associated with a safety-critical system is acceptable. In this article, we propose a safety case template for offensive cyber capabilities. We illustrate how developers could argue that a model does not have capabilities posing unacceptable cyber risks by breaking down the main claim into progressively specific sub-claims, each supported by evidence. In our template, we identify a number of risk models, derive proxy tasks from the risk models, define evaluation settings for the proxy tasks, and connect those with evaluation results. Elements of current frontier safety techniques - such as risk models, proxy tasks, and capability evaluations - use implicit arguments for overall system safety. This safety case template integrates these elements using the Claims Arguments Evidence (CAE) framework in order to make safety arguments coherent and explicit. While uncertainties around the specifics remain, this template serves as a proof of concept, aiming to foster discussion on AI safety cases and advance AI assurance.

Read paper

Theme

AI Regulation

Date

November 12, 2024

author

s

Arthur Goemans, Marie Davidsen Buhl, Jonas Schuett, Tomek Korbak, Jessica Wang, Benjamin Hilton, Geoffrey Irving

Safety Case Template for Frontier AI: A Cyber Inability Argument

Theme

Date

author

s

Share

Research Summary

Footnotes

Further reading

Related publications

AI Regulation

Regulatory Supervision of Frontier AI Developers

March 2025

Research Paper

Peter Wills

AI Regulation

In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI

March 2025

Research Paper

Shayne Longpre et al., including Markus Anderljung

AI Regulation

On Regulating Downstream AI Developers

March 2025

Research Paper

Sophie Williams, Jonas Schuett, Markus Anderljung

AI Regulation

Regulatory Supervision of Frontier AI Developers

March 2025

Research Paper

Peter Wills

AI Regulation

In-House Evaluation Is Not Enough: Towards Robust Third-Party Flaw Disclosure for General-Purpose AI

March 2025

Research Paper

Shayne Longpre et al., including Markus Anderljung

AI Regulation

On Regulating Downstream AI Developers

March 2025

Research Paper

Sophie Williams, Jonas Schuett, Markus Anderljung

AI Regulation