arXiv:2501.17315
A sketch of an AI control safety case
28 January 2025
Tomek Korbak
Joshua Clymer
Benjamin Hilton
Buck Shlegeris
Geoffrey Irving
Papers citing "A sketch of an AI control safety case"
3 / 3 papers shown
An alignment safety case sketch based on debate
Marie Davidsen Buhl
Jacob Pfau
Benjamin Hilton
Geoffrey Irving
19
0
0
06 May 2025
How to evaluate control measures for LLM agents? A trajectory from today to superintelligence
Tomek Korbak
Mikita Balesni
Buck Shlegeris
Geoffrey Irving
ELM
24
1
0
07 Apr 2025
A Frontier AI Risk Management Framework: Bridging the Gap Between Current AI Practices and Established Risk Management
Simeon Campos
Henry Papadatos
Fabien Roger
Chloé Touzet
Malcolm Murray
Otter Quarks
73
2
0
20 Feb 2025