2505.18003
An Example Safety Case for Safeguards Against Misuse
23 May 2025
Joshua Clymer
Jonah Weinbaum
Robert Kirk
Kimberly Mai
Selena Zhang
Xander Davies
Papers citing "An Example Safety Case for Safeguards Against Misuse"
2 / 2 papers shown
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
Mrinank Sharma
Meg Tong
Jesse Mu
Jerry Wei
Jorrit Kruthoff
...
Ruiqi Zhong
Giulio Zhou
Jan Leike
Jared Kaplan
Ethan Perez
429
96
0
31 Jan 2025
A sketch of an AI control safety case
Tomek Korbak
Joshua Clymer
Benjamin Hilton
Buck Shlegeris
Geoffrey Irving
364
20
0
28 Jan 2025