arXiv: 2505.18003
An Example Safety Case for Safeguards Against Misuse

23 May 2025
Joshua Clymer
Jonah Weinbaum
Robert Kirk
Kimberly Mai
Selena Zhang
Xander Davies

Papers citing "An Example Safety Case for Safeguards Against Misuse"

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
Mrinank Sharma
Meg Tong
Jesse Mu
Jerry Wei
Jorrit Kruthoff
...
Ruiqi Zhong
Giulio Zhou
Jan Leike
Jared Kaplan
Ethan Perez
429
96
0
31 Jan 2025
A sketch of an AI control safety case
Tomek Korbak
Joshua Clymer
Benjamin Hilton
Buck Shlegeris
Geoffrey Irving
364
20
0
28 Jan 2025