2411.03336
Cited By
Towards evaluations-based safety cases for AI scheming
29 October 2024
Mikita Balesni
Marius Hobbhahn
David Lindner
Alexander Meinke
Tomek Korbak
Joshua Clymer
Buck Shlegeris
Jérémy Scheurer
Charlotte Stix
Rusheb Shah
Nicholas Goldowsky-Dill
Dan Braun
Bilal Chughtai
Owain Evans
Daniel Kokotajlo
Lucius Bushnaq
ELM
Papers citing
"Towards evaluations-based safety cases for AI scheming"
2 / 2 papers shown
Title
A Frontier AI Risk Management Framework: Bridging the Gap Between Current AI Practices and Established Risk Management
Simeon Campos
Henry Papadatos
Fabien Roger
Chloé Touzet
Malcolm Murray
Otter Quarks
73
2
0
20 Feb 2025
Safety case template for frontier AI: A cyber inability argument
Arthur Goemans
Marie Davidsen Buhl
Jonas Schuett
Tomek Korbak
Jessica Wang
Benjamin Hilton
Geoffrey Irving
56
15
0
12 Nov 2024