Title
What Is AI Safety? What Do We Want It to Be? Jacqueline Harding Cameron Domenico Kirk-Giannini 48 0 0 05 May 2025
A Frontier AI Risk Management Framework: Bridging the Gap Between Current AI Practices and Established Risk Management Simeon Campos Henry Papadatos Fabien Roger Chloé Touzet Malcolm Murray Otter Quarks 66 2 0 20 Feb 2025
Can Safety Fine-Tuning Be More Principled? Lessons Learned from Cybersecurity David Williams-King Linh Le Adam Oberman Yoshua Bengio AAML 41 0 0 19 Jan 2025
Agentic Information Retrieval Weinan Zhang Junwei Liao Ning Li Kounianhua Du Jianghao Lin AIFin 41 2 0 13 Oct 2024
Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond Shanshan Han 55 1 0 09 Oct 2024
Safeguarding AI Agents: Developing and Analyzing Safety Architectures Ishaan Domkundwar Mukunda N S Ishaan Bhola Riddhik Kochhar LLMAG 29 1 0 03 Sep 2024
Non-maximizing policies that fulfill multi-criterion aspirations in expectation Simon Dima Simon Fischer J. Heitzig Joss Oliver 18 1 0 08 Aug 2024
Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acacias Waqar Hussain 30 0 0 16 Jul 2024
Generative AI Systems: A Systems-based Perspective on Generative AI Jakub M. Tomczak 35 1 0 25 Jun 2024
Securing the Future of GenAI: Policy and Technology Mihai Christodorescu Craven S. Feizi Neil Zhenqiang Gong Mia Hoffmann ... Jessica Newman Emelia Probasco Yanjun Qi Khawaja Shams Turek SILM 18 3 0 21 May 2024
The Consensus Game: Language Model Generation via Equilibrium Search Athul Paul Jacob Yikang Shen Gabriele Farina Jacob Andreas 31 19 0 13 Oct 2023
Interpretability of Machine Learning: Recent Advances and Future Prospects Lei Gao L. Guan AAML 36 29 0 30 Apr 2023
Autoformalization with Large Language Models Yuhuai Wu Albert Q. Jiang Wenda Li M. Rabe Charles Staats M. Jamnik Christian Szegedy AI4CE 108 107 0 25 May 2022
Introduction to Neural Network Verification Aws Albarghouthi AAML 45 85 0 21 Sep 2021
A Survey on Neural Network Interpretability Yu Zhang Peter Tiño A. Leonardis K. Tang FaML XAI 126 494 0 28 Dec 2020
Formal Scenario-Based Testing of Autonomous Vehicles: From Simulation to the Real World Daniel J. Fremont Edward Kim Yash Vardhan Pant S. Seshia Atul Acharya Xantha Bruso Paul Wells Steve Lemke Q. Lu Shalin Mehta 68 122 0 17 Mar 2020