Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

10 May 2024
David Dalrymple
Joar Skalse
Yoshua Bengio
Stuart J. Russell
Max Tegmark
S. Seshia
Steve Omohundro
Christian Szegedy
Ben Goldhaber
Nora Ammann
Alessandro Abate
Joe Halpern
Clark Barrett
Ding Zhao
Zhi-Xuan Tan
Jeannette Wing
Joshua Tenenbaum

Papers citing "Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems"

16 / 16 papers shown
What Is AI Safety? What Do We Want It to Be?
Jacqueline Harding
Cameron Domenico Kirk-Giannini
48
0
0
05 May 2025
A Frontier AI Risk Management Framework: Bridging the Gap Between Current AI Practices and Established Risk Management
Simeon Campos
Henry Papadatos
Fabien Roger
Chloé Touzet
Malcolm Murray
Otter Quarks
66
2
0
20 Feb 2025
Can Safety Fine-Tuning Be More Principled? Lessons Learned from Cybersecurity
David Williams-King
Linh Le
Adam Oberman
Yoshua Bengio
AAML
41
0
0
19 Jan 2025
Agentic Information Retrieval
Weinan Zhang
Junwei Liao
Ning Li
Kounianhua Du
Jianghao Lin
AIFin
41
2
0
13 Oct 2024
Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond
Shanshan Han
55
1
0
09 Oct 2024
Safeguarding AI Agents: Developing and Analyzing Safety Architectures
Ishaan Domkundwar
Mukunda N S
Ishaan Bhola
Riddhik Kochhar
LLMAG
29
1
0
03 Sep 2024
Non-maximizing policies that fulfill multi-criterion aspirations in expectation
Simon Dima
Simon Fischer
J. Heitzig
Joss Oliver
18
1
0
08 Aug 2024
Thorns and Algorithms: Navigating Generative AI Challenges Inspired by Giraffes and Acacias
Waqar Hussain
30
0
0
16 Jul 2024
Generative AI Systems: A Systems-based Perspective on Generative AI
Jakub M. Tomczak
35
1
0
25 Jun 2024
Securing the Future of GenAI: Policy and Technology
Mihai Christodorescu
Craven
S. Feizi
Neil Zhenqiang Gong
Mia Hoffmann
...
Jessica Newman
Emelia Probasco
Yanjun Qi
Khawaja Shams
Turek
SILM
18
3
0
21 May 2024
The Consensus Game: Language Model Generation via Equilibrium Search
Athul Paul Jacob
Yikang Shen
Gabriele Farina
Jacob Andreas
31
19
0
13 Oct 2023
Interpretability of Machine Learning: Recent Advances and Future Prospects
Lei Gao
L. Guan
AAML
36
29
0
30 Apr 2023
Autoformalization with Large Language Models
Yuhuai Wu
Albert Q. Jiang
Wenda Li
M. Rabe
Charles Staats
M. Jamnik
Christian Szegedy
AI4CE
108
107
0
25 May 2022
Introduction to Neural Network Verification
Aws Albarghouthi
AAML
45
85
0
21 Sep 2021
A Survey on Neural Network Interpretability
Yu Zhang
Peter Tiño
A. Leonardis
K. Tang
FaML
XAI
126
494
0
28 Dec 2020
Formal Scenario-Based Testing of Autonomous Vehicles: From Simulation to the Real World
Daniel J. Fremont
Edward Kim
Yash Vardhan Pant
S. Seshia
Atul Acharya
Xantha Bruso
Paul Wells
Steve Lemke
Q. Lu
Shalin Mehta
68
122
0
17 Mar 2020