Towards Formal Definitions of Blameworthiness, Intention, and Moral Responsibility

13 October 2018

Papers citing "Towards Formal Definitions of Blameworthiness, Intention, and Moral Responsibility"

32 / 32 papers shown

Title
Measuring Goal-Directedness Matt MacDermott James Fox Francesco Belardinelli Tom Everitt 152 1 0 06 Dec 2024
Causal Responsibility Attribution for Human-AI Collaboration Yahang Qi Bernhard Schölkopf Zhijing Jin 52 2 0 05 Nov 2024
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making Stelios Triantafyllou A. Sukovic Yasaman Zolfimoselo Goran Radanović CML 102 0 0 16 Oct 2024
The Benefits of Power Regularization in Cooperative Reinforcement Learning Michelle Li Michael Dennis 74 3 0 17 Jun 2024
AI Sandbagging: Language Models can Strategically Underperform on Evaluations Teun van der Weij Felix Hofstätter Ollie Jaffe Samuel F. Brown Francis Rhys Ward ELM 89 31 0 11 Jun 2024
Robust agents learn causal world models Jonathan G. Richens Tom Everitt OOD 182 46 0 16 Feb 2024
Honesty Is the Best Policy: Defining and Mitigating AI Deception Francis Rhys Ward Francesco Belardinelli Francesca Toni Tom Everitt 182 31 0 03 Dec 2023
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs Stelios Triantafyllou A. Sukovic Debmalya Mandal Goran Radanović 93 0 0 17 Oct 2023
SHAPE: A Framework for Evaluating the Ethicality of Influence Elfia Bezou-Vrakatseli Benedikt Brückner Luke Thorburn TDI 65 3 0 08 Sep 2023
Anticipating Responsibility in Multiagent Planning T. Parker Umberto Grandi E. Lorini 40 4 0 31 Jul 2023
Analyzing Intentional Behavior in Autonomous Agents under Uncertainty Filip Cano Córdoba Samuel Judson Timos Antonopoulos Katrine Bjørner Nicholas Shoemaker Scott J. Shapiro R. Piskac Bettina Könighofer 44 3 0 04 Jul 2023
Experiments with Detecting and Mitigating AI Deception Ismail Sahbane Francis Rhys Ward Henrik ˚Aslund 49 1 0 26 Jun 2023
Toward A Logical Theory Of Fairness and Bias Vaishak Belle FaML 140 1 0 08 Jun 2023
Partial Counterfactual Identification of Continuous Outcomes with a Curvature Sensitivity Model Valentyn Melnychuk Dennis Frauen Stefan Feuerriegel 122 11 0 02 Jun 2023
Human Control: Definitions and Algorithms Ryan Carey Tom Everitt 68 7 0 31 May 2023
Towards Computationally Efficient Responsibility Attribution in Decentralized Partially Observable MDPs Stelios Triantafyllou Goran Radanović 62 5 0 24 Feb 2023
Reasoning about Causality in Games Lewis Hammond James Fox Tom Everitt Ryan Carey Alessandro Abate Michael Wooldridge LRM AI4CE 77 16 0 05 Jan 2023
Discovering Agents Zachary Kenton Ramana Kumar Sebastian Farquhar Jonathan G. Richens Matt MacDermott Tom Everitt CML 119 31 0 17 Aug 2022
Moral reinforcement learning using actual causation Tue Herlau 133 0 0 17 May 2022
Counterfactual harm Jonathan G. Richens R. Beard Daniel H. Thompson 108 29 0 27 Apr 2022
Path-Specific Objectives for Safer Agent Incentives Sebastian Farquhar Ryan Carey Tom Everitt 81 27 0 21 Apr 2022
Utility Functions for Human/Robot Interaction Bruno Yun Nir Oren Madalina Croitoru 25 0 0 08 Apr 2022
On Blame Attribution for Accountable Multi-Agent Sequential Decision Making Stelios Triantafyllou Adish Singla Goran Radanović 63 12 0 26 Jul 2021
Definitions of intent suitable for algorithms Hal Ashton 59 18 0 08 Jun 2021
Extending counterfactual accounts of intent to include oblique intent Hal Ashton 152 3 0 07 Jun 2021
Experiential AI D. Hemment R. Aylett Vaishak Belle Dave Murray-Rust Ewa Luger J. Hillston Michael Rovatsos F. Broz 54 13 0 06 Aug 2019
Efficiently Checking Actual Causality with SAT Solving Amjad Ibrahim Simon Rehwald A. Pretschner LRM 24 3 0 30 Apr 2019
Blameworthiness in Multi-Agent Settings Meir Friedenberg Joseph Y. Halpern 49 30 0 11 Mar 2019
The Limits of Morality in Strategic Games Rui Cao Pavel Naumov 20 0 0 22 Jan 2019
Knowledge and Blameworthiness Pavel Naumov Jia Tao 20 1 0 05 Nov 2018
Learning Tractable Probabilistic Models for Moral Responsibility and Blame Lewis Hammond Vaishak Belle 38 4 0 08 Oct 2018
Blameworthiness in Strategic Games Pavel Naumov Jia Tao 15 18 0 14 Sep 2018