Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.05903
Cited By
Towards Formal Definitions of Blameworthiness, Intention, and Moral Responsibility
13 October 2018
Joseph Y. Halpern
Max Kleiman-Weiner
XAI
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Towards Formal Definitions of Blameworthiness, Intention, and Moral Responsibility"
32 / 32 papers shown
Title
Measuring Goal-Directedness
Matt MacDermott
James Fox
Francesco Belardinelli
Tom Everitt
152
1
0
06 Dec 2024
Causal Responsibility Attribution for Human-AI Collaboration
Yahang Qi
Bernhard Schölkopf
Zhijing Jin
52
2
0
05 Nov 2024
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision Making
Stelios Triantafyllou
A. Sukovic
Yasaman Zolfimoselo
Goran Radanović
CML
102
0
0
16 Oct 2024
The Benefits of Power Regularization in Cooperative Reinforcement Learning
Michelle Li
Michael Dennis
74
3
0
17 Jun 2024
AI Sandbagging: Language Models can Strategically Underperform on Evaluations
Teun van der Weij
Felix Hofstätter
Ollie Jaffe
Samuel F. Brown
Francis Rhys Ward
ELM
89
31
0
11 Jun 2024
Robust agents learn causal world models
Jonathan G. Richens
Tom Everitt
OOD
182
46
0
16 Feb 2024
Honesty Is the Best Policy: Defining and Mitigating AI Deception
Francis Rhys Ward
Francesco Belardinelli
Francesca Toni
Tom Everitt
179
31
0
03 Dec 2023
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs
Stelios Triantafyllou
A. Sukovic
Debmalya Mandal
Goran Radanović
93
0
0
17 Oct 2023
SHAPE: A Framework for Evaluating the Ethicality of Influence
Elfia Bezou-Vrakatseli
Benedikt Brückner
Luke Thorburn
TDI
65
3
0
08 Sep 2023
Anticipating Responsibility in Multiagent Planning
T. Parker
Umberto Grandi
E. Lorini
38
4
0
31 Jul 2023
Analyzing Intentional Behavior in Autonomous Agents under Uncertainty
Filip Cano Córdoba
Samuel Judson
Timos Antonopoulos
Katrine Bjørner
Nicholas Shoemaker
Scott J. Shapiro
R. Piskac
Bettina Könighofer
44
3
0
04 Jul 2023
Experiments with Detecting and Mitigating AI Deception
Ismail Sahbane
Francis Rhys Ward
Henrik ˚Aslund
49
1
0
26 Jun 2023
Toward A Logical Theory Of Fairness and Bias
Vaishak Belle
FaML
140
1
0
08 Jun 2023
Partial Counterfactual Identification of Continuous Outcomes with a Curvature Sensitivity Model
Valentyn Melnychuk
Dennis Frauen
Stefan Feuerriegel
122
11
0
02 Jun 2023
Human Control: Definitions and Algorithms
Ryan Carey
Tom Everitt
68
7
0
31 May 2023
Towards Computationally Efficient Responsibility Attribution in Decentralized Partially Observable MDPs
Stelios Triantafyllou
Goran Radanović
62
5
0
24 Feb 2023
Reasoning about Causality in Games
Lewis Hammond
James Fox
Tom Everitt
Ryan Carey
Alessandro Abate
Michael Wooldridge
LRM
AI4CE
77
16
0
05 Jan 2023
Discovering Agents
Zachary Kenton
Ramana Kumar
Sebastian Farquhar
Jonathan G. Richens
Matt MacDermott
Tom Everitt
CML
119
31
0
17 Aug 2022
Moral reinforcement learning using actual causation
Tue Herlau
131
0
0
17 May 2022
Counterfactual harm
Jonathan G. Richens
R. Beard
Daniel H. Thompson
108
29
0
27 Apr 2022
Path-Specific Objectives for Safer Agent Incentives
Sebastian Farquhar
Ryan Carey
Tom Everitt
81
27
0
21 Apr 2022
Utility Functions for Human/Robot Interaction
Bruno Yun
Nir Oren
Madalina Croitoru
23
0
0
08 Apr 2022
On Blame Attribution for Accountable Multi-Agent Sequential Decision Making
Stelios Triantafyllou
Adish Singla
Goran Radanović
61
12
0
26 Jul 2021
Definitions of intent suitable for algorithms
Hal Ashton
59
18
0
08 Jun 2021
Extending counterfactual accounts of intent to include oblique intent
Hal Ashton
152
3
0
07 Jun 2021
Experiential AI
D. Hemment
R. Aylett
Vaishak Belle
Dave Murray-Rust
Ewa Luger
J. Hillston
Michael Rovatsos
F. Broz
54
13
0
06 Aug 2019
Efficiently Checking Actual Causality with SAT Solving
Amjad Ibrahim
Simon Rehwald
A. Pretschner
LRM
22
3
0
30 Apr 2019
Blameworthiness in Multi-Agent Settings
Meir Friedenberg
Joseph Y. Halpern
47
30
0
11 Mar 2019
The Limits of Morality in Strategic Games
Rui Cao
Pavel Naumov
18
0
0
22 Jan 2019
Knowledge and Blameworthiness
Pavel Naumov
Jia Tao
18
1
0
05 Nov 2018
Learning Tractable Probabilistic Models for Moral Responsibility and Blame
Lewis Hammond
Vaishak Belle
36
4
0
08 Oct 2018
Blameworthiness in Strategic Games
Pavel Naumov
Jia Tao
13
18
0
14 Sep 2018
1