1901.00064
v3 (latest)
Impossibility and Uncertainty Theorems in AI Value Alignment (or why your AGI should not have a utility function)
31 December 2018
P. Eckersley
Papers citing "Impossibility and Uncertainty Theorems in AI Value Alignment (or why your AGI should not have a utility function)"
15 / 15 papers shown
Title
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs
Zhaowei Zhang
Fengshuo Bai
Qizhi Chen
Chengdong Ma
Mingzhi Wang
Haoran Sun
Zilong Zheng
Yaodong Yang
173
5
0
26 Feb 2025
Value Preferences Estimation and Disambiguation in Hybrid Participatory Systems
Enrico Liscio
Luciano Cavalcante Siebert
Catholijn M. Jonker
P. Murukannaiah
105
5
0
26 Feb 2024
Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities
Xudong Shen
H. Brown
Jiashu Tao
Martin Strobel
Yao Tong
Akshay Narayan
Harold Soh
Finale Doshi-Velez
98
3
0
22 Jun 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
Chao Yu
Jiaxuan Gao
Weiling Liu
Bo Xu
Hao Tang
Jiaqi Yang
Yu Wang
Yi Wu
109
42
0
03 Feb 2023
Aligned with Whom? Direct and social goals for AI systems
Anton Korinek
Avital Balwit
58
11
0
09 May 2022
Impossibility Results in AI: A Survey
Mario Brčič
Roman V. Yampolskiy
107
25
0
01 Sep 2021
Hard Choices in Artificial Intelligence
Roel Dobbe
T. Gilbert
Yonatan Dov Mintz
73
58
0
10 Jun 2021
Consequences of Misaligned AI
Simon Zhuang
Dylan Hadfield-Menell
77
75
0
07 Feb 2021
On Controllability of AI
Roman V. Yampolskiy
55
14
0
19 Jul 2020
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences
Daniel S. Brown
Russell Coleman
R. Srinivasan
S. Niekum
BDL
127
102
0
21 Feb 2020
Hard Choices in Artificial Intelligence: Addressing Normative Uncertainty through Sociotechnical Commitments
Roel Dobbe
T. Gilbert
Yonatan Dov Mintz
61
18
0
20 Nov 2019
Requisite Variety in Ethical Utility Functions for AI Value Alignment
Nadisha-Marie Aliman
L. Kester
54
10
0
30 Jun 2019
Unpredictability of AI
Roman V. Yampolskiy
64
30
0
29 May 2019
Augmented Utilitarianism for AGI Safety
Nadisha-Marie Aliman
L. Kester
57
9
0
02 Apr 2019
The Ethics of AI Ethics -- An Evaluation of Guidelines
Thilo Hagendorff
AI4TS
87
1,216
0
28 Feb 2019