Impossibility and Uncertainty Theorems in AI Value Alignment (or why
your AGI should not have a utility function)

v1v2v3 (latest)

Impossibility and Uncertainty Theorems in AI Value Alignment (or why your AGI should not have a utility function)

31 December 2018

ArXiv (abs)PDF HTML

Papers citing "Impossibility and Uncertainty Theorems in AI Value Alignment (or why your AGI should not have a utility function)"

15 / 15 papers shown

Title
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs Zhaowei Zhang Fengshuo Bai Qizhi Chen Chengdong Ma Mingzhi Wang Haoran Sun Zilong Zheng Yaodong Yang 173 5 0 26 Feb 2025
Value Preferences Estimation and Disambiguation in Hybrid Participatory Systems Enrico Liscio Luciano Cavalcante Siebert Catholijn M. Jonker P. Murukannaiah 105 5 0 26 Feb 2024
Towards Regulatable AI Systems: Technical Gaps and Policy Opportunities Xudong Shen H. Brown Jiashu Tao Martin Strobel Yao Tong Akshay Narayan Harold Soh Finale Doshi-Velez 98 3 0 22 Jun 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased Chao Yu Jiaxuan Gao Weiling Liu Bo Xu Hao Tang Jiaqi Yang Yu Wang Yi Wu 109 42 0 03 Feb 2023
Aligned with Whom? Direct and social goals for AI systems Anton Korinek Avital Balwit 58 11 0 09 May 2022
Impossibility Results in AI: A Survey Mario Brčič Roman V. Yampolskiy 107 25 0 01 Sep 2021
Hard Choices in Artificial Intelligence Roel Dobbe T. Gilbert Yonatan Dov Mintz 73 58 0 10 Jun 2021
Consequences of Misaligned AI Simon Zhuang Dylan Hadfield-Menell 77 75 0 07 Feb 2021
On Controllability of AI Roman V. Yampolskiy 55 14 0 19 Jul 2020
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences Daniel S. Brown Russell Coleman R. Srinivasan S. Niekum BDL 127 102 0 21 Feb 2020
Hard Choices in Artificial Intelligence: Addressing Normative Uncertainty through Sociotechnical Commitments Roel Dobbe T. Gilbert Yonatan Dov Mintz 61 18 0 20 Nov 2019
Requisite Variety in Ethical Utility Functions for AI Value Alignment Nadisha-Marie Aliman L. Kester 54 10 0 30 Jun 2019
Unpredictability of AI Roman V. Yampolskiy 64 30 0 29 May 2019
Augmented Utilitarianism for AGI Safety Nadisha-Marie Aliman L. Kester 57 9 0 02 Apr 2019
The Ethics of AI Ethics -- An Evaluation of Guidelines Thilo Hagendorff AI4TS 87 1,216 0 28 Feb 2019