Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.03543
Cited By
v1
v2
v3
v4
v5 (latest)
Logical Induction
12 September 2016
Scott Garrabrant
Tsvi Benson-Tilsen
Andrew Critch
N. Soares
Jessica Taylor
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Logical Induction"
10 / 10 papers shown
Title
The Alignment Problem from a Deep Learning Perspective
Richard Ngo
Lawrence Chan
Sören Mindermann
139
192
0
30 Aug 2022
Arguments about Highly Reliable Agent Designs as a Useful Path to Artificial Intelligence Safety
Issa Rice
David Manheim
43
0
0
09 Jan 2022
Temporal Inference with Finite Factored Sets
Scott Garrabrant
109
2
0
23 Sep 2021
AI Research Considerations for Human Existential Safety (ARCHES)
Andrew Critch
David M. Krueger
112
53
0
30 May 2020
Epistemic Phase Transitions in Mathematical Proofs
Scott Viteri
Simon DeDeo
LRM
41
8
0
31 Mar 2020
On Learning to Prove
Daniel Huang
44
3
0
24 Apr 2019
Embedded Agency
A. Demski
Scott Garrabrant
AIFin
115
34
0
25 Feb 2019
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
124
420
0
19 Nov 2018
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
79
116
0
03 May 2018
Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making
Andrew Critch
30
13
0
05 Jan 2017
1