Logical Induction

v1v2v3v4v5 (latest)

Logical Induction

12 September 2016

Scott Garrabrant

Tsvi Benson-Tilsen

ArXiv (abs)PDF HTML

Papers citing "Logical Induction"

10 / 10 papers shown

Title
The Alignment Problem from a Deep Learning Perspective Richard Ngo Lawrence Chan Sören Mindermann 139 192 0 30 Aug 2022
Arguments about Highly Reliable Agent Designs as a Useful Path to Artificial Intelligence Safety Issa Rice David Manheim 43 0 0 09 Jan 2022
Temporal Inference with Finite Factored Sets Scott Garrabrant 109 2 0 23 Sep 2021
AI Research Considerations for Human Existential Safety (ARCHES) Andrew Critch David M. Krueger 112 53 0 30 May 2020
Epistemic Phase Transitions in Mathematical Proofs Scott Viteri Simon DeDeo LRM 41 8 0 31 Mar 2020
On Learning to Prove Daniel Huang 44 3 0 24 Apr 2019
Embedded Agency A. Demski Scott Garrabrant AIFin 115 34 0 25 Feb 2019
Scalable agent alignment via reward modeling: a research direction Jan Leike David M. Krueger Tom Everitt Miljan Martic Vishal Maini Shane Legg 124 420 0 19 Nov 2018
AGI Safety Literature Review Tom Everitt G. Lea Marcus Hutter AI4CE 79 116 0 03 May 2018
Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making Andrew Critch 30 13 0 05 Jan 2017