Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

5 December 2017
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
A. Guez
Marc Lanctot
Laurent Sifre
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis

Papers citing "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"

39 / 839 papers shown
ML + FV = $\heartsuit$? A Survey on the Application of Machine Learning to Formal Verification
Moussa Amrani
L. Lucio
Adrien Bibal
170
5
0
10 Jun 2018
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro, LRM
607
557
0
07 Jun 2018
Re-evaluating Evaluation
David Balduzzi
K. Tuyls
Julien Perolat
T. Graepel
MoMe
229
112
0
07 Jun 2018
Model-free, Model-based, and General Intelligence
Hector Geffner
LRM, ELM
101
60
0
06 Jun 2018
Deep Pepper: Expert Iteration based Chess agent in the Reinforcement Learning Setting
Sai Krishna G.V.
Kyle Goyette
A. Chamseddine
Breandan Considine
126
3
0
02 Jun 2018
Between Progress and Potential Impact of AI: the Neglected Dimensions
Fernando Martínez-Plumed
S. Avin
Miles Brundage
Allan Dafoe
Seán Ó hÉigeartaigh
José Hernández-Orallo
169
5
0
02 Jun 2018
Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning
Ramtin Keramati
Jay Whang
Patrick Cho
Emma Brunskill
OffRL
259
7
0
01 Jun 2018
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
139
89
0
26 May 2018
A0C: Alpha Zero in Continuous Action Space
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
139
52
0
24 May 2018
Bandit-Based Monte Carlo Optimization for Nearest Neighbors
Vivek Bagaria
Tavor Z. Baharav
G. Kamath
David Tse
143
12
0
21 May 2018
Multiple-Step Greedy Policies in Online and Approximate Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
OffRL
227
14
0
21 May 2018
Reinforcement Learning of Theorem Proving
C. Kaliszyk
Josef Urban
Henryk Michalewski
Miroslav Olsák
130
155
0
19 May 2018
Solving the Rubik's Cube Without Human Knowledge
Stephen McAleer
Forest Agostinelli
Alexander Shmakov
Pierre Baldi
97
42
0
18 May 2018
Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization
Juan Cruz Barsce
J. Palombarini
E. Martínez
GP
121
34
0
12 May 2018
AGI Safety Literature Review
Tom Everitt
G. Lea
Marcus Hutter
AI4CE
166
126
0
03 May 2018
AI safety via debate
G. Irving
Paul Christiano
Dario Amodei
427
298
0
02 May 2018
The Sharer's Dilemma in Collective Adaptive Systems of Self-Interested Agents
Lenz Belzner
Kyrill Schmid
Thomy Phan
Thomas Gabor
M. Wirsing
85
4
0
28 Apr 2018
State Distribution-aware Sampling for Deep Q-learning
Neural Processing Letters (NPL), 2018
Weichao Li
Fuxian Huang
Xi Li
G. Pan
Leilei Gan
TTA
74
4
0
23 Apr 2018
Event Extraction with Generative Adversarial Imitation Learning
Tongtao Zhang
Heng Ji
GAN, OffRL
85
19
0
21 Apr 2018
A Study on Overfitting in Deep Reinforcement Learning
Chiyuan Zhang
Oriol Vinyals
Rémi Munos
Samy Bengio
OffRL, OnRL
215
419
0
18 Apr 2018
Feature-Based Aggregation and Deep Reinforcement Learning: A Survey and Some New Implementations
Dimitri Bertsekas
OffRL
246
137
0
12 Apr 2018
Programmatically Interpretable Reinforcement Learning
Abhinav Verma
V. Murali
Rishabh Singh
Pushmeet Kohli
Swarat Chaudhuri
400
380
0
06 Apr 2018
Automated Speed and Lane Change Decision Making using Deep Reinforcement Learning
International Conference on Intelligent Transportation Systems (ITSC), 2018
C. Hoel
Krister Wolff
L. Laine
158
190
0
14 Mar 2018
Hierarchical Reinforcement Learning: Approximating Optimal Discounted TSP Using Local Policies
Tom Zahavy
Avinatan Hassidim
Haim Kaplan
Yishay Mansour
108
1
0
13 Mar 2018
A Likelihood-Free Inference Framework for Population Genetic Data using Exchangeable Neural Networks
Jeffrey Chan
Valerio Perrone
J. Spence
Paul A. Jenkins
Sara Mathieson
Yun S. Song
450
120
0
16 Feb 2018
Monte Carlo Q-learning for General Game Playing
Hui Wang
M. Emmerich
Aske Plaat
GP
123
20
0
16 Feb 2018
From Gameplay to Symbolic Reasoning: Learning SAT Solver Heuristics in the Style of Alpha(Go) Zero
Fei Wang
Tiark Rompf
NAI
110
6
0
14 Feb 2018
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation
Dane S. Corneil
W. Gerstner
Johanni Brea
OffRL
144
63
0
12 Feb 2018
ProofWatch: Watchlist Guidance for Large Theories in E
Z. Goertzel
Jan Jakubuv
S. Schulz
Josef Urban
LRM
222
13
0
12 Feb 2018
Beyond the One Step Greedy Approach in Reinforcement Learning
Yonathan Efroni
Gal Dalal
B. Scherrer
Shie Mannor
OffRL
261
53
0
10 Feb 2018
Tunneling Neural Perception and Logic Reasoning through Abductive Learning
Wang-Zhou Dai
Qiu-Ling Xu
Yang Yu
Zhi Zhou
LRM, AI4CE
116
24
0
04 Feb 2018
Deep Reinforcement Learning using Capsules in Advanced Game Environments
Per-Arne Andersen
116
16
0
29 Jan 2018
Comparison Training for Computer Chinese Chess
Wen-Jie Tseng
Jr-Chang Chen
I-Chen Wu
Ting Han Wei
39
3
0
23 Jan 2018
Innateness, AlphaZero, and Artificial Intelligence
G. Marcus
112
76
0
17 Jan 2018
Building a Conversational Agent Overnight with Dialogue Self-Play
Pararth Shah
Dilek Z. Hakkani-Tür
Gokhan Tur
Abhinav Rastogi
Ankur Bapna
Neha Nayak Kennard
Larry Heck
210
205
0
15 Jan 2018
Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes
Igor Adamski
R. Adamski
T. Grel
Adam Jedrych
Kamil Kaczmarek
Henryk Michalewski
OffRL
181
37
0
09 Jan 2018
Adversarial Examples: Attacks and Defenses for Deep Learning
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2017
Xiaoyong Yuan
Pan He
Qile Zhu
Xiaolin Li
SILM, AAML
513
1,729
0
19 Dec 2017
Is prioritized sweeping the better episodic control?
Johanni Brea
85
8
0
20 Nov 2017
Exponential improvements for quantum-accessible reinforcement learning
Vedran Dunjko
Yi-Kai Liu
Xingyao Wu
Jacob M. Taylor
222
24
0
30 Oct 2017