Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

5 December 2017
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
A. Guez
Marc Lanctot
Laurent Sifre
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis

Papers citing "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"

50 / 839 papers shown
On Multi-Agent Learning in Team Sports Games
Yunqi Zhao
Igor Borovikov
J. Rupert
C. Somers
Ahmad Beirami
100
14
0
25 Jun 2019
Modern Deep Reinforcement Learning Algorithms
Sergey Ivanov
A. D'yakonov
OffRL
132
42
0
24 Jun 2019
Inductive general game playing
Machine-mediated learning (ML), 2019
Andrew Cropper
Richard Evans
Mark Law
AI4CE
211
33
0
23 Jun 2019
Defending Against Adversarial Examples with K-Nearest Neighbor
Chawin Sitawarin
David Wagner
AAML
174
29
0
23 Jun 2019
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination
International Conference on Machine Learning (ICML), 2019
Shauharda Khadka
Somdeb Majumdar
Santiago Miret
Stephen McAleer
Kagan Tumer
230
66
0
18 Jun 2019
Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning
Georgios Papoudakis
Filippos Christianos
Arrasy Rahman
Stefano V. Albrecht
179
213
0
11 Jun 2019
Planning With Uncertain Specifications (PUnS)
IEEE Robotics and Automation Letters (RA-L), 2019
Ankit J. Shah
Shen Li
J. Shah
177
25
0
07 Jun 2019
On the Generalization Gap in Reparameterizable Reinforcement Learning
International Conference on Machine Learning (ICML), 2019
Huan Wang
Stephan Zheng
Caiming Xiong
R. Socher
193
40
0
29 May 2019
CopyCAT: Taking Control of Neural Policies with Constant Attacks
Adaptive Agents and Multi-Agent Systems (AAMAS), 2019
Léonard Hussenot
Matthieu Geist
Olivier Pietquin
AAML
134
34
0
29 May 2019
LeTS-Drive: Driving in a Crowd by Learning from Tree Search
Panpan Cai
Yuanfu Luo
Aseem Saxena
David Hsu
Wee Sun Lee
118
31
0
29 May 2019
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Jeff Clune
357
131
0
27 May 2019
Adversarial Policies: Attacking Deep Reinforcement Learning
International Conference on Learning Representations (ICLR), 2019
Adam Gleave
Michael Dennis
Cody Wild
Neel Kant
Sergey Levine
Stuart J. Russell
AAML
298
396
0
25 May 2019
Ignorance-Aware Approaches and Algorithms for Prototype Selection in Machine Learning
V. Terziyan
A. Nikulin
38
4
0
15 May 2019
Benchmark and Survey of Automated Machine Learning Frameworks
Marc-André Zöller
Marco F. Huber
260
89
0
26 Apr 2019
Neural Path Planning: Fixed Time, Near-Optimal Path Generation via Oracle Imitation
M. J. Bency
A. H. Qureshi
Michael C. Yip
168
95
0
25 Apr 2019
On Learning to Prove
Daniel Huang
212
3
0
24 Apr 2019
Low-Memory Neural Network Training: A Technical Report
N. Sohoni
Christopher R. Aberger
Megan Leszczynski
Jian Zhang
Christopher Ré
229
110
0
24 Apr 2019
Deep learning investigation for chess player attention prediction using eye-tracking and game data
Justin Le Louëdec
Thomas Guntz
James L. Crowley
Dominique Vaufreydaz
99
18
0
17 Apr 2019
Deep Policies for Width-Based Planning in Pixel Domains
Miquel Junyent
Anders Jonsson
Vicenç Gómez
195
10
0
12 Apr 2019
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL
Yannis Flet-Berliac
Philippe Preux
302
2
0
08 Apr 2019
Creating Pro-Level AI for a Real-Time Fighting Game Using Deep Reinforcement Learning
In-Suk Oh
Seungeun Rho
Sangbin Moon
Seongho Son
Hyoil Lee
Jinyun Chung
214
62
0
08 Apr 2019
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Thomas W. Anthony
Robert Nishihara
Philipp Moritz
Tim Salimans
John Schulman
191
30
0
07 Apr 2019
Synthesized Policies for Transfer and Adaptation across Tasks and Environments
Hexiang Hu
Liyu Chen
Boqing Gong
Fei Sha
144
9
0
05 Apr 2019
Reducing catastrophic forgetting when evolving neural networks
Joseph Early
55
2
0
05 Apr 2019
A Local Approach to Forward Model Learning: Results on the Game of Life Game
Simon Lucas
Alexander Dockhorn
Vanessa Volz
Chris Bamford
Raluca D. Gaina
Ivan Bravi
Diego Perez-Liebana
Sanaz Mostaghim
R. Kruse
156
18
0
29 Mar 2019
Improved Reinforcement Learning with Curriculum
Joseph West
Frederic Maire
C. Browne
Akila Pemasiri
LRM
67
6
0
29 Mar 2019
Winning Isn't Everything: Enhancing Game Development with Intelligent Agents
Yunqi Zhao
Igor Borovikov
Fernando de Mesentier Silva
Ahmad Beirami
J. Rupert
...
Mohsen Sardari
Long Lin
S. Narravula
Navid Aghdaie
Kazi A. Zaman
223
47
0
25 Mar 2019
Single-step Options for Adversary Driving
Nazmus Sakib
Hengshuai Yao
Kuanqi Cai
Shangling Jui
138
2
0
20 Mar 2019
On the Robustness of Deep K-Nearest Neighbors
Chawin Sitawarin
David Wagner
AAML
OOD
176
62
0
20 Mar 2019
Hyper-Parameter Sweep on AlphaZero General
Hui Wang
M. Emmerich
Mike Preuss
Aske Plaat
112
16
0
19 Mar 2019
Truly Proximal Policy Optimization
Conference on Uncertainty in Artificial Intelligence (UAI), 2019
Yuhui Wang
Hao He
Chao Wen
Xiaoyang Tan
188
164
0
19 Mar 2019
Learning Self-Game-Play Agents for Combinatorial Optimization Problems
Ruiyang Xu
K. Lieberherr
AI4CE
79
12
0
08 Mar 2019
A cooperative game for automated learning of elasto-plasticity knowledge graphs and models with AI-guided experimentation
Kun Wang
WaiChing Sun
Q. Du
AI4CE
108
60
0
08 Mar 2019
Convergence of Multi-Agent Learning with a Finite Step Size in General-Sum Games
Xinliang Song
Tonghan Wang
Chongjie Zhang
103
13
0
07 Mar 2019
Towards Understanding Chinese Checkers with Heuristics, Monte Carlo Tree Search, and Deep Reinforcement Learning
Ziyu Liu
Meng Zhou
Weiqing Cao
Qiang Qu
H. W. F. Yeung
Yuk Ying Chung
114
4
0
05 Mar 2019
A Strongly Asymptotically Optimal Agent in General Environments
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Michael K. Cohen
Elliot Catt
Marcus Hutter
198
13
0
04 Mar 2019
Catalyst.RL: A Distributed Framework for Reproducible RL Research
Sergey Kolesnikov
Oleksii Hrinchuk
OffRL
90
8
0
28 Feb 2019
Coloring Big Graphs with AlphaGoZero
Jiayi Huang
Md. Mostofa Ali Patwary
G. Diamos
AI4CE
GNN
166
54
0
26 Feb 2019
Planning in Hierarchical Reinforcement Learning: Guarantees for Using Local Policies
International Conference on Algorithmic Learning Theory (ALT), 2019
Tom Zahavy
Avinatan Hassidim
Haim Kaplan
Yishay Mansour
OffRL
147
7
0
26 Feb 2019
Challenges for an Ontology of Artificial Intelligence
Scott H. Hawley
81
11
0
25 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
115
15
0
15 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
268
9
0
14 Feb 2019
Neural-Network Guided Expression Transformation
Romain Edelmann
Viktor Kunčak
75
1
0
06 Feb 2019
Neural Fictitious Self-Play on ELF Mini-RTS
Keigo Kawamura
Yoshimasa Tsuruoka
122
7
0
06 Feb 2019
Competitive Experience Replay
International Conference on Learning Representations (ICLR), 2019
Hao Liu
Alexander R. Trott
R. Socher
Caiming Xiong
OffRL
298
57
0
01 Feb 2019
The Hanabi Challenge: A New Frontier for AI Research
Artificial Intelligence (AI), 2019
Nolan Bard
Jakob N. Foerster
A. Chandar
Neil Burch
Marc Lanctot
...
Iain Dunning
Shibl Mourad
Hugo Larochelle
Marc G. Bellemare
Michael Bowling
LLMAG
434
389
0
01 Feb 2019
Learning Position Evaluation Functions Used in Monte Carlo Softmax Search
H. Igarashi
Yuichi Morioka
Kazumasa Yamamoto
32
0
0
30 Jan 2019
Trust Region-Guided Proximal Policy Optimization
Neural Information Processing Systems (NeurIPS), 2019
Yuhui Wang
Hao He
Xiaoyang Tan
Yaozhong Gan
OffRL
291
65
0
29 Jan 2019
Making Deep Q-learning methods robust to time discretization
Corentin Tallec
Léonard Blier
Yann Ollivier
OOD
OffRL
169
98
0
28 Jan 2019
Ablation Studies in Artificial Neural Networks
Richard Meyes
Melanie Lu
Constantin Waubert de Puiseau
Tobias Meisen
182
261
0
24 Jan 2019