ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.01815
  4. Cited By
Mastering Chess and Shogi by Self-Play with a General Reinforcement
  Learning Algorithm

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

5 December 2017
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
A. Guez
Marc Lanctot
Laurent Sifre
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
ArXivPDFHTML

Papers citing "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"

50 / 207 papers shown
Title
Learning Generalizable Visual Representations via Interactive Gameplay
Learning Generalizable Visual Representations via Interactive Gameplay
Luca Weihs
Aniruddha Kembhavi
Kiana Ehsani
Sarah M Pratt
Winson Han
Alvaro Herrasti
Eric Kolve
Dustin Schwenk
Roozbeh Mottaghi
Ali Farhadi
16
9
0
17 Dec 2019
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement
  Learning and Hierarchical Actions Filtering
DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering
Yuval Heffetz
Roman Vainshtein
Gilad Katz
L. Rokach
17
39
0
31 Oct 2019
Rethinking Cooperative Rationalization: Introspective Extraction and
  Complement Control
Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
Mo Yu
Shiyu Chang
Yang Zhang
Tommi Jaakkola
15
140
0
29 Oct 2019
Optimal Immunization Policy Using Dynamic Programming
Optimal Immunization Policy Using Dynamic Programming
A. Alaeddini
Daniel J. Klein
12
1
0
19 Oct 2019
On the Utility of Learning about Humans for Human-AI Coordination
On the Utility of Learning about Humans for Human-AI Coordination
Micah Carroll
Rohin Shah
Mark K. Ho
Thomas L. Griffiths
S. Seshia
Pieter Abbeel
Anca Dragan
HAI
11
378
0
13 Oct 2019
Discovery of Useful Questions as Auxiliary Tasks
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
9
86
0
10 Sep 2019
No Press Diplomacy: Modeling Multi-Agent Gameplay
No Press Diplomacy: Modeling Multi-Agent Gameplay
Philip Paquette
Yuchen Lu
Steven Bocco
Max O. Smith
Satya Ortiz-Gagné
Jonathan K. Kummerfeld
Satinder Singh
Joelle Pineau
Aaron Courville
25
57
0
04 Sep 2019
Playing a Strategy Game with Knowledge-Based Reinforcement Learning
Playing a Strategy Game with Knowledge-Based Reinforcement Learning
Viktor Voss
L. Nechepurenko
Rudi Schaefer
Steffen Bauer
22
5
0
15 Aug 2019
SentiMATE: Learning to play Chess through Natural Language Processing
SentiMATE: Learning to play Chess through Natural Language Processing
Isaac Kamlish
Isaac Bentata Chocron
Nicholas McCarthy
15
10
0
18 Jul 2019
General Board Game Playing for Education and Research in Generic AI Game
  Learning
General Board Game Playing for Education and Research in Generic AI Game Learning
W. Konen
LLMAG
14
25
0
11 Jul 2019
On Inductive Biases in Deep Reinforcement Learning
On Inductive Biases in Deep Reinforcement Learning
Matteo Hessel
H. V. Hasselt
Joseph Modayil
David Silver
AI4CE
25
41
0
05 Jul 2019
Growing Action Spaces
Growing Action Spaces
Gregory Farquhar
Laura Gustafson
Zeming Lin
Shimon Whiteson
Nicolas Usunier
Gabriel Synnaeve
14
37
0
28 Jun 2019
Generalization to Novel Objects using Prior Relational Knowledge
Generalization to Novel Objects using Prior Relational Knowledge
V. Vijay
Abhinav Ganesh
Hanlin Tang
Arjun K. Bansal
GNN
16
6
0
26 Jun 2019
Inductive general game playing
Inductive general game playing
Andrew Cropper
Richard Evans
Mark Law
AI4CE
18
27
0
23 Jun 2019
Planning With Uncertain Specifications (PUnS)
Planning With Uncertain Specifications (PUnS)
Ankit J. Shah
Shen Li
J. Shah
8
25
0
07 Jun 2019
Adversarial Policies: Attacking Deep Reinforcement Learning
Adversarial Policies: Attacking Deep Reinforcement Learning
Adam Gleave
Michael Dennis
Cody Wild
Neel Kant
Sergey Levine
Stuart J. Russell
AAML
27
348
0
25 May 2019
Ignorance-Aware Approaches and Algorithms for Prototype Selection in
  Machine Learning
Ignorance-Aware Approaches and Algorithms for Prototype Selection in Machine Learning
V. Terziyan
A. Nikulin
14
4
0
15 May 2019
Benchmark and Survey of Automated Machine Learning Frameworks
Benchmark and Survey of Automated Machine Learning Frameworks
Marc Zoller
Marco F. Huber
25
86
0
26 Apr 2019
Deep Policies for Width-Based Planning in Pixel Domains
Deep Policies for Width-Based Planning in Pixel Domains
Miquel Junyent
Anders Jonsson
Vicencc Gómez
28
10
0
12 Apr 2019
Policy Gradient Search: Online Planning and Expert Iteration without
  Search Trees
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Thomas W. Anthony
Robert Nishihara
Philipp Moritz
Tim Salimans
John Schulman
12
30
0
07 Apr 2019
A Local Approach to Forward Model Learning: Results on the Game of Life
  Game
A Local Approach to Forward Model Learning: Results on the Game of Life Game
Simon Lucas
Alexander Dockhorn
Vanessa Volz
Chris Bamford
Raluca D. Gaina
Ivan Bravi
Diego Perez-Liebana
Sanaz Mostaghim
R. Kruse
18
17
0
29 Mar 2019
A cooperative game for automated learning of elasto-plasticity knowledge
  graphs and models with AI-guided experimentation
A cooperative game for automated learning of elasto-plasticity knowledge graphs and models with AI-guided experimentation
Kun Wang
WaiChing Sun
Q. Du
AI4CE
23
56
0
08 Mar 2019
Towards Understanding Chinese Checkers with Heuristics, Monte Carlo Tree
  Search, and Deep Reinforcement Learning
Towards Understanding Chinese Checkers with Heuristics, Monte Carlo Tree Search, and Deep Reinforcement Learning
Ziyu Liu
Meng Zhou
Weiqing Cao
Qiang Qu
H. W. F. Yeung
Yuk Ying Chung
13
4
0
05 Mar 2019
Coloring Big Graphs with AlphaGoZero
Coloring Big Graphs with AlphaGoZero
Jiayi Huang
Md. Mostofa Ali Patwary
G. Diamos
AI4CE
GNN
12
49
0
26 Feb 2019
Planning in Hierarchical Reinforcement Learning: Guarantees for Using
  Local Policies
Planning in Hierarchical Reinforcement Learning: Guarantees for Using Local Policies
Tom Zahavy
Avinatan Hassidim
Haim Kaplan
Yishay Mansour
OffRL
17
7
0
26 Feb 2019
Challenges for an Ontology of Artificial Intelligence
Challenges for an Ontology of Artificial Intelligence
Scott H. Hawley
11
11
0
25 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy
  Observations
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
23
9
0
15 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
11
9
0
14 Feb 2019
Learning Position Evaluation Functions Used in Monte Carlo Softmax
  Search
Learning Position Evaluation Functions Used in Monte Carlo Softmax Search
H. Igarashi
Yuichi Morioka
Kazumasa Yamamoto
11
0
0
30 Jan 2019
The Oracle of DLphi
The Oracle of DLphi
Dominik Alfke
W. Baines
J. Blechschmidt
Mauricio J. del Razo Sarmina
Amnon Drory
...
L. Thesing
Philipp Trunschke
Johannes von Lindheim
David Weber
Melanie Weber
32
0
0
17 Jan 2019
Malthusian Reinforcement Learning
Malthusian Reinforcement Learning
Joel Z. Leibo
Julien Perolat
Edward Hughes
S. Wheelwright
Adam H. Marblestone
Edgar A. Duénez-Guzmán
P. Sunehag
Iain Dunning
T. Graepel
AI4CE
16
37
0
17 Dec 2018
Visual Foresight: Model-Based Deep Reinforcement Learning for
  Vision-Based Robotic Control
Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control
F. Ebert
Chelsea Finn
Sudeep Dasari
Annie Xie
Alex X. Lee
Sergey Levine
SSL
29
377
0
03 Dec 2018
On the Complexity of Reconnaissance Blind Chess
On the Complexity of Reconnaissance Blind Chess
Jared Markowitz
Matthieu de Rochemonteix
Ashley J. Llorens
14
11
0
07 Nov 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent
  Environments
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael H. Bowling
6
148
0
21 Oct 2018
Supervising strong learners by amplifying weak experts
Supervising strong learners by amplifying weak experts
Paul Christiano
Buck Shlegeris
Dario Amodei
11
114
0
19 Oct 2018
Transfer Learning versus Multi-agent Learning regarding Distributed
  Decision-Making in Highway Traffic
Transfer Learning versus Multi-agent Learning regarding Distributed Decision-Making in Highway Traffic
Mark Schutera
Niklas Goby
Dirk Neumann
Markus Reischl
21
5
0
19 Oct 2018
Reinforcement Learning Decoders for Fault-Tolerant Quantum Computation
Reinforcement Learning Decoders for Fault-Tolerant Quantum Computation
R. Sweke
Markus S. Kesselring
E. van Nieuwenburg
Jens Eisert
AI4CE
LRM
11
107
0
16 Oct 2018
The 30-Year Cycle In The AI Debate
The 30-Year Cycle In The AI Debate
J. Chauvet
19
8
0
08 Oct 2018
SAI, a Sensible Artificial Intelligence that plays Go
SAI, a Sensible Artificial Intelligence that plays Go
F. Morandin
G. Amato
R. Gini
C. Metta
Maurizio Parton
G. Pascutto
LLMAG
16
13
0
11 Sep 2018
ViZDoom Competitions: Playing Doom from Pixels
ViZDoom Competitions: Playing Doom from Pixels
Marek Wydmuch
Michal Kempka
Wojciech Ja'skowski
24
119
0
10 Sep 2018
Adaptive Behavior Generation for Autonomous Driving using Deep
  Reinforcement Learning with Compact Semantic States
Adaptive Behavior Generation for Autonomous Driving using Deep Reinforcement Learning with Compact Semantic States
Peter Wolf
Karl Kurzer
Tobias Wingert
Florian Kuhnt
Johann Marius Zöllner
30
55
0
10 Sep 2018
A proof that artificial neural networks overcome the curse of
  dimensionality in the numerical approximation of Black-Scholes partial
  differential equations
A proof that artificial neural networks overcome the curse of dimensionality in the numerical approximation of Black-Scholes partial differential equations
Philipp Grohs
F. Hornung
Arnulf Jentzen
Philippe von Wurstemberger
9
167
0
07 Sep 2018
Improving Hearthstone AI by Combining MCTS and Supervised Learning
  Algorithms
Improving Hearthstone AI by Combining MCTS and Supervised Learning Algorithms
M. Świechowski
T. Tajmajer
Andrzej Janusz
BDL
57
59
0
14 Aug 2018
Learning to Drive in a Day
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
42
641
0
01 Jul 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
22
212
0
20 Jun 2018
ML + FV = $\heartsuit$? A Survey on the Application of Machine Learning
  to Formal Verification
ML + FV = ♡\heartsuit♡? A Survey on the Application of Machine Learning to Formal Verification
Moussa Amrani
L. Lucio
Adrien Bibal
20
5
0
10 Jun 2018
Re-evaluating Evaluation
Re-evaluating Evaluation
David Balduzzi
K. Tuyls
Julien Perolat
T. Graepel
MoMe
16
96
0
07 Jun 2018
Fast Exploration with Simplified Models and Approximately Optimistic
  Planning in Model Based Reinforcement Learning
Fast Exploration with Simplified Models and Approximately Optimistic Planning in Model Based Reinforcement Learning
Ramtin Keramati
Jay Whang
Patrick Cho
Emma Brunskill
OffRL
21
7
0
01 Jun 2018
Fast Policy Learning through Imitation and Reinforcement
Fast Policy Learning through Imitation and Reinforcement
Ching-An Cheng
Xinyan Yan
Nolan Wagener
Byron Boots
11
83
0
26 May 2018
Bandit-Based Monte Carlo Optimization for Nearest Neighbors
Bandit-Based Monte Carlo Optimization for Nearest Neighbors
Vivek Bagaria
Tavor Z. Baharav
G. Kamath
David Tse
11
12
0
21 May 2018
Previous
12345
Next