Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

5 December 2017

David Silver

Papers citing "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm"

50 / 839 papers shown

Accelerating Monte Carlo Tree Search with Probability Tree State AbstractionNeural Information Processing Systems (NeurIPS), 2023

186

10 Oct 2023

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

312

10 Oct 2023

Learning Interactive Real-World SimulatorsInternational Conference on Learning Representations (ICLR), 2023

Pieter Abbeel

345

334

09 Oct 2023

Hierarchical Reinforcement Learning for Temporal Pattern Prediction

Faith Johnson

Kristin J. Dana

09 Oct 2023

Multi-timestep models for Model-based Reinforcement Learning

220

09 Oct 2023

"A Nova Eletricidade: Aplicações, Riscos e Tendências da IA Moderna -- "The New Electricity": Applications, Risks, and Trends in Current AI

Mariana Recamonde Mendoza

T. L. T. D. Silveira

V. P. Moreira

152

08 Oct 2023

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language ModelsInternational Conference on Machine Learning (ICML), 2023

Xiaoxiao Sun

Yang Yang

Michal Shlapentokh-Rothman

Haohan Wang

Yu-Xiong Wang

LRM AI4CE LM&Ro LLMAG

439

324

06 Oct 2023

Discovering General Reinforcement Learning Algorithms with Adversarial Environment DesignNeural Information Processing Systems (NeurIPS), 2023

226

04 Oct 2023

Differentially Encoded Observation Spaces for Perceptive Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2023

Lev Grossman

Brian Plancher

OffRL

207

03 Oct 2023

Iterative Option Discovery for Planning, by Planning

Kenny Young

Richard S. Sutton

384

02 Oct 2023

LEGO-Prover: Neural Theorem Proving with Growing LibrariesInternational Conference on Learning Representations (ICLR), 2023

Zhengying Liu

...

Enze Xie

Xiaodan Liang

379

108

01 Oct 2023

Reinforcement Learning for Node Selection in Branch-and-Bound

Alexander Mattick

Christopher Mutschler

250

29 Sep 2023

Optimizing with Low Budgets: a Comparison on the Black-box Optimization Benchmarking Suite and OpenAI GymIEEE Transactions on Evolutionary Computation (IEEE TEVC), 2023

Elena Raponi

Nathanaël Carraz Rakotonirina

Jérémy Rapin

Carola Doerr

O. Teytaud

569

29 Sep 2023

Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned

Han Bao

Raphaël Jungers

Jean-Charles Delvenne

OffRL

196

28 Sep 2023

Vision Transformers for Computer Go

125

22 Sep 2023

Monte-Carlo tree search with uncertainty propagation via optimal transport

Odalric-Ambrym Maillard

171

19 Sep 2023

MBAPPE: MCTS-Built-Around Prediction for Planning Explicitly

236

15 Sep 2023

Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning

Xiao Liu

Wubing Chen

Mao Tan

204

12 Sep 2023

Neurosymbolic Reinforcement Learning and Planning: A SurveyIEEE Transactions on Artificial Intelligence (IEEE TAI), 2023

230

02 Sep 2023

DRL-Based Trajectory Tracking for Motion-Related Modules in Autonomous Driving

Yinda Xu

Lidong Yu

249

30 Aug 2023

Stabilizing Unsupervised Environment Design with a Learned Adversary

329

21 Aug 2023

DFB: A Data-Free, Low-Budget, and High-Efficacy Clean-Label Backdoor Attack

178

18 Aug 2023

Generating Personas for Games with Multimodal Adversarial Imitation Learning

162

15 Aug 2023

Provably Efficient Algorithm for Nonstationary Low-Rank MDPsNeural Information Processing Systems (NeurIPS), 2023

217

10 Aug 2023

JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

Wei Pan

292

09 Aug 2023

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

...

Sergio Gomez Colmenarejo

Aaron van den Oord

Wojciech M. Czarnecki

Nando de Freitas

Oriol Vinyals

OffRL

176

07 Aug 2023

CASSINI: Network-Aware Job Scheduling in Machine Learning ClustersSymposium on Networked Systems Design and Implementation (NSDI), 2023

135

01 Aug 2023

SAKSHI: Decentralized AI Platforms

...

Pramod Viswanath

101

31 Jul 2023

Thinker: Learning to Plan and ActNeural Information Processing Systems (NeurIPS), 2023

294

27 Jul 2023

Towards General Game Representations: Decomposing Games Pixels into Content and Style

C. Trivedi

Konstantinos Makantasis

Antonios Liapis

Georgios N. Yannakakis

OCL

199

20 Jul 2023

PyTAG: Challenges and Opportunities for Reinforcement Learning in Tabletop Games

Diego Perez-Liebana

203

19 Jul 2023

Towards A Unified Agent with Foundation Models

Markus Wulfmeier

Martin Riedmiller

241

18 Jul 2023

Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual TasksNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023

Bailin Wang

426

302

05 Jul 2023

Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure SensingJournal of Fluid Mechanics (JFM), 2023

240

05 Jul 2023

Enhancing Dexterity in Robotic Manipulation via Hierarchical Contact ExplorationIEEE Robotics and Automation Letters (RA-L), 2023

341

01 Jul 2023

Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented TasksNeural Information Processing Systems (NeurIPS), 2023

Maxime Chevalier-Boisvert

353

308

24 Jun 2023

Warm-Start Actor-Critic: From Approximation Error to Sub-optimality GapInternational Conference on Machine Learning (ICML), 2023

226

20 Jun 2023

iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning

364

09 Jun 2023

Introduction to Latent Variable Energy-Based Models: A Path Towards Autonomous Machine IntelligenceJournal of Statistical Mechanics: Theory and Experiment (J. Stat. Mech.), 2023

Anna Dawid

Yann LeCun

DRL

260

05 Jun 2023

Active Vision Reinforcement Learning under Limited Visual ObservabilityNeural Information Processing Systems (NeurIPS), 2023

Jinghuan Shang

Michael S. Ryoo

313

01 Jun 2023

Non-stationary Reinforcement Learning under General Function ApproximationInternational Conference on Machine Learning (ICML), 2023

Ming Yin

158

01 Jun 2023

Cross-Domain Policy Adaptation via Value-Guided Data FilteringNeural Information Processing Systems (NeurIPS), 2023

Zhen Wang

Xuelong Li

Wei Li

309

28 May 2023

Self-Supervised Reinforcement Learning that Transfers using Random FeaturesNeural Information Processing Systems (NeurIPS), 2023

Abhishek Gupta

247

26 May 2023

Model-Based Simulation for Optimising Smart ReplyAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

Benjamin Towle

Ke Zhou

177

26 May 2023

Reasoning with Language Model is Planning with World ModelConference on Empirical Methods in Natural Language Processing (EMNLP), 2023

450

831

24 May 2023

ADA-GP: Accelerating DNN Training By Adaptive Gradient PredictionMicro (MICRO), 2023

161

22 May 2023

Discovering Individual Rewards in Collective Behavior through Inverse Multi-Agent Reinforcement Learning

218

17 May 2023

Large Language Model Guided Tree-of-Thought

Jieyi Long

LM&Ro LRM

230

282

15 May 2023

Stackelberg Games for Learning Emergent Behaviors During Competitive AutocurriculaIEEE International Conference on Robotics and Automation (ICRA), 2023

197

04 May 2023

Physics-Inspired Interpretability Of Machine Learning Models

Maximilian P. Niroomand

D. Wales

AI4CE

108

05 Apr 2023