Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control

20 August 2021

Papers citing "Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control"

25 / 25 papers shown

Title
Rank-One Modified Value Iteration A. S. Kolarijani Tolga Ok Peyman Mohajerin Esfahani Mohamad Amin Sharif Kolarijani 22 0 0 03 May 2025
Trust-Region Twisted Policy Improvement Joery A. de Vries Jinke He Yaniv Oren M. Spaan OffRL LRM 30 0 0 08 Apr 2025
MPCritic: A plug-and-play MPC architecture for reinforcement learning Nathan P. Lawrence Thomas Banker Ali Mesbah 31 0 0 01 Apr 2025
On-line Policy Improvement using Monte-Carlo Search Gerald Tesauro Gregory R. Galperin 83 270 0 09 Jan 2025
Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming Dimitri Bertsekas 41 6 0 02 Jun 2024
Learning to Boost the Performance of Stable Nonlinear Systems Luca Furieri C. Galimberti Giancarlo Ferrari-Trecate 26 9 0 01 May 2024
The Effective Horizon Explains Deep RL Performance in Stochastic Environments Cassidy Laidlaw Banghua Zhu Stuart J. Russell Anca Dragan 28 2 0 13 Dec 2023
Surge Routing: Event-informed Multiagent Reinforcement Learning for Autonomous Rideshare Daniel Garces Stephanie Gil AI4TS 19 2 0 05 Jul 2023
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap Hang Wang Sen Lin Junshan Zhang OffRL OnRL 26 3 0 20 Jun 2023
What model does MuZero learn? Jinke He Thomas M. Moerland F. Oliehoek 33 4 0 01 Jun 2023
Online augmentation of learned grasp sequence policies for more adaptable and data-efficient in-hand manipulation E. Gordon Rana Soltani-Zarrin OffRL 24 5 0 04 Apr 2023
Deep networks for system identification: a Survey G. Pillonetto Aleksandr Aravkin Daniel Gedon L. Ljung Antônio H. Ribeiro Thomas B. Schon OOD 35 35 0 30 Jan 2023
Rollout Algorithms and Approximate Dynamic Programming for Bayesian Optimization and Sequential Estimation Dimitri Bertsekas 27 3 0 15 Dec 2022
Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand Daniel Garces Sushmita Bhattacharya Stephanie Gil Dimitri Bertsekas 22 10 0 28 Nov 2022
Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach Siddhant Bhambri Amrita Bhattacharjee Dimitri Bertsekas 11 9 0 15 Nov 2022
Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation Anna Winnicki R. Srikant OffRL 8 6 0 13 Oct 2022
Statistical Hypothesis Testing Based on Machine Learning: Large Deviations Analysis P. Braca L. Millefiori A. Aubry S. Maranò A. De Maio P. Willett 29 12 0 22 Jul 2022
New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning Dimitri Bertsekas 9 2 0 19 Jul 2022
Bayesian Learning Approach to Model Predictive Control Namhoon Cho Seokwon Lee Hyo-Sang Shin Antonios Tsourdos 10 1 0 05 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges Yuxi Li OffRL 36 9 0 23 Feb 2022
Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems Christos K. Verginis Zhe Xu Ufuk Topcu 9 3 0 11 Oct 2021
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement Learning with Linear Value Function Approximation Anna Winnicki Joseph Lubars Michael Livesay R. Srikant 15 3 0 28 Sep 2021
Non-Parametric Neuro-Adaptive Control Subject to Task Specifications Christos K. Verginis Zhe Xu Ufuk Topcu 10 4 0 25 Jun 2021
Adaptive Variants of Optimal Feedback Policies B. Lopez Jean-Jacques E. Slotine OffRL 21 4 0 06 Apr 2021
Multiagent Value Iteration Algorithms in Dynamic Programming and Reinforcement Learning Dimitri Bertsekas 38 38 0 04 May 2020