2108.10315
Cited By
Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control
20 August 2021
Dimitri Bertsekas
AI4CE
Papers citing "Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control"
25 / 25 papers shown
Title
Rank-One Modified Value Iteration
A. S. Kolarijani
Tolga Ok
Peyman Mohajerin Esfahani
Mohamad Amin Sharif Kolarijani
22
0
0
03 May 2025
Trust-Region Twisted Policy Improvement
Joery A. de Vries
Jinke He
Yaniv Oren
M. Spaan
OffRL
LRM
30
0
0
08 Apr 2025
MPCritic: A plug-and-play MPC architecture for reinforcement learning
Nathan P. Lawrence
Thomas Banker
Ali Mesbah
31
0
0
01 Apr 2025
On-line Policy Improvement using Monte-Carlo Search
Gerald Tesauro
Gregory R. Galperin
83
270
0
09 Jan 2025
Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming
Dimitri Bertsekas
41
6
0
02 Jun 2024
Learning to Boost the Performance of Stable Nonlinear Systems
Luca Furieri
C. Galimberti
Giancarlo Ferrari-Trecate
26
9
0
01 May 2024
The Effective Horizon Explains Deep RL Performance in Stochastic Environments
Cassidy Laidlaw
Banghua Zhu
Stuart J. Russell
Anca Dragan
28
2
0
13 Dec 2023
Surge Routing: Event-informed Multiagent Reinforcement Learning for Autonomous Rideshare
Daniel Garces
Stephanie Gil
AI4TS
19
2
0
05 Jul 2023
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
Hang Wang
Sen Lin
Junshan Zhang
OffRL
OnRL
26
3
0
20 Jun 2023
What model does MuZero learn?
Jinke He
Thomas M. Moerland
F. Oliehoek
33
4
0
01 Jun 2023
Online augmentation of learned grasp sequence policies for more adaptable and data-efficient in-hand manipulation
E. Gordon
Rana Soltani-Zarrin
OffRL
24
5
0
04 Apr 2023
Deep networks for system identification: a Survey
G. Pillonetto
Aleksandr Aravkin
Daniel Gedon
L. Ljung
Antônio H. Ribeiro
Thomas B. Schön
OOD
35
35
0
30 Jan 2023
Rollout Algorithms and Approximate Dynamic Programming for Bayesian Optimization and Sequential Estimation
Dimitri Bertsekas
27
3
0
15 Dec 2022
Multiagent Reinforcement Learning for Autonomous Routing and Pickup Problem with Adaptation to Variable Demand
Daniel Garces
Sushmita Bhattacharya
Stephanie Gil
Dimitri Bertsekas
22
10
0
28 Nov 2022
Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach
Siddhant Bhambri
Amrita Bhattacharjee
Dimitri Bertsekas
11
9
0
15 Nov 2022
Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation
Anna Winnicki
R. Srikant
OffRL
8
6
0
13 Oct 2022
Statistical Hypothesis Testing Based on Machine Learning: Large Deviations Analysis
P. Braca
L. Millefiori
A. Aubry
S. Maranò
A. De Maio
P. Willett
29
12
0
22 Jul 2022
New Auction Algorithms for Path Planning, Network Transport, and Reinforcement Learning
Dimitri Bertsekas
9
2
0
19 Jul 2022
Bayesian Learning Approach to Model Predictive Control
Namhoon Cho
Seokwon Lee
Hyo-Sang Shin
Antonios Tsourdos
10
1
0
05 Mar 2022
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems
Christos K. Verginis
Zhe Xu
Ufuk Topcu
9
3
0
11 Oct 2021
The Role of Lookahead and Approximate Policy Evaluation in Reinforcement Learning with Linear Value Function Approximation
Anna Winnicki
Joseph Lubars
Michael Livesay
R. Srikant
15
3
0
28 Sep 2021
Non-Parametric Neuro-Adaptive Control Subject to Task Specifications
Christos K. Verginis
Zhe Xu
Ufuk Topcu
10
4
0
25 Jun 2021
Adaptive Variants of Optimal Feedback Policies
B. Lopez
Jean-Jacques E. Slotine
OffRL
21
4
0
06 Apr 2021
Multiagent Value Iteration Algorithms in Dynamic Programming and Reinforcement Learning
Dimitri Bertsekas
38
38
0
04 May 2020