v1v2v3v4 (latest)

Naive Exploration is Optimal for Online LQR

International Conference on Machine Learning (ICML), 2020

27 January 2020

Max Simchowitz

Dylan J. Foster

ArXiv (abs)PDF HTML

Papers citing "Naive Exploration is Optimal for Online LQR"

50 / 118 papers shown

SOMBRL: Scalable and Optimistic Model-Based RL

318

25 Nov 2025

Adversarially Robust Multitask Adaptive Control

Kasra Fallah

Leonardo F. Toso

James Anderson

155

07 Nov 2025

Learning Soft Robotic Dynamics with Active Exploration

Robert K. Katzschmann

183

31 Oct 2025

Universal Learning of Nonlinear Dynamics

Evan Dogariu

Anand Brahmbhatt

Elad Hazan

161

16 Aug 2025

Suboptimality analysis of receding horizon quadratic control with unknown linear systems and its applications in learning-based controlIEEE Transactions on Automatic Control (TAC), 2023

Shengli Shi

Anastasios Tsiamis

B. de Schutter

310

01 Jul 2025

Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function ApproximationConference on Uncertainty in Artificial Intelligence (UAI), 2025

292

20 May 2025

Policy Gradient for LQR with Domain Randomization

276

31 Mar 2025

Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information

Ziyi Zhang

Yorie Nakahira

Guannan Qu

440

13 Sep 2024

Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control

411

08 Jul 2024

NeoRL: Efficient Exploration for Nonepisodic RL

671

03 Jun 2024

On the Sample Complexity of Set Membership Estimation for Linear Systems with Disturbances Bounded by Convex Sets

Haonan Xu

Yingying Li

395

01 Jun 2024

Quantum Non-Identical Mean Estimation: Efficient Algorithms and Fundamental Limits

208

21 May 2024

Sample Complexity of the Linear Quadratic Regulator: A Reinforcement Learning Lens

Amirreza Neshaei Moghaddam

A. Olshevsky

Bahman Gharesifard

302

16 Apr 2024

Active Learning for Control-Oriented Identification of Nonlinear Systems

Bruce D. Lee

Ingvar M. Ziemann

George J. Pappas

Nikolai Matni

345

13 Apr 2024

A least-square method for non-asymptotic identification in linear switching control

Haoyuan Sun

Ali Jadbabaie

221

11 Apr 2024

Regret Analysis of Policy Optimization over Submanifolds for Linearly Constrained Online LQG

Ting-Jui Chang

Shahin Shahrampour

OffRL

337

13 Mar 2024

Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

431

03 Mar 2024

Predictive Linear Online Tracking for Unknown Targets

552

15 Feb 2024

Online Control of Linear Systems under Unbounded Noise

Kaito Ito

Taira Tsuchiya

252

15 Feb 2024

Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence

340

05 Feb 2024

Nonasymptotic Regret Analysis of Adaptive Linear Quadratic Control with Model MisspecificationConference on Learning for Dynamics & Control (L4DC), 2023

Bruce D. Lee

Anders Rantzer

Nikolai Matni

522

29 Dec 2023

PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNsAAAI Conference on Artificial Intelligence (AAAI), 2023

226

15 Dec 2023

On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQRConference on Learning for Dynamics & Control (L4DC), 2023

Insoon Yang

312

09 Dec 2023

Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning AlgorithmsConference on Learning for Dynamics & Control (L4DC), 2023

362

30 Nov 2023

Regret Analysis of Learning-Based Linear Quadratic Gaussian Control with Additive ExplorationEuropean Control Conference (ECC), 2023

294

05 Nov 2023

Efficient Exploration in Continuous-time Model-based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023

301

30 Oct 2023

Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and AutoregressionInternational Conference on Learning Representations (ICLR), 2023

Max Simchowitz

323

17 Oct 2023

Regret Analysis of Distributed Online Control for LTI Systems with Adversarial Disturbances

Ting-Jui Chang

Shahin Shahrampour

285

04 Oct 2023

Learning the Uncertainty Sets for Control Dynamics via Set Membership: A Non-Asymptotic AnalysisInternational Conference on Machine Learning (ICML), 2023

Adam Wierman

284

26 Sep 2023

Meta-Learning Operators to Optimality from Multi-Task Non-IID DataInternational Conference on Learning Representations (ICLR), 2023

Thomas T. Zhang

Leonardo F. Toso

James Anderson

Nikolai Matni

296

08 Aug 2023

Optimistic Active Exploration of Dynamical SystemsNeural Information Processing Systems (NeurIPS), 2023

623

21 Jun 2023

Optimal Exploration for Model-Based RL in Nonlinear SystemsNeural Information Processing Systems (NeurIPS), 2023

Andrew Wagenmaker

Guanya Shi

Kevin Jamieson

345

15 Jun 2023

Finite Time Regret Bounds for Minimum Variance Control of Autoregressive Systems with Exogenous Inputs

197

26 May 2023

Optimal Rates for Bandit Nonstochastic ControlNeural Information Processing Systems (NeurIPS), 2023

Y. Jennifer Sun

Stephen Newman

Elad Hazan

448

24 May 2023

Exact Recovery for System Identification with More Corrupt Data than Clean Data

452

17 May 2023

Stability Bounds for Learning-Based Adaptive Control of Discrete-Time Multi-Dimensional Stochastic Linear Systems with Input ConstraintsIEEE Conference on Decision and Control (CDC), 2023

215

02 Apr 2023

PAC-Bayesian bounds for learning LTI-ss systems with input from empirical loss

268

29 Mar 2023

Oracle-Efficient Smoothed Online Learning for Piecewise Continuous Decision MakingAnnual Conference Computational Learning Theory (COLT), 2023

Adam Block

Alexander Rakhlin

Max Simchowitz

429

10 Feb 2023

Smoothed Online Learning for Prediction in Piecewise Affine SystemsNeural Information Processing Systems (NeurIPS), 2023

Adam Block

Max Simchowitz

Russ Tedrake

282

26 Jan 2023

PAC-Bayesian-Like Error Bound for a Class of Linear Time-Invariant Stochastic State-Space Models

245

30 Dec 2022

Best of Both Worlds in Online Control: Competitive Ratio and Policy RegretConference on Learning for Dynamics & Control (L4DC), 2022

254

21 Nov 2022

Implications of Regret on Stability of Linear Dynamical SystemsIFAC-PapersOnLine (IFAC-PapersOnLine), 2022

198

14 Nov 2022

Provable Sim-to-real Transfer in Continuous Domain with Partial ObservationsInternational Conference on Learning Representations (ICLR), 2022

371

27 Oct 2022

Online Convex Optimization with Unbounded MemoryNeural Information Processing Systems (NeurIPS), 2022

Raunak Kumar

Sarah Dean

Robert D. Kleinberg

526

18 Oct 2022

$Learning Decentralized Linear Quadratic Regulators with $\sqrt{T}$ Regret$

Learning Decentralized Linear Quadratic Regulators with

\sqrt{T}

RegretSIAM Journal of Control and Optimization (SICON), 2022

392

17 Oct 2022

Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies

Na Li

338

10 Oct 2022

Learning-Based Adaptive Control for Stochastic Linear Systems with Input ConstraintsIEEE Control Systems Letters (L-CSS), 2022

289

15 Sep 2022

Statistical Learning Theory for Control: A Finite Sample PerspectiveIEEE Control Systems (IEEE Control Syst. Mag.), 2022

Anastasios Tsiamis

Ingvar M. Ziemann

Nikolai Matni

George J. Pappas

585

12 Sep 2022

Meta-Learning Online Control for Linear Dynamical SystemsIEEE Conference on Decision and Control (CDC), 2022

Deepan Muthirayan

D. Kalathil

Pramod P. Khargonekar

279

18 Aug 2022

Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning

Lukasz Szpruch

Tanut Treetanthiploet

Yufei Zhang

416

08 Aug 2022