
Agnostic System Identification for Model-Based Reinforcement Learning

Stéphane Ross, Drew Bagnell
International Conference on Machine Learning (ICML), 2012. arXiv:1203.1007, 5 March 2012.

Papers citing "Agnostic System Identification for Model-Based Reinforcement Learning"

50 / 76 papers shown
1. Multi-agent Coordination via Flow Matching
   Dongsu Lee, Daehee Lee, Amy Zhang. 07 Nov 2025.
2. Finite-Time Bounds for Average-Reward Fitted Q-Iteration [OffRL]
   Jongmin Lee, Ernest K. Ryu. 20 Oct 2025.
3. Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies [OffRL]
   Jiaqi Chen, Ji Shi, Cansu Sancaktar, Jonas Frey, Georg Martius. 06 Sep 2025.
4. Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis [OffRL]
   Ruiquan Huang, Donghao Li, Chengshuai Shi, Cong Shen, Jing Yang. 01 Jul 2025.
5. A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search [OffRL]
   A. Jain, Vibhakar Mohta, Subin Kim, Atiksh Bhardwaj, Juntao Ren, Yunhai Feng, Sanjiban Choudhury, Gokul Swamy. 05 Jun 2025.
6. Trajectory First: A Curriculum for Discovering Diverse Policies
   Cornelius V. Braun, Sayantan Auddy, Marc Toussaint. 02 Jun 2025.
7. Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning [OffRL]
   Jiayu Chen, Aravind Venugopal, Shiyu Huang, Jeff Schneider. 19 May 2025.
8. ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning [OffRL]
   Kun Wu, Yinuo Zhao, Zhihao Xu, Zhengping Che, Chengxiang Yin, C. Liu, Qinru Qiu, Feiferi Feng. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024. 22 Dec 2024.
9. Hybrid Reinforcement Learning from Offline Observation Alone [OffRL]
   Yuda Song, J. Andrew Bagnell, Aarti Singh. 11 Jun 2024.
10. The Virtues of Pessimism in Inverse Reinforcement Learning
    David Wu, Gokul Swamy, J. Andrew Bagnell, Zhiwei Steven Wu, Sanjiban Choudhury. 04 Feb 2024.
11. MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning [OffRL]
    Mao Hong, Zhiyue Zhang, Yue Wu, Yan Xu. 21 Jan 2024.
12. Boosting Reinforcement Learning and Planning with Demonstrations: A Survey [OffRL]
    Tongzhou Mu, H. Su. 23 Mar 2023.
13. On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples [OffRL]
    Mustafa O. Karabag, Ufuk Topcu. AAAI Conference on Artificial Intelligence (AAAI), 2023. 07 Mar 2023.
14. The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms [OffRL]
    Anirudh Vemula, Yuda Song, Aarti Singh, J. Andrew Bagnell, Sanjiban Choudhury. International Conference on Machine Learning (ICML), 2023. 01 Mar 2023.
15. Predictable MDP Abstraction for Unsupervised Model-Based RL
    Seohong Park, Sergey Levine. International Conference on Machine Learning (ICML), 2023. 08 Feb 2023.
16. Efficient Online Reinforcement Learning with Offline Data [OffRL, OnRL]
    Philip J. Ball, Laura M. Smith, Ilya Kostrikov, Sergey Levine. International Conference on Machine Learning (ICML), 2023. 06 Feb 2023.
17. Policy Expansion for Bridging Offline-to-Online Reinforcement Learning [OffRL, OnRL, CLL]
    Haichao Zhang, Weiwen Xu, Haonan Yu. International Conference on Learning Representations (ICLR), 2023. 02 Feb 2023.
18. Selective Uncertainty Propagation in Offline RL [OffRL]
    Sanath Kumar Krishnamurthy, Shrey Modi, Tanmay Gangwani, S. Katariya, Branislav Kveton, A. Rangi. AAAI Conference on Artificial Intelligence (AAAI), 2023. 01 Feb 2023.
19. Leveraging Offline Data in Online Reinforcement Learning [OffRL, OnRL]
    Andrew Wagenmaker, Aldo Pacchiano. International Conference on Machine Learning (ICML), 2022. 09 Nov 2022.
20. Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient [OffRL, OnRL]
    Yuda Song, Yi Zhou, Ayush Sekhari, J. Andrew Bagnell, A. Krishnamurthy, Wen Sun. International Conference on Learning Representations (ICLR), 2022. 13 Oct 2022.
21. A Unified Framework for Alternating Offline Model Training and Policy Learning [OffRL]
    Shentao Yang, Shujian Zhang, Yihao Feng, Mi Zhou. Neural Information Processing Systems (NeurIPS), 2022. 12 Oct 2022.
22. Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL [OffRL, LRM]
    Fengzhuo Zhang, Boyi Liu, Kaixin Wang, Vincent Y. F. Tan, Zhuoran Yang, Zhaoran Wang. Neural Information Processing Systems (NeurIPS), 2022. 20 Sep 2022.
23. Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
    Shen Zhang. Neural Information Processing Systems (NeurIPS), 2022. 16 Sep 2022.
24. Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination [OffRL]
    Jiafei Lyu, Xiu Li, Zongqing Lu. Neural Information Processing Systems (NeurIPS), 2022. 16 Jun 2022.
25. Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning [OffRL]
    Shentao Yang, Yihao Feng, Shujian Zhang, Mi Zhou. International Conference on Machine Learning (ICML), 2022. 14 Jun 2022.
26. Online No-regret Model-Based Meta RL for Personalized Navigation
    Yuda Song, Ye Yuan, Wen Sun, Kris Kitani. Conference on Learning for Dynamics & Control (L4DC), 2022. 05 Apr 2022.
27. Value Gradient weighted Model-Based Reinforcement Learning
    C. Voelcker, Victor Liao, Animesh Garg, Amir-massoud Farahmand. International Conference on Learning Representations (ICLR), 2022. 04 Apr 2022.
28. AKF-SR: Adaptive Kalman Filtering-based Successor Representation
    Parvin Malekzadeh, Mohammad Salimibeni, Ming Hou, Arash Mohammadi, Konstantinos N. Plataniotis. 31 Mar 2022.
29. How to Leverage Unlabeled Data in Offline Reinforcement Learning [OffRL]
    Tianhe Yu, Aviral Kumar, Yevgen Chebotar, Karol Hausman, Chelsea Finn, Sergey Levine. International Conference on Machine Learning (ICML), 2022. 03 Feb 2022.
30. Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation [OffRL]
    Dylan J. Foster, A. Krishnamurthy, D. Simchi-Levi, Yunzong Xu. Annual Conference on Computational Learning Theory (COLT), 2021. 21 Nov 2021.
31. Mismatched No More: Joint Model-Policy Optimization for Model-Based RL [OffRL]
    Benjamin Eysenbach, Alexander Khazatsky, Sergey Levine, Ruslan Salakhutdinov. 06 Oct 2021.
32. DROMO: Distributionally Robust Offline Model-based Policy Optimization [OffRL]
    Ruizhen Liu, Dazhi Zhong, Zhi-Cong Chen. 15 Sep 2021.
33. Non-Markovian Reinforcement Learning using Fractional Dynamics [OffRL]
    Gaurav Gupta, Chenzhong Yin, Jyotirmoy V. Deshmukh, P. Bogdan. IEEE Conference on Decision and Control (CDC), 2021. 29 Jul 2021.
34. PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
    Yuda Song, Wen Sun. International Conference on Machine Learning (ICML), 2021. 15 Jul 2021.
35. Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage [OffRL]
    Masatoshi Uehara, Wen Sun. 13 Jul 2021.
36. Identity Concealment Games: How I Learned to Stop Revealing and Love the Coincidences
    Mustafa O. Karabag, Melkior Ornik, Ufuk Topcu. 12 May 2021.
37. Instabilities of Offline RL with Pre-Trained Neural Representation [OffRL]
    Ruosong Wang, Yifan Wu, Ruslan Salakhutdinov, Sham Kakade. International Conference on Machine Learning (ICML), 2021. 08 Mar 2021.
38. COMBO: Conservative Offline Model-Based Policy Optimization [OffRL]
    Tianhe Yu, Aviral Kumar, Rafael Rafailov, Aravind Rajeswaran, Sergey Levine, Chelsea Finn. Neural Information Processing Systems (NeurIPS), 2021. 16 Feb 2021.
39. Blending MPC & Value Function Approximation for Efficient Reinforcement Learning
    M. Bhardwaj, Sanjiban Choudhury, Byron Boots. International Conference on Learning Representations (ICLR), 2020. 10 Dec 2020.
40. What are the Statistical Limits of Offline RL with Linear Function Approximation? [OffRL]
    Ruosong Wang, Dean Phillips Foster, Sham Kakade. 22 Oct 2020.
41. Driving Through Ghosts: Behavioral Cloning with False Positives
    Andreas Buhler, Adrien Gaidon, Andrei Cramariuc, Rares Andrei Ambrus, Guy Rosman, Wolfram Burgard. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020. 29 Aug 2020.
42. Explaining Fast Improvement in Online Imitation Learning [OnRL]
    Xinyan Yan, Byron Boots, Ching-An Cheng. 06 Jul 2020.
43. Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers
    Benjamin Eysenbach, Swapnil Asawa, Shreyas Chaudhari, Sergey Levine, Ruslan Salakhutdinov. International Conference on Learning Representations (ICLR), 2020. 24 Jun 2020.
44. Neural Dynamical Systems: Balancing Structure and Flexibility in Physical Prediction [AI4CE]
    Viraj Mehta, I. Char, Willie Neiswanger, Youngseog Chung, A. Nelson, M. Boyer, E. Kolemen, J. Schneider. IEEE Conference on Decision and Control (CDC), 2020. 23 Jun 2020.
45. Information Theoretic Regret Bounds for Online Nonlinear Control
    Sham Kakade, A. Krishnamurthy, Kendall Lowrey, Motoya Ohnishi, Wen Sun. 22 Jun 2020.
46. Active Learning for Nonlinear System Identification with Guarantees
    Horia Mania, Sai Li, Benjamin Recht. 18 Jun 2020.
47. Provably Efficient Model-based Policy Adaptation [TTA, OffRL]
    Yuda Song, Aditi Mavalankar, Wen Sun, Sicun Gao. International Conference on Machine Learning (ICML), 2020. 14 Jun 2020.
48. Learning Active Task-Oriented Exploration Policies for Bridging the Sim-to-Real Gap
    Jacky Liang, Saumya Saxena, Oliver Kroemer. 02 Jun 2020.
49. MM-KTD: Multiple Model Kalman Temporal Differences for Reinforcement Learning [OffRL]
    Parvin Malekzadeh, Mohammad Salimibeni, Arash Mohammadi, A. Assa, Konstantinos N. Plataniotis. IEEE Access, 2020. 30 May 2020.
50. MOReL: Model-Based Offline Reinforcement Learning [OffRL]
    Rahul Kidambi, Aravind Rajeswaran, Praneeth Netrapalli, Thorsten Joachims. 12 May 2020.
Page 1 of 2