v1v2v3v4 (latest)

Continuous Doubly Constrained Batch Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2021

18 February 2021

ArXiv (abs)PDF HTML Github

Papers citing "Continuous Doubly Constrained Batch Reinforcement Learning"

23 / 23 papers shown

Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering

208

02 Dec 2025

An Optimal Discriminator Weighted Imitation Perspective for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025

479

17 Apr 2025

Towards Optimal Offline Reinforcement Learning

363

15 Mar 2025

AlphaRouter: Quantum Circuit Routing with Reinforcement Learning and Tree SearchInternational Conference on Quantum Computing and Engineering (QCE), 2024

221

07 Oct 2024

Grounded Answers for Multi-agent Decision-making Problem through Generative World ModelNeural Information Processing Systems (NeurIPS), 2024

431

03 Oct 2024

SelfBC: Self Behavior Cloning for Offline Reinforcement LearningEuropean Conference on Artificial Intelligence (ECAI), 2024

326

04 Aug 2024

Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

463

30 May 2024

Exclusively Penalized Q-learning for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024

374

23 May 2024

Zero-Shot Reinforcement Learning from Low Quality DataNeural Information Processing Systems (NeurIPS), 2023

403

26 Sep 2023

Offline Reinforcement Learning with On-Policy Q-Function Regularization

308

25 Jul 2023

PASTA: Pretrained Action-State Transformer Agents

396

20 Jul 2023

Budgeting Counterfactual for Offline RLNeural Information Processing Systems (NeurIPS), 2023

371

12 Jul 2023

Offline Minimax Soft-Q-learning Under Realizability and Partial CoverageNeural Information Processing Systems (NeurIPS), 2023

402

05 Feb 2023

TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed DatasetsIndustrial Conference on Data Mining (IDM), 2022

Wei Shen

Lei Song

Jiang Bian

Tao Qin

Tie-Yan Liu

OffRL

232

05 Dec 2022

State Advantage Weighting for Offline RL

363

09 Oct 2022

Time-Varying Propensity Score to Bridge the Gap between the Past and PresentInternational Conference on Learning Representations (ICLR), 2022

581

04 Oct 2022

Mildly Conservative Q-Learning for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

443

148

09 Jun 2022

Model-Based Offline Meta-Reinforcement Learning with RegularizationInternational Conference on Learning Representations (ICLR), 2022

434

07 Feb 2022

The Difficulty of Passive Learning in Deep Reinforcement Learning

203

26 Oct 2021

Offline Reinforcement Learning with Implicit Q-LearningInternational Conference on Learning Representations (ICLR), 2021

647

1,372

12 Oct 2021

A Workflow for Offline Model-Free Robotic Reinforcement LearningConference on Robot Learning (CoRL), 2021

Stephen Tian

397

22 Sep 2021

Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage

Masatoshi Uehara

Wen Sun

OffRL

538

171

13 Jul 2021

Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage

382

06 Jun 2021