ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.09225
  4. Cited By
Continuous Doubly Constrained Batch Reinforcement Learning
v1v2v3v4 (latest)

Continuous Doubly Constrained Batch Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2021
18 February 2021
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
    OffRL
ArXiv (abs)PDFHTMLGithub

Papers citing "Continuous Doubly Constrained Batch Reinforcement Learning"

23 / 23 papers shown
Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering
Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering
Zhongjian Qiao
Rui Yang
Jiafei Lyu
Chenjia Bai
Xiu Li
Zhuoran Yang
Siyang Gao
208
0
0
02 Dec 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
479
4
0
17 Apr 2025
Towards Optimal Offline Reinforcement Learning
Towards Optimal Offline Reinforcement Learning
Mengmeng Li
Daniel Kuhn
Tobias Sutter
OffRL
363
3
0
15 Mar 2025
AlphaRouter: Quantum Circuit Routing with Reinforcement Learning and
  Tree Search
AlphaRouter: Quantum Circuit Routing with Reinforcement Learning and Tree SearchInternational Conference on Quantum Computing and Engineering (QCE), 2024
Wei Tang
Yiheng Duan
Yaroslav Kharkov
Rasool Fakoor
Eric Kessler
Yunong Shi
221
13
0
07 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through
  Generative World Model
Grounded Answers for Multi-agent Decision-making Problem through Generative World ModelNeural Information Processing Systems (NeurIPS), 2024
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
431
6
0
03 Oct 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
SelfBC: Self Behavior Cloning for Offline Reinforcement LearningEuropean Conference on Artificial Intelligence (ECAI), 2024
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
326
3
0
04 Aug 2024
Bridging Model-Based Optimization and Generative Modeling via
  Conservative Fine-Tuning of Diffusion Models
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
Masatoshi Uehara
Yulai Zhao
Ehsan Hajiramezanali
Gabriele Scalia
Gökçen Eraslan
Avantika Lal
Sergey Levine
Tommaso Biancalani
463
28
0
30 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Exclusively Penalized Q-learning for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
374
7
0
23 May 2024
Zero-Shot Reinforcement Learning from Low Quality Data
Zero-Shot Reinforcement Learning from Low Quality DataNeural Information Processing Systems (NeurIPS), 2023
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRLOnRL
403
17
0
26 Sep 2023
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Laixi Shi
Robert Dadashi
Yuejie Chi
Pablo Samuel Castro
Matthieu Geist
OffRL
308
6
0
25 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&RoOffRL
396
6
0
20 Jul 2023
Budgeting Counterfactual for Offline RL
Budgeting Counterfactual for Offline RLNeural Information Processing Systems (NeurIPS), 2023
Yao Liu
Pratik Chaudhari
Rasool Fakoor
OffRL
371
4
0
12 Jul 2023
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage
Offline Minimax Soft-Q-learning Under Realizability and Partial CoverageNeural Information Processing Systems (NeurIPS), 2023
Masatoshi Uehara
Nathan Kallus
Jason D. Lee
Wen Sun
OffRL
402
8
0
05 Feb 2023
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from
  Mixed Datasets
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed DatasetsIndustrial Conference on Data Mining (IDM), 2022
Yuanying Cai
Wei Shen
Li Zhao
Wei Shen
Xuyun Zhang
Lei Song
Jiang Bian
Tao Qin
Tie-Yan Liu
OffRL
232
6
0
05 Dec 2022
State Advantage Weighting for Offline RL
State Advantage Weighting for Offline RL
Jiafei Lyu
Aicheng Gong
Le Wan
Zongqing Lu
Xiu Li
OffRL
363
9
0
09 Oct 2022
Time-Varying Propensity Score to Bridge the Gap between the Past and
  Present
Time-Varying Propensity Score to Bridge the Gap between the Past and PresentInternational Conference on Learning Representations (ICLR), 2022
Rasool Fakoor
Jonas W. Mueller
Zachary Chase Lipton
Pratik Chaudhari
Alexander J. Smola
OODAI4TS
581
4
0
04 Oct 2022
Mildly Conservative Q-Learning for Offline Reinforcement Learning
Mildly Conservative Q-Learning for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Jiafei Lyu
Xiaoteng Ma
Xiu Li
Zongqing Lu
OffRL
443
148
0
09 Jun 2022
Model-Based Offline Meta-Reinforcement Learning with Regularization
Model-Based Offline Meta-Reinforcement Learning with RegularizationInternational Conference on Learning Representations (ICLR), 2022
Sen Lin
Jialin Wan
Tengyu Xu
Yingbin Liang
Junshan Zhang
OffRL
434
20
0
07 Feb 2022
The Difficulty of Passive Learning in Deep Reinforcement Learning
The Difficulty of Passive Learning in Deep Reinforcement Learning
Georg Ostrovski
Pablo Samuel Castro
Will Dabney
OffRL
203
70
0
26 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-LearningInternational Conference on Learning Representations (ICLR), 2021
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
647
1,372
0
12 Oct 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
A Workflow for Offline Model-Free Robotic Reinforcement LearningConference on Robot Learning (CoRL), 2021
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
397
91
0
22 Sep 2021
Pessimistic Model-based Offline Reinforcement Learning under Partial
  Coverage
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
538
171
0
13 Jul 2021
Mitigating Covariate Shift in Imitation Learning via Offline Data
  Without Great Coverage
Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage
Jonathan D. Chang
Masatoshi Uehara
Dhruv Sreenivas
Rahul Kidambi
Wen Sun
OffRL
382
37
0
06 Jun 2021
1
Page 1 of 1