Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.07219
Cited By
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
15 April 2020
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"D4RL: Datasets for Deep Data-Driven Reinforcement Learning"
50 / 927 papers shown
Title
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
Vincent Liu
P. Nagarajan
Andrew Patterson
Martha White
OffRL
26
2
0
04 Dec 2023
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRL
29
9
0
30 Nov 2023
Goal-conditioned Offline Planning from Curious Exploration
Marco Bagatella
Georg Martius
OffRL
21
1
0
28 Nov 2023
Replay across Experiments: A Natural Extension of Off-Policy RL
Dhruva Tirumala
Thomas Lampe
José Enrique Chen
Tuomas Haarnoja
Sandy Huang
...
Tim Hertweck
Leonard Hasenclever
Martin Riedmiller
N. Heess
Markus Wulfmeier
OffRL
37
8
0
27 Nov 2023
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning
Melrose Roderick
Gaurav Manek
Felix Berkenkamp
J. Zico Kolter
OffRL
OnRL
19
0
0
25 Nov 2023
Guided Flows for Generative Modeling and Decision Making
Qinqing Zheng
Matt Le
Neta Shaul
Y. Lipman
Aditya Grover
Ricky T. Q. Chen
26
35
0
22 Nov 2023
RLIF: Interactive Imitation Learning as Reinforcement Learning
Jianlan Luo
Perry Dong
Yuexiang Zhai
Yi Ma
Sergey Levine
OffRL
30
14
0
21 Nov 2023
Self-Supervised Curriculum Generation for Autonomous Reinforcement Learning without Task-Specific Knowledge
Sang-Hyun Lee
Seung-Woo Seo
ODL
CLL
SSL
24
2
0
15 Nov 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
Yongyi Mao
Hongchang Zhang
Cheng Chen
Yi Tian Xu
Xiangyang Ji
OffRL
34
14
0
15 Nov 2023
Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations
Joey Hong
Sergey Levine
Anca Dragan
OffRL
LLMAG
42
24
0
09 Nov 2023
Accelerating Exploration with Unlabeled Prior Data
Qiyang Li
Jason Zhang
Dibya Ghosh
Amy Zhang
Sergey Levine
OffRL
OnRL
34
9
0
09 Nov 2023
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRL
OnRL
58
13
0
06 Nov 2023
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion
Firas Al-Hafez
Guoping Zhao
Jan Peters
Davide Tateo
21
20
0
04 Nov 2023
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit S. Sikchi
Rohan Chitnis
Ahmed Touati
A. Geramifard
Amy Zhang
S. Niekum
OffRL
31
7
0
03 Nov 2023
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
26
0
0
02 Nov 2023
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
32
5
0
02 Nov 2023
Rethinking Decision Transformer via Hierarchical Reinforcement Learning
Yi Ma
Chenjun Xiao
Hebin Liang
Jianye Hao
OffRL
19
6
0
01 Nov 2023
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity
Joey Hong
Anca Dragan
Sergey Levine
OffRL
22
5
0
31 Oct 2023
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Ruizhe Shi
Yuyao Liu
Yanjie Ze
Simon S. Du
Huazhe Xu
OffRL
RALM
34
18
0
31 Oct 2023
Contrastive Difference Predictive Coding
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TS
OffRL
28
11
0
31 Oct 2023
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Guy Van den Broeck
Mathias Niepert
Yitao Liang
OffRL
34
1
0
31 Oct 2023
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans
Kyowoon Lee
Seongun Kim
Jaesik Choi
DiffM
32
9
0
30 Oct 2023
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
Zhaoyi Zhou
Chuning Zhu
Runlong Zhou
Qiwen Cui
Abhishek Gupta
S. S. Du
OffRL
40
8
0
30 Oct 2023
Label Poisoning is All You Need
Rishi Jha
J. Hayase
Sewoong Oh
AAML
22
28
0
29 Oct 2023
Robust Offline Reinforcement learning with Heavy-Tailed Rewards
Jin Zhu
Runzhe Wan
Zhengling Qi
Shuang Luo
C. Shi
OffRL
40
0
0
28 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
34
6
0
28 Oct 2023
Multi Time Scale World Models
Vaisakh Shaj
Saleh Gholam Zadeh
Ozan Demir
L. R. Douat
Gerhard Neumann
AI4CE
28
3
0
27 Oct 2023
State-Action Similarity-Based Representations for Off-Policy Evaluation
Brahma S. Pavse
Josiah P. Hanna
OffRL
38
4
0
27 Oct 2023
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
Nicholas Corrado
Yu-Tao Qu
John U. Balis
Adam Labiosa
Josiah P. Hanna
OffRL
35
2
0
27 Oct 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
35
13
0
27 Oct 2023
CROP: Conservative Reward for Model-based Offline Policy Optimization
Hao Li
Xiaohu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
...
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Bo-Xian Yao
Zeng-Guang Hou
OffRL
35
2
0
26 Oct 2023
Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
Hongyu Zang
Xin-hui Li
Leiji Zhang
Yang Liu
Baigui Sun
Riashat Islam
Rémi Tachet des Combes
Romain Laroche
OffRL
29
5
0
26 Oct 2023
Finetuning Offline World Models in the Real World
Yunhai Feng
Nicklas Hansen
Ziyan Xiong
Chandramouli Rajagopalan
Xiaolong Wang
OffRL
OnRL
22
20
0
24 Oct 2023
Course Correcting Koopman Representations
Mahan Fathi
Clement Gehring
Jonathan Pilault
David Kanaa
Pierre-Luc Bacon
Ross Goroshin
29
1
0
23 Oct 2023
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
Chen Ye
Rui Yang
Quanquan Gu
Tong Zhang
OffRL
33
17
0
23 Oct 2023
Contrastive Preference Learning: Learning from Human Feedback without RL
Joey Hejna
Rafael Rafailov
Harshit S. Sikchi
Chelsea Finn
S. Niekum
W. B. Knox
Dorsa Sadigh
OffRL
27
50
0
20 Oct 2023
CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning
Liyiming Ke
Yunchu Zhang
Abhay Deshpande
S. Srinivasa
Abhishek Gupta
OffRL
27
12
0
19 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
41
15
0
19 Oct 2023
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Jianlan Luo
Perry Dong
Jeffrey Wu
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
36
18
0
18 Oct 2023
Adaptive Online Replanning with Diffusion Models
Siyuan Zhou
Yilun Du
Shun Zhang
Mengdi Xu
Yikang Shen
Wei Xiao
Dit-Yan Yeung
Chuang Gan
30
22
0
14 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
31
1
0
12 Oct 2023
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
S. Sontakke
Jesse Zhang
Sébastien M. R. Arnold
Karl Pertsch
Erdem Biyik
Dorsa Sadigh
Chelsea Finn
Laurent Itti
OffRL
27
66
0
11 Oct 2023
Score Regularized Policy Optimization through Diffusion Behavior
Huayu Chen
Cheng Lu
Zhengyi Wang
Hang Su
Jun Zhu
31
20
0
11 Oct 2023
f
f
f
-Policy Gradients: A General Framework for Goal Conditioned RL using
f
f
f
-Divergences
Siddhant Agarwal
Ishan Durugkar
Peter Stone
Amy Zhang
31
4
0
10 Oct 2023
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
41
20
0
10 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Wenzhuo Zhou
OffRL
36
4
0
10 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
33
6
0
10 Oct 2023
Memory-Consistent Neural Networks for Imitation Learning
Kaustubh Sridhar
Souradeep Dutta
Dinesh Jayaraman
James Weimer
Insup Lee
44
8
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
22
6
0
09 Oct 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
29
7
0
09 Oct 2023
Previous
1
2
3
...
7
8
9
...
17
18
19
Next