ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.07219
  4. Cited By
D4RL: Datasets for Deep Data-Driven Reinforcement Learning

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

15 April 2020
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
    GP
    OffRL
ArXivPDFHTML

Papers citing "D4RL: Datasets for Deep Data-Driven Reinforcement Learning"

50 / 927 papers shown
Title
When is Offline Policy Selection Sample Efficient for Reinforcement
  Learning?
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
Vincent Liu
P. Nagarajan
Andrew Patterson
Martha White
OffRL
26
2
0
04 Dec 2023
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy
  Evaluation
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRL
29
9
0
30 Nov 2023
Goal-conditioned Offline Planning from Curious Exploration
Goal-conditioned Offline Planning from Curious Exploration
Marco Bagatella
Georg Martius
OffRL
21
1
0
28 Nov 2023
Replay across Experiments: A Natural Extension of Off-Policy RL
Replay across Experiments: A Natural Extension of Off-Policy RL
Dhruva Tirumala
Thomas Lampe
José Enrique Chen
Tuomas Haarnoja
Sandy Huang
...
Tim Hertweck
Leonard Hasenclever
Martin Riedmiller
N. Heess
Markus Wulfmeier
OffRL
37
8
0
27 Nov 2023
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline
  Reinforcement Learning
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning
Melrose Roderick
Gaurav Manek
Felix Berkenkamp
J. Zico Kolter
OffRL
OnRL
19
0
0
25 Nov 2023
Guided Flows for Generative Modeling and Decision Making
Guided Flows for Generative Modeling and Decision Making
Qinqing Zheng
Matt Le
Neta Shaul
Y. Lipman
Aditya Grover
Ricky T. Q. Chen
26
35
0
22 Nov 2023
RLIF: Interactive Imitation Learning as Reinforcement Learning
RLIF: Interactive Imitation Learning as Reinforcement Learning
Jianlan Luo
Perry Dong
Yuexiang Zhai
Yi Ma
Sergey Levine
OffRL
30
14
0
21 Nov 2023
Self-Supervised Curriculum Generation for Autonomous Reinforcement
  Learning without Task-Specific Knowledge
Self-Supervised Curriculum Generation for Autonomous Reinforcement Learning without Task-Specific Knowledge
Sang-Hyun Lee
Seung-Woo Seo
ODL
CLL
SSL
24
2
0
15 Nov 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
Supported Trust Region Optimization for Offline Reinforcement Learning
Yongyi Mao
Hongchang Zhang
Cheng Chen
Yi Tian Xu
Xiangyang Ji
OffRL
34
14
0
15 Nov 2023
Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations
Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations
Joey Hong
Sergey Levine
Anca Dragan
OffRL
LLMAG
42
24
0
09 Nov 2023
Accelerating Exploration with Unlabeled Prior Data
Accelerating Exploration with Unlabeled Prior Data
Qiyang Li
Jason Zhang
Dibya Ghosh
Amy Zhang
Sergey Levine
OffRL
OnRL
34
9
0
09 Nov 2023
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with
  Multi-Step On-Policy Optimization
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRL
OnRL
58
13
0
06 Nov 2023
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion
LocoMuJoCo: A Comprehensive Imitation Learning Benchmark for Locomotion
Firas Al-Hafez
Guoping Zhao
Jan Peters
Davide Tateo
21
20
0
04 Nov 2023
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit S. Sikchi
Rohan Chitnis
Ahmed Touati
A. Geramifard
Amy Zhang
S. Niekum
OffRL
31
7
0
03 Nov 2023
Offline Imitation from Observation via Primal Wasserstein State
  Occupancy Matching
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
26
0
0
02 Nov 2023
A Simple Solution for Offline Imitation from Observations and Examples
  with Possibly Incomplete Trajectories
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
32
5
0
02 Nov 2023
Rethinking Decision Transformer via Hierarchical Reinforcement Learning
Rethinking Decision Transformer via Hierarchical Reinforcement Learning
Yi Ma
Chenjun Xiao
Hebin Liang
Jianye Hao
OffRL
19
6
0
01 Nov 2023
Offline RL with Observation Histories: Analyzing and Improving Sample
  Complexity
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity
Joey Hong
Anca Dragan
Sergey Levine
OffRL
22
5
0
31 Oct 2023
Unleashing the Power of Pre-trained Language Models for Offline
  Reinforcement Learning
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Ruizhe Shi
Yuyao Liu
Yanjie Ze
Simon S. Du
Huazhe Xu
OffRL
RALM
34
18
0
31 Oct 2023
Contrastive Difference Predictive Coding
Contrastive Difference Predictive Coding
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TS
OffRL
28
11
0
31 Oct 2023
A Tractable Inference Perspective of Offline RL
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Guy Van den Broeck
Mathias Niepert
Yitao Liang
OffRL
34
1
0
31 Oct 2023
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic
  Detection of Infeasible Plans
Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans
Kyowoon Lee
Seongun Kim
Jaesik Choi
DiffM
32
9
0
30 Oct 2023
Free from Bellman Completeness: Trajectory Stitching via Model-based
  Return-conditioned Supervised Learning
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
Zhaoyi Zhou
Chuning Zhu
Runlong Zhou
Qiwen Cui
Abhishek Gupta
S. S. Du
OffRL
40
8
0
30 Oct 2023
Label Poisoning is All You Need
Label Poisoning is All You Need
Rishi Jha
J. Hayase
Sewoong Oh
AAML
22
28
0
29 Oct 2023
Robust Offline Reinforcement learning with Heavy-Tailed Rewards
Robust Offline Reinforcement learning with Heavy-Tailed Rewards
Jin Zhu
Runzhe Wan
Zhengling Qi
Shuang Luo
C. Shi
OffRL
40
0
0
28 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
34
6
0
28 Oct 2023
Multi Time Scale World Models
Multi Time Scale World Models
Vaisakh Shaj
Saleh Gholam Zadeh
Ozan Demir
L. R. Douat
Gerhard Neumann
AI4CE
28
3
0
27 Oct 2023
State-Action Similarity-Based Representations for Off-Policy Evaluation
State-Action Similarity-Based Representations for Off-Policy Evaluation
Brahma S. Pavse
Josiah P. Hanna
OffRL
38
4
0
27 Oct 2023
Guided Data Augmentation for Offline Reinforcement Learning and
  Imitation Learning
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
Nicholas Corrado
Yu-Tao Qu
John U. Balis
Adam Labiosa
Josiah P. Hanna
OffRL
35
2
0
27 Oct 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online
  Reinforcement Learning
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
35
13
0
27 Oct 2023
CROP: Conservative Reward for Model-based Offline Policy Optimization
CROP: Conservative Reward for Model-based Offline Policy Optimization
Hao Li
Xiaohu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
...
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Bo-Xian Yao
Zeng-Guang Hou
OffRL
35
2
0
26 Oct 2023
Understanding and Addressing the Pitfalls of Bisimulation-based
  Representations in Offline Reinforcement Learning
Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
Hongyu Zang
Xin-hui Li
Leiji Zhang
Yang Liu
Baigui Sun
Riashat Islam
Rémi Tachet des Combes
Romain Laroche
OffRL
29
5
0
26 Oct 2023
Finetuning Offline World Models in the Real World
Finetuning Offline World Models in the Real World
Yunhai Feng
Nicklas Hansen
Ziyan Xiong
Chandramouli Rajagopalan
Xiaolong Wang
OffRL
OnRL
22
20
0
24 Oct 2023
Course Correcting Koopman Representations
Course Correcting Koopman Representations
Mahan Fathi
Clement Gehring
Jonathan Pilault
David Kanaa
Pierre-Luc Bacon
Ross Goroshin
29
1
0
23 Oct 2023
Corruption-Robust Offline Reinforcement Learning with General Function
  Approximation
Corruption-Robust Offline Reinforcement Learning with General Function Approximation
Chen Ye
Rui Yang
Quanquan Gu
Tong Zhang
OffRL
33
17
0
23 Oct 2023
Contrastive Preference Learning: Learning from Human Feedback without RL
Contrastive Preference Learning: Learning from Human Feedback without RL
Joey Hejna
Rafael Rafailov
Harshit S. Sikchi
Chelsea Finn
S. Niekum
W. B. Knox
Dorsa Sadigh
OffRL
27
50
0
20 Oct 2023
CCIL: Continuity-based Data Augmentation for Corrective Imitation
  Learning
CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning
Liyiming Ke
Yunchu Zhang
Abhay Deshpande
S. Srinivasa
Abhishek Gupta
OffRL
27
12
0
19 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data
  Corruption
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
41
15
0
19 Oct 2023
Action-Quantized Offline Reinforcement Learning for Robotic Skill
  Learning
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Jianlan Luo
Perry Dong
Jeffrey Wu
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
36
18
0
18 Oct 2023
Adaptive Online Replanning with Diffusion Models
Adaptive Online Replanning with Diffusion Models
Siyuan Zhou
Yilun Du
Shun Zhang
Mengdi Xu
Yikang Shen
Wei Xiao
Dit-Yan Yeung
Chuang Gan
30
22
0
14 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
31
1
0
12 Oct 2023
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
S. Sontakke
Jesse Zhang
Sébastien M. R. Arnold
Karl Pertsch
Erdem Biyik
Dorsa Sadigh
Chelsea Finn
Laurent Itti
OffRL
27
66
0
11 Oct 2023
Score Regularized Policy Optimization through Diffusion Behavior
Score Regularized Policy Optimization through Diffusion Behavior
Huayu Chen
Cheng Lu
Zhengyi Wang
Hang Su
Jun Zhu
31
20
0
11 Oct 2023
$f$-Policy Gradients: A General Framework for Goal Conditioned RL using
  $f$-Divergences
fff-Policy Gradients: A General Framework for Goal Conditioned RL using fff-Divergences
Siddhant Agarwal
Ishan Durugkar
Peter Stone
Amy Zhang
31
4
0
10 Oct 2023
Boosting Continuous Control with Consistency Policy
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
41
20
0
10 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Bi-Level Offline Policy Optimization with Limited Exploration
Wenzhuo Zhou
OffRL
36
4
0
10 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based
  Reinforcement Learning
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
33
6
0
10 Oct 2023
Memory-Consistent Neural Networks for Imitation Learning
Memory-Consistent Neural Networks for Imitation Learning
Kaustubh Sridhar
Souradeep Dutta
Dinesh Jayaraman
James Weimer
Insup Lee
44
8
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement
  Learning
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
22
6
0
09 Oct 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline
  Reinforcement Learning
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
29
7
0
09 Oct 2023
Previous
123...789...171819
Next