D4RL: Datasets for Deep Data-Driven Reinforcement Learning

15 April 2020
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
    GP
    OffRL

Papers citing "D4RL: Datasets for Deep Data-Driven Reinforcement Learning"

50 / 927 papers shown
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL
OnRL
38
10
0
15 Sep 2022
Can Offline Reinforcement Learning Help Natural Language Understanding?
Ziqi Zhang
Yile Wang
Yue Zhang
Donglin Wang
OffRL
33
0
0
15 Sep 2022
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
Taku Yamagata
Ahmed Khalil
Raúl Santos-Rodríguez
OffRL
160
72
0
08 Sep 2022
MO2: Model-Based Offline Options
Sasha Salter
Markus Wulfmeier
Dhruva Tirumala
N. Heess
Martin Riedmiller
R. Hadsell
Dushyant Rao
OffRL
16
13
0
05 Sep 2022
Some Supervision Required: Incorporating Oracle Policies in Reinforcement Learning via Epistemic Uncertainty Metrics
Jun Jet Tai
Jordan Terry
M. Innocente
J. Brusey
N. Horri
19
1
0
22 Aug 2022
Efficient Planning in a Compact Latent Action Space
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktäschel
Edward Grefenstette
Yuandong Tian
OffRL
21
36
0
22 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning
Tongzheng Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
40
27
0
19 Aug 2022
MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control
Nolan Wagener
Andrey Kolobov
Felipe Vieira Frujeri
Ricky Loynd
Ching-An Cheng
Matthew J. Hausknecht
19
21
0
15 Aug 2022
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
Zhendong Wang
Jonathan J. Hunt
Mingyuan Zhou
OffRL
24
333
0
12 Aug 2022
Robust Reinforcement Learning using Offline Data
Kishan Panaganti
Zaiyan Xu
D. Kalathil
Mohammad Ghavamzadeh
OffRL
34
66
0
10 Aug 2022
AdaCat: Adaptive Categorical Discretization for Autoregressive Models
Qiyang Li
Ajay Jain
Pieter Abbeel
OffRL
45
4
0
03 Aug 2022
A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning
Z. Bukhsh
N. Jansen
Hajo Molegraaf
OffRL
AI4CE
17
6
0
01 Aug 2022
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
23
2
0
31 Jul 2022
Offline Reinforcement Learning at Multiple Frequencies
Kaylee Burns
Tianhe Yu
Chelsea Finn
Karol Hausman
OffRL
22
6
0
26 Jul 2022
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
Adam R. Villaflor
Zheng Huang
Swapnil Pande
John M. Dolan
J. Schneider
OffRL
25
23
0
21 Jul 2022
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
26
66
0
20 Jul 2022
Skill-based Model-based Reinforcement Learning
Lu Shi
Joseph J. Lim
Youngwoon Lee
24
44
0
15 Jul 2022
Making Linear MDPs Practical via Contrastive Representation Learning
Tianjun Zhang
Tongzheng Ren
Mengjiao Yang
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
25
44
0
14 Jul 2022
GriddlyJS: A Web IDE for Reinforcement Learning
C. Bamford
Minqi Jiang
Mikayel Samvelyan
Tim Rocktäschel
OnRL
38
4
0
13 Jul 2022
Offline RL Policies Should be Trained to be Adaptive
Dibya Ghosh
Anurag Ajay
Pulkit Agrawal
Sergey Levine
OffRL
35
45
0
05 Jul 2022
An Empirical Study of Implicit Regularization in Deep Offline RL
Çağlar Gülçehre
Srivatsan Srinivasan
Jakub Sygnowski
Georg Ostrovski
Mehrdad Farajtabar
Matt Hoffman
Razvan Pascanu
Arnaud Doucet
OffRL
14
16
0
05 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
30
36
0
03 Jul 2022
Offline Policy Optimization with Eligible Actions
Yao Liu
Yannis Flet-Berliac
Emma Brunskill
OffRL
25
5
0
01 Jul 2022
Discriminator-Guided Model-Based Offline Imitation Learning
Wenjia Zhang
Haoran Xu
Haoyi Niu
Peng Cheng
Ming Li
Heming Zhang
Guyue Zhou
Xianyuan Zhan
OffRL
14
16
0
01 Jul 2022
When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
Haoyi Niu
Shubham Sharma
Yiwen Qiu
Ming Li
Guyue Zhou
Jianming Hu
Xianyuan Zhan
OffRL
OnRL
27
46
0
27 Jun 2022
Behavior Transformers: Cloning $k$ modes with one stone
Nur Muhammad (Mahi) Shafiullah
Zichen Jeff Cui
Ariuntuya Altanzaya
Lerrel Pinto
OffRL
28
221
0
22 Jun 2022
Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis
Shayegan Omidshafiei
A. Kapishnikov
Yannick Assogba
Lucas Dixon
Been Kim
OffRL
36
5
0
17 Jun 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
48
348
0
17 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
13
4
0
17 Jun 2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
Tianhao Wu
Shengjie Wang
Xidong Feng
Jiechuan Jiang
...
Yiran Geng
Hao Dong
Zongqing Lu
Song-Chun Zhu
Yaodong Yang
OffRL
38
108
0
17 Jun 2022
Bootstrapped Transformer for Offline Reinforcement Learning
Kerong Wang
Hanye Zhao
Xufang Luo
Kan Ren
Weinan Zhang
Dongsheng Li
OffRL
16
37
0
17 Jun 2022
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination
Jiafei Lyu
Xiu Li
Zongqing Lu
OffRL
29
25
0
16 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
25
139
0
15 Jun 2022
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning
Shentao Yang
Yihao Feng
Shujian Zhang
Mi Zhou
OffRL
34
12
0
14 Jun 2022
Adversarial Counterfactual Environment Model Learning
Xiong-Hui Chen
Yang Yu
Zhenghong Zhu
Zhihua Yu
Zhen-Yu Chen
...
Yinan Wu
Hongqiu Wu
Rongjun Qin
Rui Ding
Fangsheng Huang
CML
OffRL
15
12
0
10 Jun 2022
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
29
52
0
09 Jun 2022
Mildly Conservative Q-Learning for Offline Reinforcement Learning
Jiafei Lyu
Xiaoteng Ma
Xiu Li
Zongqing Lu
OffRL
34
101
0
09 Jun 2022
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning
Deyao Zhu
Erran L. Li
Mohamed Elhoseiny
OffRL
34
8
0
09 Jun 2022
Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance
Jakob J. Hollenstein
Sayantan Auddy
Matteo Saveriano
Erwan Renaudo
J. Piater
33
17
0
08 Jun 2022
On the Role of Discount Factor in Offline Reinforcement Learning
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
29
18
0
07 Jun 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Rui Yang
Chenjia Bai
Xiaoteng Ma
Zhaoran Wang
Chongjie Zhang
Lei Han
OffRL
29
74
0
06 Jun 2022
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
125
102
0
05 Jun 2022
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning
Xuefeng Jin
Xu-Hui Liu
Shengyi Jiang
Yang Yu
OffRL
31
4
0
04 Jun 2022
When does return-conditioned supervised learning work for offline reinforcement learning?
David Brandfonbrener
A. Bietti
Jacob Buckman
Romain Laroche
Joan Bruna
OffRL
27
60
0
02 Jun 2022
Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL
Wonjoon Goo
S. Niekum
OffRL
30
20
0
01 Jun 2022
Model Generation with Provable Coverability for Offline Reinforcement Learning
Chengxing Jia
Hao Yin
Chenxiao Gao
Tian Xu
Lei Yuan
Zongzhang Zhang
Yang Yu
OffRL
20
0
0
01 Jun 2022
You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments
Keiran Paster
Sheila A. McIlraith
Jimmy Ba
OffRL
167
27
0
31 May 2022
Non-Markovian policies occupancy measures
Romain Laroche
Rémi Tachet des Combes
Jacob Buckman
OffRL
34
1
0
27 May 2022
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters
Seyed Kamyar Seyed Ghasemipour
S. Gu
Ofir Nachum
OffRL
25
69
0
27 May 2022
Training and Inference on Any-Order Autoregressive Models the Right Way
Andy Shih
Dorsa Sadigh
Stefano Ermon
BDL
TPM
OOD
CML
35
23
0
26 May 2022