ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.07219
  4. Cited By
D4RL: Datasets for Deep Data-Driven Reinforcement Learning

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

15 April 2020
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
    GP
    OffRL
ArXivPDFHTML

Papers citing "D4RL: Datasets for Deep Data-Driven Reinforcement Learning"

50 / 927 papers shown
Title
The Impact of Data Distribution on Q-learning with Function
  Approximation
The Impact of Data Distribution on Q-learning with Function Approximation
Pedro P. Santos
Diogo S. Carvalho
A. Sardinha
Francisco S. Melo
OffRL
11
2
0
23 Nov 2021
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement
  Learning with Actor Rectification
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
L. Pan
Longbo Huang
Tengyu Ma
Huazhe Xu
OffRL
OnRL
27
51
0
22 Nov 2021
Generalized Decision Transformer for Offline Hindsight Information
  Matching
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
13
99
0
19 Nov 2021
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning
Robert Kirk
Amy Zhang
Edward Grefenstette
Tim Rocktaschel
OffRL
17
155
0
18 Nov 2021
Improving Learning from Demonstrations by Learning from Experience
Improving Learning from Demonstrations by Learning from Experience
Hao-Kang Liu
Yiwen Chen
Jiayi Tan
M. Ang
OffRL
14
1
0
16 Nov 2021
Exploiting Action Impact Regularity and Exogenous State Variables for
  Offline Reinforcement Learning
Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning
Vincent Liu
James Wright
Martha White
OffRL
31
1
0
15 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
M. Tomizuka
Wei Zhan
OffRL
11
21
0
09 Nov 2021
A Dataset Perspective on Offline Reinforcement Learning
A Dataset Perspective on Offline Reinforcement Learning
Kajetan Schweighofer
Andreas Radler
Marius-Constantin Dinu
M. Hofmarcher
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
OffRL
25
17
0
08 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
60
100
0
06 Nov 2021
B-Pref: Benchmarking Preference-Based Reinforcement Learning
B-Pref: Benchmarking Preference-Based Reinforcement Learning
Kimin Lee
Laura M. Smith
Anca Dragan
Pieter Abbeel
OffRL
31
93
0
04 Nov 2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement
  Learning
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Sabela Ramos
Sertan Girgin
Léonard Hussenot
Damien Vincent
Hanna Yakubovich
...
Piotr Stańczyk
Raphaël Marinier
Jeremiah Harmsen
Olivier Pietquin
Nikola Momchev
OffRL
30
23
0
04 Nov 2021
Curriculum Offline Imitation Learning
Curriculum Offline Imitation Learning
Minghuan Liu
Hanye Zhao
Zhengyu Yang
Jian Shen
Weinan Zhang
Li Zhao
Tie-Yan Liu
OffRL
16
1
0
03 Nov 2021
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of
  Dynamics
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics
Matthias Weissenbacher
Samarth Sinha
Animesh Garg
Yoshinobu Kawahara
OffRL
22
20
0
02 Nov 2021
Accelerating Robotic Reinforcement Learning via Parameterized Action
  Primitives
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives
Murtaza Dalal
Deepak Pathak
Ruslan Salakhutdinov
27
90
0
28 Oct 2021
URLB: Unsupervised Reinforcement Learning Benchmark
URLB: Unsupervised Reinforcement Learning Benchmark
Michael Laskin
Denis Yarats
Hao Liu
Kimin Lee
Albert Zhan
Kevin Lu
Catherine Cang
Lerrel Pinto
Pieter Abbeel
SSL
OffRL
30
132
0
28 Oct 2021
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
Mengjiao Yang
Sergey Levine
Ofir Nachum
OffRL
41
42
0
27 Oct 2021
Towards Hyperparameter-free Policy Selection for Offline Reinforcement
  Learning
Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning
Siyuan Zhang
Nan Jiang
OffRL
11
39
0
26 Oct 2021
False Correlation Reduction for Offline Reinforcement Learning
False Correlation Reduction for Offline Reinforcement Learning
Arvindkumar Krishnakumar
Zuyue Fu
Lingxiao Wang
Zhuoran Yang
Chenjia Bai
Tianyi Zhou
Judy Hoffman
Jing Jiang
OffRL
39
9
0
24 Oct 2021
Offline Reinforcement Learning with Value-based Episodic Memory
Offline Reinforcement Learning with Value-based Episodic Memory
Xiaoteng Ma
Yiqin Yang
Haotian Hu
Qihan Liu
Jun Yang
Chongjie Zhang
Qianchuan Zhao
Bin Liang
OffRL
24
42
0
19 Oct 2021
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender
  System
RL4RS: A Real-World Dataset for Reinforcement Learning based Recommender System
Kai Wang
Zhene Zou
Minghao Zhao
Qilin Deng
Yue Shang
Yile Liang
Runze Wu
Xudong Shen
Tangjie Lyu
Changjie Fan
OffRL
23
9
0
18 Oct 2021
Towards Instance-Optimal Offline Reinforcement Learning with Pessimism
Towards Instance-Optimal Offline Reinforcement Learning with Pessimism
Ming Yin
Yu-Xiang Wang
OffRL
24
82
0
17 Oct 2021
Value Penalized Q-Learning for Recommender Systems
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
50
20
0
15 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
21
31
0
14 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
843
0
12 Oct 2021
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise
  Datasets
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets
J. E. Grigsby
Yanjun Qi
OffRL
16
5
0
10 Oct 2021
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior
  Engineering beyond Reward Maximization
Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization
S. Gu
Manfred Diaz
Daniel Freeman
Hiroki Furuta
Seyed Kamyar Seyed Ghasemipour
Anton Raichuk
Byron David
Erik Frey
Erwin Coumans
Olivier Bachem
38
14
0
10 Oct 2021
TiKick: Towards Playing Multi-agent Football Full Games from
  Single-agent Demonstrations
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
Shiyu Huang
Wenze Chen
Longfei Zhang
Shizhen Xu
Ziyang Li
Fengming Zhu
Deheng Ye
Tingling Chen
Jun Zhu
OffRL
37
25
0
09 Oct 2021
Showing Your Offline Reinforcement Learning Work: Online Evaluation
  Budget Matters
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
18
24
0
08 Oct 2021
Revisiting Design Choices in Offline Model-Based Reinforcement Learning
Revisiting Design Choices in Offline Model-Based Reinforcement Learning
Cong Lu
Philip J. Ball
Jack Parker-Holder
Michael A. Osborne
Stephen J. Roberts
OffRL
29
53
0
08 Oct 2021
Augmenting Reinforcement Learning with Behavior Primitives for Diverse
  Manipulation Tasks
Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks
Soroush Nasiriany
Huihan Liu
Yuke Zhu
82
109
0
07 Oct 2021
Offline RL With Resource Constrained Online Deployment
Offline RL With Resource Constrained Online Deployment
Jayanth Reddy Regatti
A. Deshmukh
Frank Cheng
Young Hun Jung
Abhishek Gupta
Ürün Dogan
OffRL
13
2
0
07 Oct 2021
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Wonjoon Goo
S. Niekum
OffRL
35
8
0
05 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified
  Q-Ensemble
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
105
262
0
04 Oct 2021
BRAC+: Improved Behavior Regularized Actor Critic for Offline
  Reinforcement Learning
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
41
15
0
02 Oct 2021
Offline Reinforcement Learning with Reverse Model-based Imagination
Offline Reinforcement Learning with Reverse Model-based Imagination
Jianhao Wang
Wenzhe Li
Haozhe Jiang
Guangxiang Zhu
Siyuan Li
Chongjie Zhang
OffRL
101
59
0
01 Oct 2021
Dual Behavior Regularized Reinforcement Learning
Dual Behavior Regularized Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
OffRL
15
1
0
19 Sep 2021
Accelerating Offline Reinforcement Learning Application in Real-Time
  Bidding and Recommendation: Potential Use of Simulation
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation
Haruka Kiyohara
K. Kawakami
Yuta Saito
OffRL
24
12
0
17 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
32
77
0
16 Sep 2021
DCUR: Data Curriculum for Teaching via Samples with Reinforcement
  Learning
DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning
Daniel Seita
Abhinav Gopal
Zhao Mandi
John F. Canny
OffRL
OnRL
21
0
0
15 Sep 2021
Video2Skill: Adapting Events in Demonstration Videos to Skills in an
  Environment using Cyclic MDP Homomorphisms
Video2Skill: Adapting Events in Demonstration Videos to Skills in an Environment using Cyclic MDP Homomorphisms
S. Sontakke
Sumegh Roychowdhury
Mausoom Sarkar
Nikaash Puri
Balaji Krishnamurthy
Laurent Itti
34
1
0
08 Sep 2021
Implicit Behavioral Cloning
Implicit Behavioral Cloning
Peter R. Florence
Corey Lynch
Andy Zeng
Oscar Ramirez
Ayzaan Wahid
Laura Downs
Adrian S. Wong
Johnny Lee
Igor Mordatch
Jonathan Tompson
OffRL
49
368
0
01 Sep 2021
Optimal Actor-Critic Policy with Optimized Training Datasets
Optimal Actor-Critic Policy with Optimized Training Datasets
C. Banerjee
Zhiyong Chen
N. Noman
M. Zamani
OffRL
17
7
0
16 Aug 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from
  Human Feedback
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
13
42
0
11 Aug 2021
What Matters in Learning from Offline Human Demonstrations for Robot
  Manipulation
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
161
472
0
06 Aug 2021
Policy Gradients Incorporating the Future
Policy Gradients Incorporating the Future
David Venuto
Elaine Lau
Doina Precup
Ofir Nachum
OffRL
11
9
0
04 Aug 2021
A Pragmatic Look at Deep Imitation Learning
A Pragmatic Look at Deep Imitation Learning
Kai Arulkumaran
D. Lillrank
26
9
0
04 Aug 2021
Offline Decentralized Multi-Agent Reinforcement Learning
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang
Zongqing Lu
OffRL
22
37
0
04 Aug 2021
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale
  Demonstrations
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations
Tongzhou Mu
Z. Ling
Fanbo Xiang
Derek Yang
Xuanlin Li
Stone Tao
Zhiao Huang
Zhiwei Jia
Hao Su
39
130
0
30 Jul 2021
Demonstration-Guided Reinforcement Learning with Learned Skills
Demonstration-Guided Reinforcement Learning with Learned Skills
Karl Pertsch
Youngwoon Lee
Yue Wu
Joseph J. Lim
OffRL
21
82
0
21 Jul 2021
Offline Preference-Based Apprenticeship Learning
Offline Preference-Based Apprenticeship Learning
Daniel Shin
Daniel S. Brown
Anca D. Dragan
OffRL
22
17
0
20 Jul 2021
Previous
123...16171819
Next