Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.07219
Cited By
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
15 April 2020
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"D4RL: Datasets for Deep Data-Driven Reinforcement Learning"
50 / 927 papers shown
Title
AED: Adaptable Error Detection for Few-shot Imitation Policy
Jia-Fong Yeh
Kuo-Han Hung
Pang-Chi Lo
Chi-Ming Chung
Tsung-Han Wu
Hung-Ting Su
Yi-Ting Chen
Winston H. Hsu
32
1
0
06 Feb 2024
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
Xixi Hu
Bo Liu
Xingchao Liu
Qiang Liu
36
11
0
06 Feb 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
19
9
0
06 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
47
17
0
05 Feb 2024
Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning
Abdelhakim Benechehab
Albert Thomas
Balázs Kégl
OffRL
35
2
0
05 Feb 2024
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Yixiang Shan
Zhengbang Zhu
Ting Long
Qifan Liang
Yi-Ju Chang
Weinan Zhang
Liang Yin
OffRL
42
1
0
05 Feb 2024
The Virtues of Pessimism in Inverse Reinforcement Learning
David Wu
Gokul Swamy
J. Andrew Bagnell
Zhiwei Steven Wu
Sanjiban Choudhury
33
0
0
04 Feb 2024
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Guanghe Li
Yixiang Shan
Zhengbang Zhu
Ting Long
Weinan Zhang
OffRL
28
9
0
04 Feb 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
Yifu Yuan
Jianye Hao
Yi Ma
Zibin Dong
Hebin Liang
Jinyi Liu
Zhixin Feng
Kai-Wen Zhao
Yan Zheng
OffRL
ALM
24
14
0
04 Feb 2024
Distilling LLMs' Decomposition Abilities into Compact Language Models
Denis Tarasov
Kumar Shridhar
SyDa
OffRL
LRM
48
2
0
02 Feb 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
34
10
0
01 Feb 2024
Context-Former: Stitching via Latent Conditioned Sequence Modeling
Ziqi Zhang
Jingzehua Xu
Jinxin Liu
Zifeng Zhuang
Donglin Wang
Miao Liu
Shuai Zhang
OffRL
48
4
0
29 Jan 2024
DiffuserLite: Towards Real-time Diffusion Planning
Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
83
15
0
27 Jan 2024
P2DT: Mitigating Forgetting in task-incremental Learning with progressive prompt Decision Transformer
Zhiyuan Wang
Xiaoyang Qu
Jing Xiao
Bokui Chen
Jianzong Wang
CLL
OffRL
18
1
0
22 Jan 2024
MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning
Mao Hong
Zhiyue Zhang
Yue Wu
Yan Xu
OffRL
48
0
0
21 Jan 2024
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View
Raj Ghugare
Matthieu Geist
Glen Berseth
Benjamin Eysenbach
OffRL
35
14
0
20 Jan 2024
Exploration and Anti-Exploration with Distributional Random Network Distillation
Kai Yang
Jian Tao
Jiafei Lyu
Xiu Li
40
15
0
18 Jan 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
26
0
0
18 Jan 2024
Learning from Sparse Offline Datasets via Conservative Density Estimation
Zhepeng Cen
Zuxin Liu
Zitong Wang
Yi-Fan Yao
Henry Lam
Ding Zhao
OffRL
28
7
0
16 Jan 2024
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
OffRL
43
2
0
11 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
22
9
0
06 Jan 2024
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning
Dohyeok Lee
Seung Han
Taehyun Cho
Jungwoo Lee
OffRL
28
2
0
06 Jan 2024
Simple Hierarchical Planning with Diffusion
Chang Chen
Fei Deng
Kenji Kawaguchi
Çağlar Gülçehre
Sungjin Ahn
OffRL
DiffM
40
24
0
05 Jan 2024
Policy-regularized Offline Multi-objective Reinforcement Learning
Qian Lin
Chao Yu
Zongkai Liu
Zifan Wu
OffRL
13
6
0
04 Jan 2024
GenH2R: Learning Generalizable Human-to-Robot Handover via Scalable Simulation, Demonstration, and Imitation
Zifan Wang
Junyu Chen
Ziqing Chen
Pengwei Xie
Rui Chen
Li Yi
34
9
0
01 Jan 2024
Self-supervised Pretraining for Decision Foundation Model: Formulation, Pipeline and Challenges
Xiaoqian Liu
Jianbin Jiao
Junge Zhang
OffRL
LRM
40
2
0
29 Dec 2023
Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?
Gunshi Gupta
Tim G. J. Rudner
R. McAllister
Adrien Gaidon
Y. Gal
OffRL
53
3
0
28 Dec 2023
RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems
Jiahong Zhou
Shunhui Mao
Guoliang Yang
Bo Tang
Qianlong Xie
Lebin Lin
Xingxing Wang
Dong Wang
35
8
0
27 Dec 2023
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Ziyue Li
Zhiwei Xu
Hao Chen
Yiqun Chen
Bin Zhang
Zhen Xiao
Junge Zhang
Jiangjin Yin
OffRL
16
8
0
26 Dec 2023
Explicit-Implicit Subgoal Planning for Long-Horizon Tasks with Sparse Reward
Fangyuan Wang
Anqing Duan
Peng Zhou
Shengzeng Huo
Guodong Guo
Chenguang Yang
D. Navarro-Alarcon
OffRL
VLM
33
0
0
25 Dec 2023
Critic-Guided Decision Transformer for Offline Reinforcement Learning
Yuanfu Wang
Chao Yang
Yinghong Wen
Yu Liu
Yu Qiao
OffRL
27
11
0
21 Dec 2023
OpenRL: A Unified Reinforcement Learning Framework
Shiyu Huang
Wentse Chen
Yiwen Sun
Fuqing Bie
Weijuan Tu
40
3
0
20 Dec 2023
Value Explicit Pretraining for Learning Transferable Representations
Kiran Lekkala
Henghui Bao
S. Sontakke
Laurent Itti
SSL
35
0
0
19 Dec 2023
CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
35
1
0
19 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
26
25
0
19 Dec 2023
Neural Network Approximation for Pessimistic Offline Reinforcement Learning
Di Wu
Yuling Jiao
Li Shen
Haizhao Yang
Xiliang Lu
OffRL
29
1
0
19 Dec 2023
Robot Crowd Navigation in Dynamic Environment with Offline Reinforcement Learning
Shuai Zhou
Hao Fu
Haodong He
Wei Liu
OffRL
37
0
0
18 Dec 2023
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Hao-Chu Lin
Hongqiu Wu
Jiaji Zhang
Yihao Sun
Junyin Ye
Yang Yu
24
2
0
17 Dec 2023
Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation
Girolamo Macaluso
Alessandro Sestini
Andrew D. Bagdanov
OffRL
OnRL
27
3
0
15 Dec 2023
HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement Learning Agents
Dániel Horváth
Jesús Bujalance Martín
Ferenc Gàbor Erdos
Z. Istenes
Fabien Moutarde
OffRL
28
1
0
14 Dec 2023
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang
Jie Liu
Chuming Li
Yazhe Niu
Yaodong Yang
Yu Liu
Wanli Ouyang
OffRL
OnRL
46
11
0
12 Dec 2023
Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills
Hongcai He
Anjie Zhu
Shuang Liang
Feiyu Chen
Jie Shao
OffRL
46
4
0
11 Dec 2023
Toward Open-ended Embodied Tasks Solving
William Wei Wang
Dongqi Han
Xufang Luo
Yifei Shen
Charles Ling
Boyu Wang
Dongsheng Li
AI4CE
10
5
0
10 Dec 2023
Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization
Takuya Hiraoka
OffRL
27
1
0
10 Dec 2023
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
84
10
0
10 Dec 2023
Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
OffRL
33
3
0
07 Dec 2023
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
Xiao-Yin Liu
Xiao-Hu Zhou
Guo-Tao Li
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
39
4
0
07 Dec 2023
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu
Rodrigo de Salvo Braz
Jalaj Bhandari
Daniel Jiang
Yi Wan
...
D. Korenkevych
Ürün Dogan
Frank Cheng
Zheng Wu
Wanqiao Xu
VLM
OffRL
OnRL
39
6
0
06 Dec 2023
MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment
Ziyan Wang
Yali Du
Yudi Zhang
Meng Fang
Biwei Huang
OffRL
31
1
0
06 Dec 2023
Diffused Task-Agnostic Milestone Planner
Mineui Hong
Minjae Kang
Songhwai Oh
21
6
0
06 Dec 2023
Previous
1
2
3
...
6
7
8
...
17
18
19
Next