Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.07219
Cited By
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
15 April 2020
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"D4RL: Datasets for Deep Data-Driven Reinforcement Learning"
50 / 927 papers shown
Title
Identifying Selections for Unsupervised Subtask Discovery
Yiwen Qiu
Yujia Zheng
Anton van den Hengel
32
0
0
28 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
36
0
0
27 Oct 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
50
8
0
26 Oct 2024
Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Hai Zhong
Xun Wang
Zhuoran Li
Longbo Huang
OffRL
OnRL
29
0
0
25 Oct 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
48
6
0
25 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
57
0
0
23 Oct 2024
RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space
Jingdi Chen
Hanhan Zhou
Yongsheng Mei
Carlee Joe-Wong
Gina Adam
Nathaniel D. Bastian
Tian-Shing Lan
OffRL
30
0
0
21 Oct 2024
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
H. Chen
Yi-Ju Chang
Dacheng Tao
Lichao Sun
OffRL
36
0
0
21 Oct 2024
Vision-Language Navigation with Energy-Based Policy
Rui Liu
Wenguan Wang
Yuqing Yang
40
3
0
18 Oct 2024
Latent Weight Diffusion: Generating Policies from Trajectories
Shashank Hegde
G. Salhotra
Gaurav Sukhatme
25
0
0
17 Oct 2024
An Evolved Universal Transformer Memory
Edoardo Cetin
Qi Sun
Tianyu Zhao
Yujin Tang
146
0
0
17 Oct 2024
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto
Oier Mees
Aviral Kumar
Sergey Levine
OffRL
79
13
0
17 Oct 2024
Off-dynamics Conditional Diffusion Planners
Wen Zheng Terence Ng
Jianda Chen
Tianwei Zhang
DiffM
OffRL
35
0
0
16 Oct 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Zhi Wang
Li Lyna Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
37
6
0
15 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
30
1
0
15 Oct 2024
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
Jiayu Chen
Wentse Chen
Jeff Schneider
OffRL
31
1
0
15 Oct 2024
Zero-Shot Offline Imitation Learning via Optimal Transport
Thomas Rupf
Marco Bagatella
Nico Gürtler
Jonas Frey
Georg Martius
OffRL
133
0
0
11 Oct 2024
Diffusion Imitation from Observation
Bo-Ruei Huang
Chun-Kai Yang
Chun-Mao Lai
Dai-Jie Wu
Shao-Hua Sun
39
4
0
07 Oct 2024
Diffusion Model Predictive Control
Guangyao Zhou
Sivaramakrishnan Swaminathan
Rajkumar Vasudeva Raju
J. S. Guntupalli
Wolfgang Lehrach
Joseph Ortiz
Antoine Dedieu
Miguel Lázaro-Gredilla
Kevin P. Murphy
31
6
0
07 Oct 2024
Active Fine-Tuning of Generalist Policies
Marco Bagatella
Jonas Hübotter
Georg Martius
Andreas Krause
32
0
0
07 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
42
0
0
07 Oct 2024
Robust Offline Imitation Learning from Diverse Auxiliary Data
Udita Ghosh
Dripta S. Raychaudhuri
Jiachen Li
Konstantinos Karydis
A. Roy-Chowdhury
OffRL
24
1
0
04 Oct 2024
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization
Tung M. Luu
Thanh Nguyen
Tee Joshua Tian Jin
Sungwoon Kim
Chang D. Yoo
AAML
28
0
0
04 Oct 2024
Predictive Coding for Decision Transformer
Tung M. Luu
Donghoon Lee
Chang D. Yoo
OffRL
58
2
0
04 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
46
4
0
03 Oct 2024
Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks
Zeyu Feng
Hao Luan
Kevin Yuchen Ma
Harold Soh
32
2
0
03 Oct 2024
Were RNNs All We Needed?
Leo Feng
Frederick Tung
Mohamed Osama Ahmed
Yoshua Bengio
Hossein Hajimirsadegh
AI4TS
29
14
1
02 Oct 2024
Contrastive Abstraction for Reinforcement Learning
Vihang Patil
M. Hofmarcher
Elisabeth Rumetshofer
Sepp Hochreiter
OffRL
SSL
24
2
0
01 Oct 2024
RAIL: Reachability-Aided Imitation Learning for Safe Policy Execution
Wonsuhk Jung
Dennis Anthony
Utkarsh Aashu Mishra
Nadun Ranawaka Arachchige
Matthew Bronars
Danfei Xu
Shreyas Kousik
31
0
0
28 Sep 2024
DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors
Joseph Ortiz
Antoine Dedieu
Wolfgang Lehrach
Swaroop Guntupalli
Carter Wendelken
Ahmad Humayun
Guangyao Zhou
Sivaramakrishnan Swaminathan
Miguel Lázaro-Gredilla
Kevin P. Murphy
OffRL
49
1
0
26 Sep 2024
COSBO: Conservative Offline Simulation-Based Policy Optimization
E. Kargar
Ville Kyrki
OffRL
33
0
0
22 Sep 2024
Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning
Claude Formanek
Louise Beyers
C. Tilbury
Jonathan P. Shock
Arnu Pretorius
OffRL
34
0
0
18 Sep 2024
KAN v.s. MLP for Offline Reinforcement Learning
Haihong Guo
Fengxin Li
Jiao Li
Hongyan Liu
OffRL
33
0
0
15 Sep 2024
xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing
Haoyi Niu
Qimao Chen
Tenglong Liu
Jianxiong Li
Guyue Zhou
Yi Zhang
Jianming Hu
Xianyuan Zhan
34
0
0
13 Sep 2024
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
Teng Yan
Zhendong Ruan
Yaobang Cai
Yu Han
Wenxian Li
Yang Zhang
OffRL
33
0
0
12 Sep 2024
The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov
Anja Surina
Çağlar Gülçehre
OffRL
AI4CE
53
1
0
11 Sep 2024
Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence
Luo Ji
Runji Lin
OffRL
AI4CE
LM&Ro
26
0
0
11 Sep 2024
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention
Wenhao Zhao
Qiushui Xu
Linjie Xu
Lei Song
Jinyu Wang
Chunlai Zhou
Jiang Bian
34
0
0
11 Sep 2024
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Zhao Shan
Chenyou Fan
Shuang Qiu
Jiyuan Shi
Chenjia Bai
40
4
0
09 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
40
1
0
07 Sep 2024
The Prevalence of Neural Collapse in Neural Multivariate Regression
George Andriopoulos
Zixuan Dong
Li Guo
Zifan Zhao
Keith Ross
47
3
0
06 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
34
0
0
06 Sep 2024
Diffusion Policy Policy Optimization
Allen Z. Ren
Justin Lidard
Lars L. Ankile
Anthony Simeonov
Pulkit Agrawal
Anirudha Majumdar
Benjamin Burchfiel
Hongkai Dai
Max Simchowitz
45
36
0
01 Sep 2024
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
53
3
0
27 Aug 2024
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning
Zhongjian Qiao
Jiafei Lyu
Kechen Jiao
Qi Liu
Xiu Li
OffRL
35
4
0
23 Aug 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
46
1
0
23 Aug 2024
Domain Adaptation for Offline Reinforcement Learning with Limited Samples
Weiqin Chen
Sandipan Mishra
Santiago Paternain
OffRL
40
2
0
22 Aug 2024
Offline Policy Learning via Skill-step Abstraction for Long-horizon Goal-Conditioned Tasks
Donghoon Kim
Minjong Yoo
Honguk Woo
OffRL
19
0
0
21 Aug 2024
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
Yi Zhao
Le Chen
Jan Schneider
Quankai Gao
Arno Solin
Bernhard Scholkopf
Joni Pajarinen
Le Chen
30
1
0
20 Aug 2024
Offline Model-Based Reinforcement Learning with Anti-Exploration
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
51
0
0
20 Aug 2024
Previous
1
2
3
4
5
6
...
17
18
19
Next