Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2004.07219
Cited By
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
15 April 2020
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"D4RL: Datasets for Deep Data-Driven Reinforcement Learning"
50 / 927 papers shown
Title
TorchRL: A data-driven decision-making library for PyTorch
Albert Bou
Matteo Bettini
Sebastian Dittert
Vikash Kumar
Shagun Sodhani
Xiaomeng Yang
Gianni De Fabritiis
Vincent Moens
OffRL
AI4CE
27
37
0
01 Jun 2023
Improving Offline RL by Blending Heuristics
Sinong Geng
Aldo Pacchiano
Andrey Kolobov
Ching-An Cheng
OffRL
30
7
0
01 Jun 2023
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation
Yingyi Chen
Qinghua Tao
F. Tonin
Johan A. K. Suykens
42
19
0
31 May 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
21
7
0
30 May 2023
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffM
OffRL
38
90
0
29 May 2023
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Kang Xu
Chenjia Bai
Xiaoteng Ma
Dong Wang
Bingyan Zhao
Zhen Wang
Xuelong Li
Wei Li
37
14
0
28 May 2023
On the Value of Myopic Behavior in Policy Reuse
Kang Xu
Chenjia Bai
Shuang Qiu
Haoran He
Bin Zhao
Zhen Wang
Wei Li
Xuelong Li
32
1
0
28 May 2023
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem
Paul Barde
Jakob N. Foerster
Derek Nowrouzezahrai
Amy Zhang
OffRL
28
8
0
26 May 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer
Zhihui Xie
Zichuan Lin
Deheng Ye
Qiang Fu
Wei Yang
Shuai Li
OffRL
OnRL
51
22
0
26 May 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
Hao Liu
Pieter Abbeel
OffRL
38
25
0
26 May 2023
Coherent Soft Imitation Learning
Joe Watson
Sandy H. Huang
Nicholas Heess
32
11
0
25 May 2023
Beyond Reward: Offline Preference-guided Policy Optimization
Yachen Kang
Dingxu Shi
Jinxin Liu
Li He
Donglin Wang
OffRL
32
31
0
25 May 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Ya Zhang
OffRL
OnRL
36
19
0
25 May 2023
Inverse Preference Learning: Preference-based RL without a Reward Function
Joey Hejna
Dorsa Sadigh
OffRL
32
48
0
24 May 2023
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning
Q. Wang
Jun Yang
Yunbo Wang
Xin Jin
Wenjun Zeng
Xiaokang Yang
OffRL
OnRL
35
3
0
24 May 2023
OER: Offline Experience Replay for Continual Offline Reinforcement Learning
Sibo Gai
Donglin Wang
Li He
CLL
OffRL
48
3
0
23 May 2023
INVICTUS: Optimizing Boolean Logic Circuit Synthesis via Synergistic Learning and Search
A. B. Chowdhury
Marco Romanelli
Benjamin Tan
Ramesh Karri
S. Garg
10
2
0
22 May 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
29
6
0
22 May 2023
Multi-task Hierarchical Adversarial Inverse Reinforcement Learning
Jiayu Chen
Dipesh Tamboli
Tian-Shing Lan
Vaneet Aggarwal
31
12
0
22 May 2023
Diffusion Co-Policy for Synergistic Human-Robot Collaborative Tasks
Eley Ng
Ziang Liu
Monroe Kennedy
DiffM
31
22
0
20 May 2023
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
Wenhao Ding
Tong Che
Ding Zhao
Marco Pavone
BDL
OffRL
19
2
0
18 May 2023
Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning
Yinglun Xu
Gagandeep Singh
OffRL
AAML
34
3
0
18 May 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
33
37
0
16 May 2023
Prompt-Tuning Decision Transformer with Preference Ranking
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
30
14
0
16 May 2023
More for Less: Safe Policy Improvement With Stronger Performance Guarantees
Patrick Wienhoft
Marnix Suilen
T. D. Simão
Clemens Dubslaff
C. Baier
N. Jansen
OffRL
24
6
0
13 May 2023
Decentralized Governance for Virtual Community(DeGov4VC): Optimal Policy Design of Human-plant Symbiosis Co-creation
Yan Xiang
Qianhui Fan
Kejiang Qian
Jiajie Li
Yuying Tang
Ze-Feng Gao
12
2
0
11 May 2023
Explaining RL Decisions with Trajectories
Shripad Deshmukh
Arpan Dasgupta
Balaji Krishnamurthy
Nan Jiang
Chirag Agarwal
Georgios Theocharous
J. Subramanian
OffRL
28
3
0
06 May 2023
Federated Ensemble-Directed Offline Reinforcement Learning
Desik Rengarajan
N. Ragothaman
D. Kalathil
S. Shakkottai
OffRL
32
1
0
04 May 2023
Masked Trajectory Models for Prediction, Representation, and Control
Philipp Wu
Arjun Majumdar
Kevin Stone
Yixin Lin
Igor Mordatch
Pieter Abbeel
Aravind Rajeswaran
OffRL
36
38
0
04 May 2023
Toward Evaluating Robustness of Reinforcement Learning with Adversarial Policy
Jiawei Zhao
Xingjun Ma
Florian Schäfer
Xinyu Wang
Anima Anandkumar
Cong Wang
AAML
26
1
0
04 May 2023
Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL
Baiting Zhu
Meihua Dang
Aditya Grover
OffRL
74
23
0
30 Apr 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
13
0
26 Apr 2023
A Control-Centric Benchmark for Video Prediction
Stephen Tian
Chelsea Finn
Jiajun Wu
42
10
0
26 Apr 2023
Dynamic Datasets and Market Environments for Financial Reinforcement Learning
Xiao-Yang Liu
Ziyi Xia
Hongyang Yang
Jiechao Gao
Daochen Zha
Ming Zhu
Chris Wang
Zhaoran Wang
Jian Guo
OffRL
32
27
0
25 Apr 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
Cheng Lu
Huayu Chen
Jianfei Chen
Hang Su
Chongxuan Li
Jun Zhu
DiffM
OffRL
27
58
0
25 Apr 2023
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Philippe Hansen-Estruch
Ilya Kostrikov
Michael Janner
J. Kuba
Sergey Levine
OffRL
34
130
0
20 Apr 2023
CASOG: Conservative Actor-critic with SmOoth Gradient for Skill Learning in Robot-Assisted Intervention
Hao Li
Xiao-Hu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Z. Hou
OffRL
16
11
0
19 Apr 2023
Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions
Lina Mezghani
Piotr Bojanowski
Alahari Karteek
Sainbayar Sukhbaatar
LM&Ro
OffRL
LRM
21
8
0
18 Apr 2023
Affordances from Human Videos as a Versatile Representation for Robotics
Shikhar Bahl
Russell Mendonca
Lili Chen
Unnat Jain
Deepak Pathak
50
164
0
17 Apr 2023
Hyper-Decision Transformer for Efficient Online Policy Adaptation
Mengdi Xu
Yuchen Lu
Songlin Yang
Shun Zhang
Ding Zhao
Chuang Gan
OffRL
31
39
0
17 Apr 2023
Reinforcement Learning from Passive Data via Latent Intentions
Dibya Ghosh
Chethan Bhateja
Sergey Levine
OffRL
26
43
0
10 Apr 2023
Uncertainty-driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning
Junjie Zhang
Jiafei Lyu
Xiaoteng Ma
Jiangpeng Yan
Jun Yang
Le Wan
Xiu Li
OffRL
24
5
0
10 Apr 2023
RoboPianist: Dexterous Piano Playing with Deep Reinforcement Learning
Kevin Zakka
Philipp Wu
Laura M. Smith
Nimrod Gileadi
Taylor A. Howell
...
Sumeet Singh
Yuval Tassa
Pete Florence
Andy Zeng
Pieter Abbeel
21
32
0
09 Apr 2023
CRISP: Curriculum inducing Primitive Informed Subgoal Prediction
Utsav Singh
Vinay P. Namboodiri
31
3
0
07 Apr 2023
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Tongzhou Wang
Antonio Torralba
Phillip Isola
Amy Zhang
OffRL
32
33
0
03 Apr 2023
Chain-of-Thought Predictive Control
Zhiwei Jia
Vineet Thumuluri
Fangchen Liu
Ling-Hao Chen
Zhiao Huang
H. Su
LM&Ro
39
20
0
03 Apr 2023
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions
Yicheng Luo
Jackie Kay
Edward Grefenstette
M. Deisenroth
OffRL
OnRL
27
15
0
30 Mar 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
28
16
0
30 Mar 2023
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations
Li Haofeng
C. Yiwen
Tan Jiayi
Marcelo H. Ang Jr
OffRL
20
1
0
29 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
36
73
0
28 Mar 2023
Previous
1
2
3
...
10
11
12
...
17
18
19
Next