ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.07219
  4. Cited By
D4RL: Datasets for Deep Data-Driven Reinforcement Learning

D4RL: Datasets for Deep Data-Driven Reinforcement Learning

15 April 2020
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
    GP
    OffRL
ArXivPDFHTML

Papers citing "D4RL: Datasets for Deep Data-Driven Reinforcement Learning"

50 / 927 papers shown
Title
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yixuan Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL
LLMAG
LM&Ro
LRM
19
0
0
15 May 2025
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Ningyuan Yang
Jiaxuan Gao
Feng Gao
Yi Wu
Chao Yu
31
0
0
15 May 2025
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
Minh Hoang Nguyen
Linh Le Pham Van
Thommen George Karimpanal
Sunil Gupta
Hung Le
OffRL
LRM
32
0
0
14 May 2025
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model
George Andriopoulos
Soyuj Jung Basnet
Juan Guevara
Li Guo
Keith Ross
27
0
0
14 May 2025
Symbolically-Guided Visual Plan Inference from Uncurated Video Data
Symbolically-Guided Visual Plan Inference from Uncurated Video Data
Wenyan Yang
Ahmet Tikna
Yi Zhao
Yuying Zhang
Luigi Palopoli
Marco Roveri
Joni Pajarinen
VGen
26
0
0
13 May 2025
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
Yi Zhang
Ruihong Qiu
Xuwei Xu
Jiajun Liu
Sen Wang
OffRL
31
0
0
12 May 2025
CHD: Coupled Hierarchical Diffusion for Long-Horizon Tasks
CHD: Coupled Hierarchical Diffusion for Long-Horizon Tasks
Ce Hao
Anxing Xiao
Zhiwei Xue
Harold Soh
46
0
0
12 May 2025
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
Minting Pan
Yitao Zheng
J. Li
Yunbo Wang
Xiaokang Yang
OffRL
48
0
0
10 May 2025
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
Xuyang Chen
Keyu Yan
Lin Zhao
OffRL
51
0
0
08 May 2025
CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations
CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations
Anthony Liang
Pavel Czempin
Matthew Hong
Yutai Zhou
Erdem Biyik
Stephen Tu
47
0
0
08 May 2025
GRAML: Dynamic Goal Recognition As Metric Learning
GRAML: Dynamic Goal Recognition As Metric Learning
Matan Shamir
Reuth Mirsky
26
0
0
06 May 2025
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu
Sili Huang
Z. Yang
Shengchao Hu
Li Shen
H. Chen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
OffRL
149
0
0
03 May 2025
Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings
Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings
Andreas Sochopoulos
Nikolay Malkin
Nikolaos Tsagkas
João Moura
Michael Gienger
S. Vijayakumar
47
1
0
02 May 2025
Learning Neural Control Barrier Functions from Offline Data with Conservatism
Learning Neural Control Barrier Functions from Offline Data with Conservatism
Ihab Tabbara
Hussein Sibai
OffRL
65
0
0
01 May 2025
TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching
TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching
Yue Meng
Chuchu Fan
38
0
0
01 May 2025
Fine-Tuning without Performance Degradation
Fine-Tuning without Performance Degradation
Han Wang
Adam White
Martha White
OnRL
161
0
0
01 May 2025
FAST-Q: Fast-track Exploration with Adversarially Balanced State Representations for Counterfactual Action Estimation in Offline Reinforcement Learning
FAST-Q: Fast-track Exploration with Adversarially Balanced State Representations for Counterfactual Action Estimation in Offline Reinforcement Learning
Pulkit Agrawal
Rukma Talwadker
Aditya Pareek
Tridib Mukherjee
OffRL
32
0
0
30 Apr 2025
Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator
Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator
Chenhao Li
Andreas Krause
Marco Hutter
OffRL
26
0
0
23 Apr 2025
Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment
Dynamic Contrastive Skill Learning with State-Transition Based Skill Clustering and Dynamic Length Adjustment
Jinwoo Choi
Seung-Woo Seo
OffRL
29
0
0
21 Apr 2025
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision
Shilin Zhang
Zican Hu
Wenhao Wu
Xinyi Xie
Jianxiang Tang
Chunlin Chen
Daoyi Dong
Yu Cheng
Zhenhong Sun
Zhi Wang
OffRL
139
0
0
21 Apr 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
27
0
0
17 Apr 2025
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
Xuyang Chen
Guojian Wang
Keyu Yan
Lin Zhao
OffRL
37
1
0
16 Apr 2025
A Clean Slate for Offline Reinforcement Learning
A Clean Slate for Offline Reinforcement Learning
Matthew Jackson
Uljad Berdica
Jarek Liesen
Shimon Whiteson
Jakob Foerster
OffRL
OnRL
49
0
0
15 Apr 2025
Diffusion Models for Robotic Manipulation: A Survey
Diffusion Models for Robotic Manipulation: A Survey
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
51
1
0
11 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
26
0
0
06 Apr 2025
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Wei Huang
Qinying Gu
Nanyang Ye
OffRL
29
1
0
04 Apr 2025
Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning
Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning
Ke Jiang
Wen Jiang
Y. Li
Xiaoyang Tan
OffRL
38
0
0
02 Apr 2025
Exploration and Adaptation in Non-Stationary Tasks with Diffusion Policies
Exploration and Adaptation in Non-Stationary Tasks with Diffusion Policies
Gunbir Singh Baveja
37
0
0
31 Mar 2025
Robust Offline Imitation Learning Through State-level Trajectory Stitching
Robust Offline Imitation Learning Through State-level Trajectory Stitching
Shuze Wang
Yunpeng Mei
Hongjie Cao
Yetian Yuan
Gang Wang
Jian Sun
Jie Chen
OffRL
34
0
0
28 Mar 2025
Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation
Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation
Sicong Liu
Yang Shu
Chenjuan Guo
Bin Yang
OffRL
58
3
0
27 Mar 2025
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Hongye Cao
Fan Feng
Jing Huo
Shangdong Yang
Meng Fang
Tianpei Yang
Yang Gao
AAML
OffRL
58
0
0
26 Mar 2025
Offline Reinforcement Learning with Discrete Diffusion Skills
Offline Reinforcement Learning with Discrete Diffusion Skills
Ruixi Qiao
Jie Cheng
Xingyuan Dai
Yonglin Tian
Yisheng Lv
OffRL
84
0
0
26 Mar 2025
Extendable Long-Horizon Planning via Hierarchical Multiscale Diffusion
Extendable Long-Horizon Planning via Hierarchical Multiscale Diffusion
Chang Chen
Hany Hamed
Doojin Baek
Taegu Kang
Yoshua Bengio
Sungjin Ahn
54
0
0
25 Mar 2025
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios
NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios
Songyi Gao
Zuolin Tu
Rong-Jun Qin
Yi-Hao Sun
Xiong-Hui Chen
Yang Yu
OffRL
40
0
0
25 Mar 2025
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners
Wen Zheng Terence Ng
Jianda Chen
Yuan Xu
Tianwei Zhang
41
0
0
24 Mar 2025
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Zihao Liu
Xing Liu
Yizhai Zhang
Zhengxiong Liu
Panfeng Huang
66
0
0
19 Mar 2025
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better
Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better
Meng Song
OffRL
41
0
0
19 Mar 2025
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Natinael Solomon Neggatu
Jeremie Houssineau
Giovanni Montana
OffRL
OnRL
70
0
0
15 Mar 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Jingyi Wang
Shuicheng Yan
OffRL
53
0
0
10 Mar 2025
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
Yunkai Gao
Jiaming Guo
Fan Wu
Rui Zhang
OffRL
56
0
0
07 Mar 2025
THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks
Chaoran Xiong
Litao Wei
Kehui Ma
Zhen Sun
Yan Xiang
Zihan Nan
Trieu-Kien Truong
Ling Pei
41
0
0
07 Mar 2025
Personalized Generation In Large Model Era: A Survey
Yiyan Xu
Jinghao Zhang
Alireza Salemi
Xinting Hu
Luu Anh Tuan
Fuli Feng
Hamed Zamani
Xiangnan He
Tat-Seng Chua
3DV
79
2
0
04 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
34
0
0
04 Mar 2025
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
Teng Pang
Bingzheng Wang
Guoqiang Wu
Yilong Yin
OffRL
70
0
0
03 Mar 2025
Behavior Preference Regression for Offline Reinforcement Learning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
33
0
0
02 Mar 2025
What Makes a Good Diffusion Planner for Decision Making?
Haofei Lu
Dongqi Han
Yifei Shen
Dongsheng Li
DiffM
38
3
0
01 Mar 2025
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Baiting Luo
Ava Pettet
Aron Laszka
A. Dubey
Ayan Mukhopadhyay
OffRL
43
1
0
28 Feb 2025
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Yiqin Yang
Quanwei Wang
Chenghao Li
Hao Hu
Chengjie Wu
...
Dianyu Zhong
Ziyou Zhang
Qianchuan Zhao
Chongjie Zhang
Xu Bo
OffRL
47
0
0
26 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
BOSS: Benchmark for Observation Space Shift in Long-Horizon Task
BOSS: Benchmark for Observation Space Shift in Long-Horizon Task
Yue Yang
Linfeng Zhao
Mingyu Ding
Gedas Bertasius
D. Szafir
75
0
0
24 Feb 2025
1234...171819
Next