2005.01643
Cited By
v1
v2
v3 (latest)
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
4 May 2020
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
Papers citing
"Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems"
50 / 1,433 papers shown
Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering
Zhongjian Qiao
Rui Yang
Jiafei Lyu
Chenjia Bai
Xiu Li
Zhuoran Yang
Siyang Gao
207
0
0
02 Dec 2025
Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts
Zhongjian Qiao
Rui Yang
Jiafei Lyu
Xiu Li
Zhongxiang Dai
Zhuoran Yang
Siyang Gao
Shuang Qiu
OffRL
227
2
0
02 Dec 2025
FOVA: Offline Federated Reinforcement Learning with Mixed-Quality Data
Nan Qiao
Sheng Yue
Ju Ren
Yaoxue Zhang
OffRL
163
2
0
02 Dec 2025
Forecasting in Offline Reinforcement Learning for Non-stationary Environments
S. E. Ada
Georg Martius
Emre Ugur
Erhan Öztop
OffRL
261
0
0
01 Dec 2025
Outcome-Aware Spectral Feature Learning for Instrumental Variable Regression
Dimitri Meunier
Jakub Wornbard
Vladimir Kostic
Antoine Moulin
Alek Fröhlich
Karim Lounici
Massimiliano Pontil
Arthur Gretton
CML
146
2
0
30 Nov 2025
Algorithmic Guarantees for Distilling Supervised and Offline RL Datasets
Aaryan Gupta
Rishi Saket
A. Raghuveer
OffRL
DD
280
0
0
29 Nov 2025
BAMAS: Structuring Budget-Aware Multi-Agent Systems
Liming Yang
Junyu Luo
Xuanzhe Liu
Yiling Lou
Zhenpeng Chen
LLMAG
412
0
0
26 Nov 2025
SOMBRL: Scalable and Optimistic Model-Based RL
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Florian Dorfler
Pieter Abbeel
Andreas Krause
OffRL
318
4
0
25 Nov 2025
A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms
Ali Murtaza Caunhye
Asad Jeewa
189
0
0
20 Nov 2025
π^{*}_{0.6}: a VLA That Learns From Experience
Physical Intelligence
Ali Amin
Raichelle Aniceto
Ashwin Balakrishna
Kevin Black
...
Blake Williams
Sukwon Yoo
Lili Yu
Ury Zhilinsky
Zhiyuan Zhou
OffRL
VLM
1.3K
100
0
18 Nov 2025
Soft Conflict-Resolution Decision Transformer for Offline Multi-Task Reinforcement Learning
Shudong Wang
Xinfei Wang
Chenhao Zhang
Shanchen Pang
Haiyuan Gui
Wenhao Ji
Xiaojian Liao
OffRL
161
2
0
17 Nov 2025
Integrating Neural Differential Forecasting with Safe Reinforcement Learning for Blood Glucose Regulation
Yushen Liu
Yanfu Zhang
Xugui Zhou
AI4TS
94
0
0
16 Nov 2025
Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
Xinming Gao
Shangzhe Li
Yujin Cai
Wenwu Yu
OffRL
GP
164
0
0
15 Nov 2025
Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies
Dong-Hee Shin
Deok-Joong Lee
Young-Han Son
Tae-Eui Kam
OffRL
205
3
0
15 Nov 2025
PROF: An LLM-based Reward Code Preference Optimization Framework for Offline Imitation Learning
Shengjie Sun
Jiafei Lyu
Runze Liu
Mengbei Yan
Bo Liu
Deheng Ye
Xiu Li
OffRL
388
0
0
14 Nov 2025
Enhancing Robustness of Offline Reinforcement Learning Under Data Corruption via Sharpness-Aware Minimization
Le Xu
Jiayu Chen
AAML
119
0
0
14 Nov 2025
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
Yunchang Ma
Tenglong Liu
Yixing Lan
Xin Yin
Changxin Zhang
Xinglong Zhang
Xin Xu
OffRL
297
0
0
12 Nov 2025
Multi-agent Coordination via Flow Matching
Dongsu Lee
Daehee Lee
Amy Zhang
191
2
0
07 Nov 2025
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Yixiu Mao
Yun Qu
Qi Wang
Xiangyang Ji
OffRL
194
1
0
04 Nov 2025
Closing the Expression Gap in LLM Instructions via Socratic Questioning
Jianwen Sun
Yukang Feng
Yifan Chang
Chuanhao Li
Zizhen Li
Jiaxin Ai
Fanrui Zhang
Yu Dai
Kaipeng Zhang
239
0
0
31 Oct 2025
Data-Efficient RLVR via Off-Policy Influence Guidance
Erle Zhu
Dazhi Jiang
Y. Wang
X. Li
Jiale Cheng
...
Yilin Niu
A. Zeng
J. Tang
Shiyu Huang
Hongning Wang
OffRL
206
3
0
30 Oct 2025
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
Wenli Xiao
Haotian Lin
Andy Peng
Haoru Xue
Tairan He
...
Jimmy Wu
Zhengyi Luo
Linxi Fan
Guanya Shi
Yuke Zhu
VLM
694
21
0
30 Oct 2025
Offline Clustering of Preference Learning with Active-data Augmentation
Jingyuan Liu
Fatemeh Ghaffari
Xuchuang Wang
Xutong Liu
Mohammad Hajiesmaili
Carlee Joe-Wong
OffRL
280
0
0
30 Oct 2025
ZTRS: Zero-Imitation End-to-end Autonomous Driving with Trajectory Scoring
Zhenxin Li
Wenhao Yao
Zi Wang
Xinglong Sun
Jingde Chen
...
Maying Shen
Jingyu Song
Zuxuan Wu
Shiyi Lan
Jose M. Alvarez
207
7
0
28 Oct 2025
Mixed-Density Diffuser: Efficient Planning with Non-Uniform Temporal Resolution
Crimson Stambaugh
Rajesh P. N. Rao
DiffM
288
0
0
27 Oct 2025
Human-Like Goalkeeping in a Realistic Football Simulation: a Sample-Efficient Reinforcement Learning Approach
Alessandro Sestini
Joakim Bergdahl
Jean-Philippe Barrette-LaPierre
Florian Fuchs
Brady Chen
Michael Jones
Linus Gisslén
210
0
0
27 Oct 2025
Transitive RL: Value Learning via Divide and Conquer
S. Park
Aditya Oberai
P. Atreya
Sergey Levine
OffRL
170
2
0
26 Oct 2025
Reducing the Probability of Undesirable Outputs in Language Models Using Probabilistic Inference
S. Zhao
Aidan Li
Rob Brekelmans
Roger C. Grosse
130
0
0
24 Oct 2025
Confounding Robust Deep Reinforcement Learning: A Causal Approach
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRL
CML
245
2
0
24 Oct 2025
Online Optimization for Offline Safe Reinforcement Learning
Yassine Chemingui
Aryan Deshwal
Alan Fern
Thanh Nguyen-Tang
J. Doppa
OffRL
179
0
0
24 Oct 2025
Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning
Kevin Huang
Rosario Scalise
Cleah Winston
Ayush Agrawal
Yunchu Zhang
...
Byron Boots
Benjamin Burchfiel
Hongkai Dai
Masha Itkina
Paarth Shah
OffRL
345
0
0
22 Oct 2025
Implicit State Estimation via Video Replanning
Po-Chen Ko
Jiayuan Mao
Yu-Hsiang Fu
Hsien-Jeng Yeh
Chu-Rong Chen
Wei-Chiu Ma
Yilun Du
Shao-Hua Sun
174
1
0
20 Oct 2025
Finite-Time Bounds for Average-Reward Fitted Q-Iteration
Jongmin Lee
Ernest K. Ryu
OffRL
137
0
0
20 Oct 2025
OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning
Woo-Jin Ahn
Sang-Ryul Baek
Yong-Jun Lee
H. Choi
M. Lim
OffRL
154
0
0
17 Oct 2025
RM-RL: Role-Model Reinforcement Learning for Precise Robot Manipulation
Xiangyu Chen
Chuhao Zhou
Yuxi Liu
Jianfei Yang
OffRL
233
0
0
16 Oct 2025
Reinforcement Learning Meets Masked Generative Models: Mask-GRPO for Text-to-Image Generation
Yifu Luo
Xinhao Hu
Keyu Fan
Haoyuan Sun
Zeyu Chen
Bo Xia
Tiantian Zhang
Yongzhe Chang
Xueqian Wang
195
3
0
15 Oct 2025
Beyond Static LLM Policies: Imitation-Enhanced Reinforcement Learning for Recommendation
Yi Zhang
Lili Xie
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
159
1
0
15 Oct 2025
Adversarial Fine-tuning in Offline-to-Online Reinforcement Learning for Robust Robot Control
Shingo Ayabe
Hiroshi Kera
K. Kawamoto
AAML
OffRL
OnRL
371
0
0
15 Oct 2025
Expert or not? assessing data quality in offline reinforcement learning
Arip Asadulaev
Fakhri Karray
Martin Takáč
OffRL
160
0
0
14 Oct 2025
FOSSIL: Harnessing Feedback on Suboptimal Samples for Data-Efficient Generalisation with Imitation Learning for Embodied Vision-and-Language Tasks
Sabrina McCallum
Amit Parekh
Alessandro Suglia
LM&Ro
171
0
0
13 Oct 2025
Taxonomy and Trends in Reinforcement Learning for Robotics and Control Systems: A Structured Review
Kumater Ter
Ore-Ofe Ajayi
Daniel Udekwe
341
2
0
11 Oct 2025
Scalable Offline Metrics for Autonomous Driving
Animikh Aich
Adwait Kulkarni
Eshed Ohn-Bar
OffRL
273
0
0
09 Oct 2025
DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
Changyeon Kim
Haeone Lee
Younggyo Seo
Kimin Lee
Yuke Zhu
OffRL
179
2
0
09 Oct 2025
Energy-Guided Diffusion Sampling for Long-Term User Behavior Prediction in Reinforcement Learning-based Recommendation
Xiaocong Chen
Siyu Wang
Lina Yao
OffRL
143
1
0
09 Oct 2025
Expressive Value Learning for Scalable Offline Reinforcement Learning
Nicolas Espinosa-Dice
Kianté Brantley
Wen Sun
OffRL
308
1
0
09 Oct 2025
Maximum In-Support Return Modeling for Dynamic Recommendation with Language Model Prior
Xiaocong Chen
Siyu Wang
Lina Yao
OffRL
AI4TS
139
0
0
09 Oct 2025
Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction
Noor Islam S. Mohammad
99
0
0
09 Oct 2025
TS-Agent: Understanding and Reasoning Over Raw Time Series via Iterative Insight Gathering
Penghang Liu
Elizabeth Fons
Svitlana Vyetrenko
Daniel Borrajo
Vamsi K. Potluru
Manuela Veloso
AI4TS
AIFin
LRM
277
2
0
08 Oct 2025
Dual Goal Representations
S. Park
Deepinder Mann
Sergey Levine
269
3
0
08 Oct 2025
A Case for Leveraging Generative AI to Expand and Enhance Training in the Provision of Mental Health Services
Hannah R. Lawrence
Shannon Wiltsey Stirman
Samuel Dorison
Taedong Yun
Megan Jones Bell
AI4MH
202
0
0
08 Oct 2025