Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2006.09359
Cited By
v1
v2
v3
v4
v5
v6 (latest)
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
16 June 2020
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"AWAC: Accelerating Online Reinforcement Learning with Offline Datasets"
50 / 496 papers shown
Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
Hyeongyu Kang
Jaewoo Lee
Woocheol Shin
Kiyoung Om
Jinkyoo Park
186
0
0
04 Dec 2025
Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning
Franki Nguimatsia Tiofack
Théotime Le Hellard
Fabian Schramm
Nicolas Perrin-Gilbert
Justin Carpentier
312
1
0
03 Dec 2025
Real-World Reinforcement Learning of Active Perception Behaviors
E. Hu
Jie Wang
Xingfang Yuan
Fiona Luo
Muyao Li
Gaspard Lambrechts
Oleh Rybkin
Dinesh Jayaraman
OffRL
289
3
0
01 Dec 2025
Forecasting in Offline Reinforcement Learning for Non-stationary Environments
S. E. Ada
Georg Martius
Emre Ugur
Erhan Öztop
OffRL
259
0
0
01 Dec 2025
Discover, Learn, and Reinforce: Scaling Vision-Language-Action Pretraining with Diverse RL-Generated Trajectories
Rushuai Yang
Zhiyuan Feng
Tianxiang Zhang
Kaixin Wang
Chuheng Zhang
Li Zhao
Xiu Su
Yi-Ling Chen
Jiang Bian
OffRL
254
0
0
24 Nov 2025
One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow
Zeyuan Wang
Da Li
Yulin Chen
Ye-ling Shi
Liang Bai
Tianyuan Yu
Yanwei Fu
OffRL
221
4
0
17 Nov 2025
Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
Xinming Gao
Shangzhe Li
Yujin Cai
Wenwu Yu
OffRL
GP
164
0
0
15 Nov 2025
Treatment Stitching with Schrödinger Bridge for Enhancing Offline Reinforcement Learning in Adaptive Treatment Strategies
Dong-Hee Shin
Deok-Joong Lee
Young-Han Son
Tae-Eui Kam
OffRL
205
3
0
15 Nov 2025
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
Yunchang Ma
Tenglong Liu
Yixing Lan
Xin Yin
Changxin Zhang
Xinglong Zhang
Xin Xu
OffRL
297
0
0
12 Nov 2025
Partial Action Replacement: Tackling Distribution Shift in Offline MARL
Yue Jin
Giovanni Montana
OffRL
181
1
0
10 Nov 2025
From Static to Dynamic: Enhancing Offline-to-Online Reinforcement Learning via Energy-Guided Diffusion Stratification
Lipeng Zu
Hansong Zhou
Xiaonan Zhang
OffRL
OnRL
508
0
0
05 Nov 2025
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Yixiu Mao
Yun Qu
Qi Wang
Xiangyang Ji
OffRL
192
1
0
04 Nov 2025
Leveraging Discrete Function Decomposability for Scientific Design
James C. Bowden
Sergey Levine
Jennifer Listgarten
153
0
0
04 Nov 2025
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
Wenli Xiao
Haotian Lin
Andy Peng
Haoru Xue
Tairan He
...
Jimmy Wu
Zhengyi Luo
Linxi Fan
Guanya Shi
Yuke Zhu
VLM
693
21
0
30 Oct 2025
LRT-Diffusion: Calibrated Risk-Aware Guidance for Diffusion Policies
Ximan Sun
Xiang Cheng
OffRL
136
0
0
28 Oct 2025
RM-RL: Role-Model Reinforcement Learning for Precise Robot Manipulation
Xiangyu Chen
Chuhao Zhou
Yuxi Liu
Jianfei Yang
OffRL
230
0
0
16 Oct 2025
RL-100: Performant Robotic Manipulation with Real-World Reinforcement Learning
Kun Lei
Huanyu Li
Dongjie Yu
Zhenyu Wei
Lingxiao Guo
Zhennan Jiang
Ziyu Wang
Shiyu Liang
Huazhe Xu
OffRL
VLM
460
24
0
16 Oct 2025
A New Perspective on Transformers in Online Reinforcement Learning for Continuous Control
Nikita Kachaev
Daniil Zelezetsky
Egor Cherepanov
Alexey K. Kovelev
Aleksandr I. Panov
OffRL
192
2
0
15 Oct 2025
Adversarial Fine-tuning in Offline-to-Online Reinforcement Learning for Robust Robot Control
Shingo Ayabe
Hiroshi Kera
K. Kawamoto
AAML
OffRL
OnRL
370
0
0
15 Oct 2025
Human-in-the-Loop Bandwidth Estimation for Quality of Experience Optimization in Real-Time Video Communication
Sami Khairy
Gabriel Mittag
Vishak Gopal
Ross Cutler
120
0
0
14 Oct 2025
Offline Reinforcement Learning with Generative Trajectory Policies
Xinsong Feng
Leshu Tang
Chenan Wang
Haipeng Chen
OffRL
185
0
0
13 Oct 2025
Dejavu: Towards Experience Feedback Learning for Embodied Intelligence
Shaokai Wu
Yanbiao Ji
Qiuchang Li
Zhiyi Zhang
Shalayiding Sirejiding
Wenyuan Xie
Guodong Zhang
Bayram Bayramli
Yue Ding
Hongtao Lu
197
0
0
11 Oct 2025
Continual Learning for Adaptive AI Systems
Md Hasibul Amin
Tamzid Tanvi Alam
CLL
297
3
0
09 Oct 2025
Expressive Value Learning for Scalable Offline Reinforcement Learning
Nicolas Espinosa-Dice
Kianté Brantley
Wen Sun
OffRL
307
1
0
09 Oct 2025
DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
Changyeon Kim
Haeone Lee
Younggyo Seo
Kimin Lee
Yuke Zhu
OffRL
179
2
0
09 Oct 2025
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization
Kai Fukazawa
Kunal Mundada
Iman Soltani
OffRL
216
0
0
03 Oct 2025
Diffusion Alignment as Variational Expectation-Maximization
Jaewoo Lee
Minsu Kim
S. Choi
Inhyuck Song
Sujin Yun
Hyeongyu Kang
Woocheol Shin
Taeyoung Yun
Kiyoung Om
Jinkyoo Park
162
0
0
01 Oct 2025
Integrating Offline Pre-Training with Online Fine-Tuning: A Reinforcement Learning Approach for Robot Social Navigation
Run Su
Hao Fu
Shuai Zhou
Yingao Fu
OffRL
OnRL
263
0
0
01 Oct 2025
Realistic CDSS Drug Dosing with End-to-end Recurrent Q-learning for Dual Vasopressor Control
Will Y. Zou
Jean Feng
Alexandre Kalimouttou
Jennifer Yuntong Zhang
Christopher W. Seymour
Romain Pirracchio
OffRL
198
0
0
01 Oct 2025
Accelerating Transformers in Online RL
Daniil Zelezetsky
A. Kovalev
Aleksandr I. Panov
OffRL
159
0
0
30 Sep 2025
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
Longxiang He
Deheng Ye
Junbo Tan
Xueqian Wang
Li Shen
OnRL
394
0
0
29 Sep 2025
Residual Off-Policy RL for Finetuning Behavior Cloning Policies
Lars Ankile
Zhenyu Jiang
Rocky Duan
Guanya Shi
Pieter Abbeel
Anusha Nagabandi
OffRL
293
15
0
23 Sep 2025
Diffusion Policies with Offline and Inverse Reinforcement Learning for Promoting Physical Activity in Older Adults Using Wearable Sensors
Chang Liu
Ladda Thiamwong
Yanjie Fu
Rui Xie
OffRL
187
0
0
22 Sep 2025
LLM-Guided Task- and Affordance-Level Exploration in Reinforcement Learning
Jelle Luijkx
Runyu Ma
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
178
1
0
20 Sep 2025
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Yujie Zhu
Charles A. Hepburn
Matthew Thorpe
Giovanni Montana
305
0
0
19 Sep 2025
Solving Robotics Tasks with Prior Demonstration via Exploration-Efficient Deep Reinforcement Learning
Chengyandan Shen
Christoffer Sloth
OffRL
195
0
0
04 Sep 2025
Retrosynthesis Planning via Worst-path Policy Optimisation in Tree-structured MDPs
Mianchu Wang
Giovanni Montana
207
0
0
01 Sep 2025
Re:Frame -- Retrieving Experience From Associative Memory
Daniil Zelezetsky
Egor Cherepanov
A. Kovalev
Aleksandr Panov
OffRL
106
1
0
26 Aug 2025
Double Check My Desired Return: Transformer with Target Alignment for Offline Reinforcement Learning
Yue Pei
Hongming Zhang
Chao Gao
Martin Müller
Mengxiao Zhu
Hao Sheng
Ziliang Chen
Liang Lin
Haogang Zhu
OffRL
221
0
0
22 Aug 2025
Exploiting Policy Idling for Dexterous Manipulation
Annie S. Chen
Philemon Brakel
Antonia Bronars
Annie Xie
Sandy Huang
Oliver Groth
Maria Bauzá
Markus Wulfmeier
N. Heess
Dushyant Rao
240
1
0
21 Aug 2025
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
Xiao Huang
Xu Liu
Enze Zhang
T. Yu
Shuai Li
OffRL
OnRL
273
3
0
09 Aug 2025
DiWA: Diffusion Policy Adaptation with World Models
Akshay L Chandra
Iman Nematollahi
Chenguang Huang
Tim Welschehold
Wolfram Burgard
Abhinav Valada
OffRL
233
15
0
05 Aug 2025
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
Lu Guo
Yixiang Shan
Zhengbang Zhu
Qifan Liang
Lichang Song
Ting Long
Weinan Zhang
Yi-Ju Chang
OffRL
254
0
0
21 Jul 2025
Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)
Chongli Qin
Jost Tobias Springenberg
OffRL
307
17
0
17 Jul 2025
Reinforcement Learning with Action Chunking
Qiyang Li
Zhiyuan Zhou
Sergey Levine
OffRL
OnRL
496
37
0
10 Jul 2025
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
Ruiquan Huang
Donghao Li
Chengshuai Shi
Cong Shen
Jing Yang
OffRL
504
0
0
01 Jul 2025
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning
Prajwal Koirala
Cody Fleming
OffRL
452
5
0
26 Jun 2025
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
Samin Yeasar Arnob
Scott Fujimoto
Doina Precup
OffRL
288
0
0
20 Jun 2025
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
Ranting Hu
OffRL
329
0
0
18 Jun 2025
Steering Your Diffusion Policy with Latent Space Reinforcement Learning
Andrew Wagenmaker
Mitsuhiko Nakamoto
Yunchu Zhang
S. Park
Waleed Yagoub
Anusha Nagabandi
Abhishek Gupta
Sergey Levine
OffRL
378
68
0
18 Jun 2025
1
2
3
4
...
8
9
10
Next
Page 1 of 10