ResearchTrend.AI
Efficient Online Reinforcement Learning with Offline Data

6 February 2023
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
    OffRL
    OnRL

Papers citing "Efficient Online Reinforcement Learning with Offline Data"

50 / 128 papers shown
Automatic Reward Shaping from Confounded Offline Data
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRL
OnRL
28
1
0
16 May 2025
What Matters for Batch Online Reinforcement Learning in Robotics?
Perry Dong
Suvir Mirchandani
Dorsa Sadigh
Chelsea Finn
OffRL
28
0
0
12 May 2025
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
Tianjian Li
Daniel Khashabi
55
0
0
05 May 2025
Implicit Neural-Representation Learning for Elastic Deformable-Object Manipulations
Minseok Song
JeongHo Ha
Bonggyeong Park
Daehyung Park
132
0
0
01 May 2025
Fine-Tuning without Performance Degradation
Han Wang
Adam White
Martha White
OnRL
161
0
0
01 May 2025
Next-Future: Sample-Efficient Policy Learning for Robotic-Arm Tasks
Fikrican Özgür
René Zurbrugg
Suryansh Kumar
35
0
0
15 Apr 2025
MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories
Natalie Tirabassi
Sathish A. P. Kumar
S. Jha
Arvind Ramanathan
LM&Ro
OffRL
51
0
0
04 Apr 2025
HACTS: a Human-As-Copilot Teleoperation System for Robot Learning
Zhihao Xu
Yinuo Zhao
Kun Wu
Ning Liu
Junjie Ji
Zhengping Che
C. Liu
Jian Tang
47
0
0
31 Mar 2025
Robot Policy Transfer with Online Demonstrations: An Active Reinforcement Learning Approach
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
OffRL
46
0
0
17 Mar 2025
Refined Policy Distillation: From VLA Generalists to RL Experts
Tobias Jülg
Wolfram Burgard
Florian Walter
OffRL
39
1
0
06 Mar 2025
A comparison of visual representations for real-world reinforcement learning in the context of vacuum gripping
Nico Sutter
Valentin N. Hartmann
Stelian Coros
OffRL
66
0
0
04 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
36
0
0
04 Mar 2025
Active Robot Curriculum Learning from Online Human Demonstrations
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
67
0
0
04 Mar 2025
Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning
Adrià López Escoriza
Nicklas Hansen
Stone Tao
Tongzhou Mu
H. Su
OffRL
60
0
0
03 Mar 2025
Subtask-Aware Visual Reward Learning from Segmented Demonstrations
Changyeon Kim
Minho Heo
Doohyun Lee
Jinwoo Shin
Honglak Lee
Joseph J. Lim
Kimin Lee
44
1
0
28 Feb 2025
Uncertainty Comes for Free: Human-in-the-Loop Policies with Diffusion Models
Zhanpeng He
Yifeng Cao
M. Ciocarlie
61
0
0
26 Feb 2025
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data
Yi Zhao
Aidan Scannell
Wenshuai Zhao
Yuxin Hou
Tianyu Cui
Le Chen
Dieter Büchler
Arno Solin
Juho Kannala
Joni Pajarinen
OffRL
OnRL
96
1
0
26 Feb 2025
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
Mingyang Sun
Pengxiang Ding
Weinan Zhang
Donglin Wang
OT
83
0
0
24 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
88
2
0
21 Feb 2025
A Survey of Reinforcement Learning for Optimization in Automation
Ahmad Farooq
Kamran Iqbal
OffRL
89
1
0
13 Feb 2025
Deep Reinforcement Learning based Triggering Function for Early Classifiers of Time Series
Aurélien Renault
A. Bondu
Antoine Cornuéjols
Vincent Lemaire
49
0
0
10 Feb 2025
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
Yuhui Chen
Shuai Tian
Shugao Liu
Yingting Zhou
Haoran Li
Dongbin Zhao
OffRL
106
1
0
08 Feb 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kolobov
Abhishek Gupta
78
2
0
04 Feb 2025
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network
Jijia Liu
Feng Gao
Q. Liao
Chao Yu
Yu-Xiang Wang
OffRL
70
0
0
01 Feb 2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Haque Ishfaq
Guangyuan Wang
Sami Nur Islam
Doina Precup
57
2
0
29 Jan 2025
SR-Reward: Taking The Path More Traveled
Seyed Mahdi Basiri Azad
Zahra Padar
Gabriel Kalweit
Joschka Boedecker
OffRL
67
0
0
04 Jan 2025
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
Fei Zhao
Xueliang Zhang
36
0
0
25 Dec 2024
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
Xiu Yuan
Tongzhou Mu
Stone Tao
Yunhao Fang
Mengke Zhang
H. Su
OffRL
66
3
0
18 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
84
0
0
16 Dec 2024
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
Charles Xu
Qiyang Li
Jianlan Luo
Sergey Levine
OffRL
85
5
0
13 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
Mohan Kumar Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
95
4
0
09 Dec 2024
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks
Arth Shukla
Stone Tao
Hao Su
91
6
0
09 Dec 2024
Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards
A. Ahmad
Mehdi Kermanshah
Kevin J. Leahy
Zachary Serlin
H. Siu
Makai Mann
C. Vasile
Roberto Tron
C. Belta
OffRL
66
0
0
26 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
35
0
0
06 Nov 2024
So You Think You Can Scale Up Autonomous Robot Data Collection?
Suvir Mirchandani
Suneel Belkhale
Joey Hejna
Evelyn Choi
Md Sazzad Islam
Dorsa Sadigh
OffRL
38
5
0
04 Nov 2024
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use
Jiajun Xi
Yinong He
Jianing Yang
Yinpei Dai
Joyce Chai
LM&Ro
24
2
0
31 Oct 2024
Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration
Hai Zhong
Xun Wang
Zhuoran Li
Longbo Huang
OffRL
OnRL
29
0
0
25 Oct 2024
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
Ondrej Biza
Thomas Weng
Lingfeng Sun
Karl Schmeckpeper
Tarik Kelestemur
Yecheng Jason Ma
Robert Platt
Jan-Willem van de Meent
Lawson L. S. Wong
OffRL
45
0
0
25 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
57
0
0
23 Oct 2024
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
H. Chen
Yi-Ju Chang
Dacheng Tao
Lichao Sun
OffRL
36
0
0
21 Oct 2024
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
59
0
0
19 Oct 2024
Traversability-Aware Legged Navigation by Learning from Real-World Visual Data
Hongbo Zhang
Zhongyu Li
Xuanqi Zeng
Laura Smith
Kyle Stachowicz
...
Zhitao Song
Weipeng Xia
Sergey Levine
K. Sreenath
Yun-hui Liu
36
2
0
14 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
39
1
0
11 Oct 2024
ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI
Stone Tao
Fanbo Xiang
Arth Shukla
Yuzhe Qin
Xander Hinrichsen
...
Zhiao Huang
Roberto Calandra
Rui Chen
Shan Luo
Hao Su
21
27
0
01 Oct 2024
Continuously Improving Mobile Manipulation with Autonomous Real-World RL
Russell Mendonca
Emmanuel Panov
Bernadette Bucher
Jiuguang Wang
Deepak Pathak
OffRL
35
5
0
30 Sep 2024
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
33
3
0
30 Sep 2024
SoloParkour: Constrained Reinforcement Learning for Visual Locomotion from Privileged Experience
Elliot Chane-Sane
Joseph Amigo
Thomas Flayols
Ludovic Righetti
Nicolas Mansard
54
6
0
20 Sep 2024
The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov
Anja Surina
Çağlar Gülçehre
OffRL
AI4CE
53
1
0
11 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
34
0
0
06 Sep 2024