ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.11802
  4. Cited By
Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and
  Stable Online Fine-Tuning

Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning

21 November 2022
Alex Beeson
Giovanni Montana
    OffRLOnRL
ArXiv (abs)PDFHTML

Papers citing "Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning"

16 / 16 papers shown
Title
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations
Yujie Zhu
Charles A. Hepburn
Matthew Thorpe
Giovanni Montana
132
0
0
19 Sep 2025
Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data
Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data
Jeonghye Kim
Yongjae Shin
Whiyoung Jung
Sunghoon Hong
Deunsol Yoon
Y. Sung
Kanghoon Lee
Woohyung Lim
OffRL
261
0
0
11 Jul 2025
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL
Qin-Wen Luo
Ming-Kun Xie
Ye-Wen Wang
Sheng-Jun Huang
OffRL
156
0
0
26 May 2025
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
Wenjun Cao
162
0
0
26 Apr 2025
SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
SAMG: Offline-to-Online Reinforcement Learning via State-Action-Conditional Offline Model Guidance
Liyu Zhang
Haochi Wu
Xu Wan
Quan Kong
Ruilong Deng
Mingyang Sun
OffRLOnRL
175
0
0
24 Feb 2025
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo
  Cancellation
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
Fei Zhao
Xueliang Zhang
196
3
0
25 Dec 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
SelfBC: Self Behavior Cloning for Offline Reinforcement LearningEuropean Conference on Artificial Intelligence (ECAI), 2024
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
236
3
0
04 Aug 2024
Hierarchical Decision Making Based on Structural Information Principles
Hierarchical Decision Making Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
233
2
0
15 Apr 2024
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for
  Autonomous Real-World Reinforcement Learning
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2023
Jingyun Yang
Max Sobol Mark
Brandon Vu
Archit Sharma
Jeannette Bohg
Chelsea Finn
OffRLOnRL
179
41
0
23 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement
  Learning
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRLOnRL
235
7
0
09 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q Conditioned
  State Entropy Exploration
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
Ziqi Zhang
Xiao Xiong
Zifeng Zhuang
Jinxin Liu
Xuetao Zhang
OffRLOnRL
307
0
0
07 Oct 2023
Iteratively Refined Behavior Regularization for Offline Reinforcement
  Learning
Iteratively Refined Behavior Regularization for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Xiao Hu
Yi-An Ma
Chenjun Xiao
Yan Zheng
Zhaopeng Meng
OffRL
121
7
0
09 Jun 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Revisiting the Minimalist Approach to Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
254
72
0
16 May 2023
Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent
  Reinforcement Learning
Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning
Claude Formanek
C. Tilbury
Jonathan P. Shock
Kale-ab Tessera
Arnu Pretorius
139
3
0
31 Mar 2023
Balancing policy constraint and ensemble size in uncertainty-based
  offline reinforcement learning
Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learningMachine-mediated learning (ML), 2023
Alex Beeson
Giovanni Montana
OffRL
189
15
0
26 Mar 2023
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online
  Fine-Tuning
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-TuningNeural Information Processing Systems (NeurIPS), 2023
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi-An Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRLOnRL
449
174
0
09 Mar 2023
1