ResearchTrend.AI
Guarded Policy Optimization with Imperfect Online Demonstrations

3 March 2023
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
    OffRL

Papers citing "Guarded Policy Optimization with Imperfect Online Demonstrations"

9 / 9 papers shown
Title
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Bo An
Shuicheng Yan
OffRL
42
0
0
10 Mar 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
43
0
0
06 Mar 2025
Knowledge Transfer from Simple to Complex: A Safe and Efficient Reinforcement Learning Framework for Autonomous Driving Decision-Making
Rongliang Zhou
Jiakun Huang
Mingjun Li
Hepeng Li
Haotian Cao
Xiaolin Song
19
0
0
18 Oct 2024
Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving
Zilin Huang
Zihao Sheng
Sikai Chen
28
4
0
01 Sep 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver
Chen Tang
Ce Hao
Kenta Kawamoto
Masayoshi Tomizuka
Wei Zhan
OffRL
24
0
0
22 Feb 2024
Residual Q-Learning: Offline and Online Policy Customization without Value
Chenran Li
Chen Tang
Haruki Nishimura
Jean-Pierre Mercat
M. Tomizuka
Wei Zhan
OffRL
17
6
0
15 Jun 2023
State Regularized Policy Optimization on Data with Dynamics Shift
Zhenghai Xue
Qingpeng Cai
Shuchang Liu
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
20
16
0
06 Jun 2023
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
Quanyi Li
Zhenghao Peng
Bolei Zhou
69
35
0
17 Feb 2022
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,944
0
04 May 2020