Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.01728
Cited By
Guarded Policy Optimization with Imperfect Online Demonstrations
3 March 2023
Zhenghai Xue
Zhenghao Peng
Quanyi Li
Zhihan Liu
Bolei Zhou
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Guarded Policy Optimization with Imperfect Online Demonstrations"
9 / 9 papers shown
Title
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
Zhenghai Xue
Lang Feng
Jiacheng Xu
Kang Kang
Xiang Wen
Bo An
Shuicheng Yan
OffRL
42
0
0
10 Mar 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
43
0
0
06 Mar 2025
Knowledge Transfer from Simple to Complex: A Safe and Efficient Reinforcement Learning Framework for Autonomous Driving Decision-Making
Rongliang Zhou
Jiakun Huang
Mingjun Li
Hepeng Li
Haotian Cao
Xiaolin Song
19
0
0
18 Oct 2024
Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving
Zilin Huang
Zihao Sheng
Sikai Chen
28
4
0
01 Sep 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver
Chen Tang
Ce Hao
Kenta Kawamoto
Masayoshi Tomizuka
Wei Zhan
OffRL
24
0
0
22 Feb 2024
Residual Q-Learning: Offline and Online Policy Customization without Value
Chenran Li
Chen Tang
Haruki Nishimura
Jean-Pierre Mercat
M. Tomizuka
Wei Zhan
OffRL
17
6
0
15 Jun 2023
State Regularized Policy Optimization on Data with Dynamics Shift
Zhenghai Xue
Qingpeng Cai
Shuchang Liu
Dong Zheng
Peng Jiang
Kun Gai
Bo An
OffRL
20
16
0
06 Jun 2023
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
Quanyi Li
Zhenghao Peng
Bolei Zhou
69
35
0
17 Feb 2022
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,944
0
04 May 2020
1