Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations

30 December 2023

Papers citing "Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations"

Hierarchical Reinforcement Learning By Discovering Intrinsic Options
Jesse Zhang, Haonan Yu, W. Xu
BDL, 125, 82, 0, 16 Jan 2021

Learning Guidance Rewards with Trajectory-space Smoothing
Tanmay Gangwani, Yuanshuo Zhou, Jian Peng
24, 32, 0, 23 Oct 2020