Offline Supervised Learning V.S. Online Direct Policy Optimization: A
Comparative Study and A Unified Training Paradigm for Neural Network-Based
Optimal Feedback Control
Papers citing "Offline Supervised Learning V.S. Online Direct Policy Optimization: A
Comparative Study and A Unified Training Paradigm for Neural Network-Based
Optimal Feedback Control"