Cost Function Estimation Using Inverse Reinforcement Learning with Minimal Observations

Abstract

We present an iterative inverse reinforcement learning algorithm that infers optimal cost functions in continuous spaces. Based on the popular maximum entropy criterion, our approach iteratively computes a weight improvement step and proposes a method for choosing a step size that keeps the learned cost function features similar to the features of the demonstrated trajectory. In contrast to similar approaches, our algorithm can individually tune the effectiveness of each observation in the partition function and does not need a large sample set, which enables faster learning. We generate sample trajectories by solving an optimal control problem rather than by random sampling, which yields more informative trajectories. We compare the performance of our method against two state-of-the-art algorithms in several simulated environments to demonstrate its benefits.
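To make the maximum entropy weight update concrete, the following is a minimal sketch of one generic gradient step. It is not the authors' algorithm. It assumes a cost that is linear in trajectory features, a small set of sample trajectories supplied by the caller as precomputed feature vectors, and a plain backtracking rule standing in for the paper's step-size method. All function and parameter names (`softmin_weights`, `maxent_nll`, `maxent_step`, `shrink`, `max_halvings`) are hypothetical.

```python
import numpy as np

def softmin_weights(w, feats):
    """Probability of each sample trajectory, p_i proportional to exp(-w . phi_i)."""
    costs = feats @ w
    z = np.exp(-(costs - costs.min()))  # shift by the minimum for numerical stability
    return z / z.sum()

def maxent_nll(w, demo_feat, feats):
    """Negative log-likelihood of the demonstration under the sampled partition function."""
    costs = feats @ w
    m = costs.min()
    log_z = -m + np.log(np.exp(-(costs - m)).sum())  # log sum_i exp(-cost_i)
    return float(demo_feat @ w + log_z)

def maxent_step(w, demo_feat, feats, step=1.0, shrink=0.5, max_halvings=20):
    """One gradient step on the weights with backtracking on the step size.

    Assumption: backtracking on the likelihood is a generic stand-in here,
    not the step-size rule proposed in the paper.
    """
    p = softmin_weights(w, feats)
    grad = demo_feat - p @ feats  # demonstrated features minus expected sample features
    base = maxent_nll(w, demo_feat, feats)
    for _ in range(max_halvings):
        w_new = w - step * grad
        if maxent_nll(w_new, demo_feat, feats) <= base:
            return w_new
        step *= shrink
    return w  # no improving step found; keep the current weights
```

In this sketch the gradient vanishes exactly when the expected features of the samples match the demonstrated features, which is the feature-matching condition that the maximum entropy criterion targets. In the paper the samples come from solving an optimal control problem; here they are simply passed in as `feats`.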

@article{mehrdad2025_2505.08619,
  title={Cost Function Estimation Using Inverse Reinforcement Learning with Minimal Observations},
  author={Sarmad Mehrdad and Avadesh Meduri and Ludovic Righetti},
  journal={arXiv preprint arXiv:2505.08619},
  year={2025}
}