Mutual-Information Regularization in Markov Decision Processes and
Actor-Critic LearningConference on Robot Learning (CoRL), 2019 |
A Unified Bellman Optimality Principle Combining Reward Maximization and
EmpowermentNeural Information Processing Systems (NeurIPS), 2019 |