Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning

29 September 2015

S. Mohamed

Danilo Jimenez Rezende

DRL

SSL

ArXiv (abs)PDF HTML

Papers citing "Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning"

50 / 157 papers shown

Title
Policy-Based Trajectory Clustering in Offline Reinforcement Learning Hao Hu Xinqi Wang Simon S. Du OffRL 36 0 0 10 Jun 2025
Plasticity as the Mirror of Empowerment David Abel Michael Bowling André Barreto Will Dabney Shi Dong ... Doina Precup Jonathan Richens Mark Rowland Tom Schaul Satinder Singh AI4CE 73 0 0 15 May 2025
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models Cansu Sancaktar Christian Gumbsch Andrii Zadaianchuk Pavel Kolev Georg Martius LM&Ro VLM 163 2 0 03 Mar 2025
Universal AI maximizes Variational Empowerment Yusuke Hayashi Koichi Takahashi 83 0 0 20 Feb 2025
Towards Empowerment Gain through Causal Structure Learning in Model-Based RL Hongye Cao Fan Feng Meng Fang Shaokang Dong Tianpei Yang Jing Huo Yang Gao 124 1 0 14 Feb 2025
CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation Yuanchen Yuan Jin Cheng Núria Armengol Urpí Stelian Coros 135 1 0 02 Feb 2025
Learning to Assist Humans without Inferring Rewards Vivek Myers Evan Ellis Sergey Levine Benjamin Eysenbach Anca Dragan 138 5 0 17 Jan 2025
In Search of a Lost Metric: Human Empowerment as a Pillar of Socially Conscious Navigation Vasanth Reddy Baddam Behdad Chalaki Vaishnav Tadiparthi Hossein Nourkhiz Mahjoub Ehsan Moradi-Pari Hoda Eldardiry Almuatazbellah Boker 79 0 0 02 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory Yangchun Zhang Wang Zhou Yirui Zhou 95 0 0 31 Dec 2024
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator Andrew Levy A. Allievi George Konidaris 105 0 0 15 Oct 2024
Wind Estimation in Unmanned Aerial Vehicles with Causal Machine Learning Abdulaziz Alwalan Miguel Arana-Catania 66 0 0 01 Jul 2024
Potential-Based Reward Shaping For Intrinsic Motivation Grant C. Forbes Nitish Gupta Leonardo Villalobos-Arias Colin M. Potts Arnav Jhala David L. Roberts 18 5 0 12 Feb 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems Changshuo Zhang Sirui Chen Xiao Zhang Sunhao Dai Weijie Yu Jun Xu OffRL 103 1 0 17 Jan 2024
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction Seohong Park Oleh Rybkin Sergey Levine OffRL 89 44 0 13 Oct 2023
ELDEN: Exploration via Local Dependencies Jiaheng Hu Zizhao Wang Peter Stone Roberto Martin-Martin 95 8 0 12 Oct 2023
Learning How to Propagate Messages in Graph Neural Networks Teng Xiao Zhengyu Chen Donglin Wang Suhang Wang GNN 96 80 0 01 Oct 2023
Curious Replay for Model-based Adaptation Isaac Kauvar Christopher Doyle Linqi Zhou Nick Haber 62 12 0 28 Jun 2023
Environmental path-entropy and collective motion H. Devereux M. S. Turner 26 6 0 31 Mar 2023
Sample-efficient Adversarial Imitation Learning Dahuin Jung Hyungyu Lee Sung-Hoon Yoon SSL 74 2 0 14 Mar 2023
A general Markov decision process formalism for action-state entropy-regularized reward maximization D. Grytskyy Jorge Ramírez-Ruiz R. Moreno-Bote 88 3 0 02 Feb 2023
Centralized Cooperative Exploration Policy for Continuous Control Tasks Chong Li Chen Gong Qiang He Xinwen Hou Yu Liu 80 1 0 06 Jan 2023
Intrinsic Motivation in Dynamical Control Systems Stas Tiomkin I. Nemenman Daniel Polani Naftali Tishby 59 5 0 29 Dec 2022
Representation Learning in Deep RL via Discrete Information Bottleneck Riashat Islam Hongyu Zang Manan Tomar Aniket Didolkar Md. Mofijul Islam ... Tariq Iqbal Xin-hui Li Anirudh Goyal N. Heess Alex Lamb SSL OffRL 69 8 0 28 Dec 2022
Automated Gadget Discovery in Science Lea M. Trenkwalder Andrea López-Incera Hendrik Poulsen Nautrup Fulvio Flamini Hans J. Briegel 55 3 0 24 Dec 2022
Efficient Exploration in Resource-Restricted Reinforcement Learning Zhihai Wang Taoxing Pan Qi Zhou Jie Wang OffRL 54 12 0 14 Dec 2022
Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization Xiaodong Li Pangjing Wu Chenxin Zou Qing Li 54 3 0 11 Dec 2022
Learning General World Models in a Handful of Reward-Free Deployments Yingchen Xu Jack Parker-Holder Aldo Pacchiano Philip J. Ball Oleh Rybkin Stephen J. Roberts Tim Rocktaschel Edward Grefenstette OffRL 112 10 0 23 Oct 2022
A Mixture of Surprises for Unsupervised Reinforcement Learning Andrew Zhao Matthieu Lin Yangguang Li Yang Liu Gao Huang 68 13 0 13 Oct 2022
Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions Chenhao Li Sebastian Blaes Pavel Kolev Marin Vlastelica Jonas Frey Georg Martius SSL 124 31 0 16 Sep 2022
Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards Yixiang Wang Yujing Hu Feng Wu Yingfeng Chen 60 2 0 29 Jul 2022
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models Alex Lamb Riashat Islam Yonathan Efroni Aniket Didolkar Dipendra Kumar Misra Dylan J. Foster Lekan Molu Rajan Chari A. Krishnamurthy John Langford 99 24 0 17 Jul 2022
Uniqueness and Complexity of Inverse MDP Models Marcus Hutter Steven Hansen 75 5 0 02 Jun 2022
First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization S. Reddy Sergey Levine Anca Dragan SSL 73 13 0 24 May 2022
Exploration in Deep Reinforcement Learning: A Survey Pawel Ladosz Lilian Weng Minwoo Kim H. Oh OffRL 93 365 0 02 May 2022
Discovering Intrinsic Reward with Contrastive Random Walk Zixuan Pan Zihao Wei Yidong Huang Aditya Gupta 55 0 0 23 Apr 2022
INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL Homanga Bharadhwaj Mohammad Babaeizadeh D. Erhan Sergey Levine 91 31 0 18 Apr 2022
Open-Ended Reinforcement Learning with Neural Reward Functions Robert Meier Asier Mujika 101 7 0 16 Feb 2022
Generative Adversarial Exploration for Reinforcement Learning Weijun Hong Menghui Zhu Minghuan Liu Weinan Zhang Ming Zhou Yong Yu Peng Sun OnRL 68 7 0 27 Jan 2022
Solving Dynamic Principal-Agent Problems with a Rationally Inattentive Principal Tong Mu Stephan Zheng Alexander R. Trott 42 3 0 18 Jan 2022
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration Lu Zheng Jiarui Chen Jianhao Wang Jiamin He Yujing Hu Yingfeng Chen Changjie Fan Yang Gao Chongjie Zhang 71 86 0 22 Nov 2021
Play to Grade: Testing Coding Games as Classifying Markov Decision Process Allen Nie Emma Brunskill Chris Piech 70 11 0 27 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching Pierre-Alexandre Kamienny Jean Tarbouriech Sylvain Lamprier A. Lazaric Ludovic Denoyer SSL 111 18 0 27 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning Benjamin Eysenbach Ruslan Salakhutdinov Sergey Levine SSL OffRL 118 35 0 06 Oct 2021
Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration Oliver Groth Markus Wulfmeier Giulia Vezzani Vibhavari Dasagi Tim Hertweck Roland Hafner N. Heess Martin Riedmiller LRM 78 20 0 17 Sep 2021
APS: Active Pretraining with Successor Features Hao Liu Pieter Abbeel 110 123 0 31 Aug 2021
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration Chen Wang Claudia Pérez-DÁrpino Danfei Xu Li Fei-Fei Chenxi Liu Silvio Savarese 138 34 0 13 Aug 2021
Experimental Evidence that Empowerment May Drive Exploration in Sparse-Reward Environments F. Massari Martin Biehl L. Meeden Ryota Kanai 24 0 0 14 Jul 2021
Explore and Control with Adversarial Surprise Arnaud Fickinger Natasha Jaques Samyak Parajuli Michael Chang Nicholas Rhinehart Glen Berseth Stuart J. Russell Sergey Levine 73 8 0 12 Jul 2021
Backprop-Free Reinforcement Learning with Active Neural Generative Coding Alexander Ororbia A. Mali 92 17 0 10 Jul 2021
MADE: Exploration via Maximizing Deviation from Explored Regions Tianjun Zhang Paria Rashidinejad Jiantao Jiao Yuandong Tian Joseph E. Gonzalez Stuart J. Russell OffRL 96 44 0 18 Jun 2021