Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1509.08731
Cited By
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning
29 September 2015
S. Mohamed
Danilo Jimenez Rezende
DRL
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning"
50 / 157 papers shown
Title
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
Hao Hu
Xinqi Wang
Simon S. Du
OffRL
36
0
0
10 Jun 2025
Plasticity as the Mirror of Empowerment
David Abel
Michael Bowling
André Barreto
Will Dabney
Shi Dong
...
Doina Precup
Jonathan Richens
Mark Rowland
Tom Schaul
Satinder Singh
AI4CE
73
0
0
15 May 2025
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
Cansu Sancaktar
Christian Gumbsch
Andrii Zadaianchuk
Pavel Kolev
Georg Martius
LM&Ro
VLM
163
2
0
03 Mar 2025
Universal AI maximizes Variational Empowerment
Yusuke Hayashi
Koichi Takahashi
83
0
0
20 Feb 2025
Towards Empowerment Gain through Causal Structure Learning in Model-Based RL
Hongye Cao
Fan Feng
Meng Fang
Shaokang Dong
Tianpei Yang
Jing Huo
Yang Gao
124
1
0
14 Feb 2025
CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation
Yuanchen Yuan
Jin Cheng
Núria Armengol Urpí
Stelian Coros
135
1
0
02 Feb 2025
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
138
5
0
17 Jan 2025
In Search of a Lost Metric: Human Empowerment as a Pillar of Socially Conscious Navigation
Vasanth Reddy Baddam
Behdad Chalaki
Vaishnav Tadiparthi
Hossein Nourkhiz Mahjoub
Ehsan Moradi-Pari
Hoda Eldardiry
Almuatazbellah Boker
79
0
0
02 Jan 2025
On Reward Transferability in Adversarial Inverse Reinforcement Learning: Insights from Random Matrix Theory
Yangchun Zhang
Wang Zhou
Yirui Zhou
95
0
0
31 Dec 2024
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator
Andrew Levy
A. Allievi
George Konidaris
105
0
0
15 Oct 2024
Wind Estimation in Unmanned Aerial Vehicles with Causal Machine Learning
Abdulaziz Alwalan
Miguel Arana-Catania
66
0
0
01 Jul 2024
Potential-Based Reward Shaping For Intrinsic Motivation
Grant C. Forbes
Nitish Gupta
Leonardo Villalobos-Arias
Colin M. Potts
Arnav Jhala
David L. Roberts
18
5
0
12 Feb 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
103
1
0
17 Jan 2024
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
89
44
0
13 Oct 2023
ELDEN: Exploration via Local Dependencies
Jiaheng Hu
Zizhao Wang
Peter Stone
Roberto Martin-Martin
95
8
0
12 Oct 2023
Learning How to Propagate Messages in Graph Neural Networks
Teng Xiao
Zhengyu Chen
Donglin Wang
Suhang Wang
GNN
96
80
0
01 Oct 2023
Curious Replay for Model-based Adaptation
Isaac Kauvar
Christopher Doyle
Linqi Zhou
Nick Haber
62
12
0
28 Jun 2023
Environmental path-entropy and collective motion
H. Devereux
M. S. Turner
26
6
0
31 Mar 2023
Sample-efficient Adversarial Imitation Learning
Dahuin Jung
Hyungyu Lee
Sung-Hoon Yoon
SSL
74
2
0
14 Mar 2023
A general Markov decision process formalism for action-state entropy-regularized reward maximization
D. Grytskyy
Jorge Ramírez-Ruiz
R. Moreno-Bote
88
3
0
02 Feb 2023
Centralized Cooperative Exploration Policy for Continuous Control Tasks
Chong Li
Chen Gong
Qiang He
Xinwen Hou
Yu Liu
80
1
0
06 Jan 2023
Intrinsic Motivation in Dynamical Control Systems
Stas Tiomkin
I. Nemenman
Daniel Polani
Naftali Tishby
59
5
0
29 Dec 2022
Representation Learning in Deep RL via Discrete Information Bottleneck
Riashat Islam
Hongyu Zang
Manan Tomar
Aniket Didolkar
Md. Mofijul Islam
...
Tariq Iqbal
Xin-hui Li
Anirudh Goyal
N. Heess
Alex Lamb
SSL
OffRL
69
8
0
28 Dec 2022
Automated Gadget Discovery in Science
Lea M. Trenkwalder
Andrea López-Incera
Hendrik Poulsen Nautrup
Fulvio Flamini
Hans J. Briegel
55
3
0
24 Dec 2022
Efficient Exploration in Resource-Restricted Reinforcement Learning
Zhihai Wang
Taoxing Pan
Qi Zhou
Jie Wang
OffRL
54
12
0
14 Dec 2022
Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization
Xiaodong Li
Pangjing Wu
Chenxin Zou
Qing Li
54
3
0
11 Dec 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
112
10
0
23 Oct 2022
A Mixture of Surprises for Unsupervised Reinforcement Learning
Andrew Zhao
Matthieu Lin
Yangguang Li
Yang Liu
Gao Huang
68
13
0
13 Oct 2022
Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions
Chenhao Li
Sebastian Blaes
Pavel Kolev
Marin Vlastelica
Jonas Frey
Georg Martius
SSL
124
31
0
16 Sep 2022
Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards
Yixiang Wang
Yujing Hu
Feng Wu
Yingfeng Chen
60
2
0
29 Jul 2022
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models
Alex Lamb
Riashat Islam
Yonathan Efroni
Aniket Didolkar
Dipendra Kumar Misra
Dylan J. Foster
Lekan Molu
Rajan Chari
A. Krishnamurthy
John Langford
99
24
0
17 Jul 2022
Uniqueness and Complexity of Inverse MDP Models
Marcus Hutter
Steven Hansen
75
5
0
02 Jun 2022
First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization
S. Reddy
Sergey Levine
Anca Dragan
SSL
73
13
0
24 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
93
365
0
02 May 2022
Discovering Intrinsic Reward with Contrastive Random Walk
Zixuan Pan
Zihao Wei
Yidong Huang
Aditya Gupta
55
0
0
23 Apr 2022
INFOrmation Prioritization through EmPOWERment in Visual Model-Based RL
Homanga Bharadhwaj
Mohammad Babaeizadeh
D. Erhan
Sergey Levine
91
31
0
18 Apr 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
101
7
0
16 Feb 2022
Generative Adversarial Exploration for Reinforcement Learning
Weijun Hong
Menghui Zhu
Minghuan Liu
Weinan Zhang
Ming Zhou
Yong Yu
Peng Sun
OnRL
68
7
0
27 Jan 2022
Solving Dynamic Principal-Agent Problems with a Rationally Inattentive Principal
Tong Mu
Stephan Zheng
Alexander R. Trott
42
3
0
18 Jan 2022
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
71
86
0
22 Nov 2021
Play to Grade: Testing Coding Games as Classifying Markov Decision Process
Allen Nie
Emma Brunskill
Chris Piech
70
11
0
27 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
111
18
0
27 Oct 2021
The Information Geometry of Unsupervised Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
118
35
0
06 Oct 2021
Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration
Oliver Groth
Markus Wulfmeier
Giulia Vezzani
Vibhavari Dasagi
Tim Hertweck
Roland Hafner
N. Heess
Martin Riedmiller
LRM
78
20
0
17 Sep 2021
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
110
123
0
31 Aug 2021
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
Chen Wang
Claudia Pérez-DÁrpino
Danfei Xu
Li Fei-Fei
Chenxi Liu
Silvio Savarese
138
34
0
13 Aug 2021
Experimental Evidence that Empowerment May Drive Exploration in Sparse-Reward Environments
F. Massari
Martin Biehl
L. Meeden
Ryota Kanai
24
0
0
14 Jul 2021
Explore and Control with Adversarial Surprise
Arnaud Fickinger
Natasha Jaques
Samyak Parajuli
Michael Chang
Nicholas Rhinehart
Glen Berseth
Stuart J. Russell
Sergey Levine
73
8
0
12 Jul 2021
Backprop-Free Reinforcement Learning with Active Neural Generative Coding
Alexander Ororbia
A. Mali
92
17
0
10 Jul 2021
MADE: Exploration via Maximizing Deviation from Explored Regions
Tianjun Zhang
Paria Rashidinejad
Jiantao Jiao
Yuandong Tian
Joseph E. Gonzalez
Stuart J. Russell
OffRL
96
44
0
18 Jun 2021
1
2
3
4
Next