Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2012.08621
Cited By
BeBold: Exploration Beyond the Boundary of Explored Regions
15 December 2020
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BeBold: Exploration Beyond the Boundary of Explored Regions"
30 / 30 papers shown
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Glen Berseth
OffRL
162
1
0
02 Aug 2025
β
β
β
-DQN: Improving Deep Q-Learning By Evolving the Behavior
Adaptive Agents and Multi-Agent Systems (AAMAS), 2025
Hongming Zhang
Fengshuo Bai
Chenjun Xiao
Chao Gao
Bo Xu
Martin Müller
OffRL
396
3
0
01 Jan 2025
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
399
12
0
28 Jul 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
528
9
0
29 May 2024
OpenRL: A Unified Reinforcement Learning Framework
Shiyu Huang
Wentse Chen
Yiwen Sun
Fuqing Bie
Weijuan Tu
174
4
0
20 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
465
50
0
19 Dec 2023
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktaschel
354
11
0
21 Nov 2023
A Neuro-mimetic Realization of the Common Model of Cognition via Hebbian Learning and Free Energy Minimization
Alexander Ororbia
Mary Alexandria Kelly
AI4CE
228
4
0
14 Oct 2023
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
180
28
0
15 Jul 2023
Learning to Solve Tasks with Exploring Prior Behaviours
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Ruiqi Zhu
Siyuan Li
Tianhong Dai
Chongjie Zhang
Oya Celiktutan
252
5
0
06 Jul 2023
Approximate information state based convergence analysis of recurrent Q-learning
Erfan Seyedsalehi
N. Akbarzadeh
Amit Sinha
Aditya Mahajan
198
6
0
09 Jun 2023
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
Neurocomputing (Neurocomputing), 2023
Alain Andres
Lukas Schafer
Esther Villar-Rodriguez
Stefano V. Albrecht
Javier Del Ser
OffRL
OnRL
222
7
0
18 Apr 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Adaptive Agents and Multi-Agent Systems (AAMAS), 2023
Fanqing Lin
Shiyu Huang
Tim Pearce
Wenze Chen
Weijuan Tu
315
32
0
15 Feb 2023
BIMRL: Brain Inspired Meta Reinforcement Learning
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Seyed Roozbeh Razavi Rohani
Saeed Hedayatian
M. Baghshah
145
5
0
29 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
276
53
0
19 Sep 2022
Play with Emotion: Affect-Driven Reinforcement Learning
Affective Computing and Intelligent Interaction (ACII), 2022
M. Barthet
Ahmed Khalifa
Antonios Liapis
Georgios N. Yannakakis
CVBM
190
10
0
26 Aug 2022
Generative Personas That Behave and Experience Like Humans
International Conference on Foundations of Digital Games (FDG), 2022
M. Barthet
Ahmed Khalifa
Antonios Liapis
Georgios N. Yannakakis
230
26
0
26 Aug 2022
Maze Learning using a Hyperdimensional Predictive Processing Cognitive Architecture
Artificial General Intelligence (AGI), 2022
Alexander Ororbia
Mary Alexandria Kelly
AI4CE
203
6
0
31 Mar 2022
C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks
International Conference on Learning Representations (ICLR), 2021
Tianjun Zhang
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
Joseph E. Gonzalez
OffRL
220
18
0
22 Oct 2021
Dynamic Bottleneck for Robust Self-Supervised Exploration
Neural Information Processing Systems (NeurIPS), 2021
Chenjia Bai
Lingxiao Wang
Lei Han
Animesh Garg
Jianye Hao
Peng Liu
Zhaoran Wang
163
36
0
20 Oct 2021
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
Shiyu Huang
Wenze Chen
Longfei Zhang
Shizhen Xu
Ziyang Li
Fengming Zhu
Deheng Ye
Tingling Chen
Jun Zhu
OffRL
349
28
0
09 Oct 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
503
105
0
27 Sep 2021
Generalization in Text-based Games via Hierarchical Reinforcement Learning
Yunqiu Xu
Meng Fang
Ling Chen
Yali Du
Chengqi Zhang
AI4CE
189
22
0
21 Sep 2021
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Roberto Bigazzi
Federico Landi
S. Cascianelli
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
OffRL
259
21
0
14 Sep 2021
Explore and Control with Adversarial Surprise
Arnaud Fickinger
Natasha Jaques
Samyak Parajuli
Michael Chang
Nicholas Rhinehart
Glen Berseth
Stuart J. Russell
Sergey Levine
268
8
0
12 Jul 2021
MADE: Exploration via Maximizing Deviation from Explored Regions
Neural Information Processing Systems (NeurIPS), 2021
Tianjun Zhang
Paria Rashidinejad
Jiantao Jiao
Yuandong Tian
Joseph E. Gonzalez
Stuart J. Russell
OffRL
237
50
0
18 Jun 2021
Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Mathieu Seurin
Florian Strub
Philippe Preux
Olivier Pietquin
212
10
0
20 May 2021
Co-Imitation Learning without Expert Demonstration
Hai-Jian Ke
Hu Xu
Kun Zhu
Sheng-Jun Huang
OffRL
326
4
0
27 Mar 2021
Prioritized Level Replay
Minqi Jiang
Edward Grefenstette
Tim Rocktaschel
OffRL
571
203
0
08 Oct 2020
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
601
976
0
06 Jan 2017
1
Page 1 of 1