ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.08621
  4. Cited By
BeBold: Exploration Beyond the Boundary of Explored Regions

BeBold: Exploration Beyond the Boundary of Explored Regions

15 December 2020
Tianjun Zhang
Huazhe Xu
Xiaolong Wang
Yi Wu
Kurt Keutzer
Joseph E. Gonzalez
Yuandong Tian
ArXiv (abs)PDFHTML

Papers citing "BeBold: Exploration Beyond the Boundary of Explored Regions"

30 / 30 papers shown
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Glen Berseth
OffRL
162
1
0
02 Aug 2025
$β$-DQN: Improving Deep Q-Learning By Evolving the Behavior
βββ-DQN: Improving Deep Q-Learning By Evolving the BehaviorAdaptive Agents and Multi-Agent Systems (AAMAS), 2025
Hongming Zhang
Fengshuo Bai
Chenjun Xiao
Chao Gao
Bo Xu
Martin Müller
OffRL
396
3
0
01 Jan 2025
NAVIX: Scaling MiniGrid Environments with JAX
NAVIX: Scaling MiniGrid Environments with JAX
Eduardo Pignatelli
Jarek Liesen
R. T. Lange
Chris Xiaoxuan Lu
Pablo Samuel Castro
Laura Toni
399
12
0
28 Jul 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
528
9
0
29 May 2024
OpenRL: A Unified Reinforcement Learning Framework
OpenRL: A Unified Reinforcement Learning Framework
Shiyu Huang
Wentse Chen
Yiwen Sun
Fuqing Bie
Weijuan Tu
174
4
0
20 Dec 2023
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Artem Agarkov
Viacheslav Sinii
Sergey Kolesnikov
465
50
0
19 Dec 2023
minimax: Efficient Baselines for Autocurricula in JAX
minimax: Efficient Baselines for Autocurricula in JAX
Minqi Jiang
Michael Dennis
Edward Grefenstette
Tim Rocktaschel
354
11
0
21 Nov 2023
A Neuro-mimetic Realization of the Common Model of Cognition via Hebbian
  Learning and Free Energy Minimization
A Neuro-mimetic Realization of the Common Model of Cognition via Hebbian Learning and Free Energy Minimization
Alexander Ororbia
Mary Alexandria Kelly
AI4CE
228
4
0
14 Oct 2023
The SocialAI School: Insights from Developmental Psychology Towards
  Artificial Socio-Cultural Agents
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
180
28
0
15 Jul 2023
Learning to Solve Tasks with Exploring Prior Behaviours
Learning to Solve Tasks with Exploring Prior BehavioursIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Ruiqi Zhu
Siyuan Li
Tianhong Dai
Chongjie Zhang
Oya Celiktutan
252
5
0
06 Jul 2023
Approximate information state based convergence analysis of recurrent
  Q-learning
Approximate information state based convergence analysis of recurrent Q-learning
Erfan Seyedsalehi
N. Akbarzadeh
Amit Sinha
Aditya Mahajan
198
6
0
09 Jun 2023
Using Offline Data to Speed-up Reinforcement Learning in Procedurally
  Generated Environments
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated EnvironmentsNeurocomputing (Neurocomputing), 2023
Alain Andres
Lukas Schafer
Esther Villar-Rodriguez
Stefano V. Albrecht
Javier Del Ser
OffRLOnRL
222
7
0
18 Apr 2023
TiZero: Mastering Multi-Agent Football with Curriculum Learning and
  Self-Play
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-PlayAdaptive Agents and Multi-Agent Systems (AAMAS), 2023
Fanqing Lin
Shiyu Huang
Tim Pearce
Wenze Chen
Weijuan Tu
315
32
0
15 Feb 2023
BIMRL: Brain Inspired Meta Reinforcement Learning
BIMRL: Brain Inspired Meta Reinforcement LearningIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Seyed Roozbeh Razavi Rohani
Saeed Hedayatian
M. Baghshah
145
5
0
29 Oct 2022
An information-theoretic perspective on intrinsic motivation in
  reinforcement learning: a survey
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
276
53
0
19 Sep 2022
Play with Emotion: Affect-Driven Reinforcement Learning
Play with Emotion: Affect-Driven Reinforcement LearningAffective Computing and Intelligent Interaction (ACII), 2022
M. Barthet
Ahmed Khalifa
Antonios Liapis
Georgios N. Yannakakis
CVBM
190
10
0
26 Aug 2022
Generative Personas That Behave and Experience Like Humans
Generative Personas That Behave and Experience Like HumansInternational Conference on Foundations of Digital Games (FDG), 2022
M. Barthet
Ahmed Khalifa
Antonios Liapis
Georgios N. Yannakakis
230
26
0
26 Aug 2022
Maze Learning using a Hyperdimensional Predictive Processing Cognitive
  Architecture
Maze Learning using a Hyperdimensional Predictive Processing Cognitive ArchitectureArtificial General Intelligence (AGI), 2022
Alexander Ororbia
Mary Alexandria Kelly
AI4CE
203
6
0
31 Mar 2022
C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks
C-Planning: An Automatic Curriculum for Learning Goal-Reaching TasksInternational Conference on Learning Representations (ICLR), 2021
Tianjun Zhang
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
Joseph E. Gonzalez
OffRL
220
18
0
22 Oct 2021
Dynamic Bottleneck for Robust Self-Supervised Exploration
Dynamic Bottleneck for Robust Self-Supervised ExplorationNeural Information Processing Systems (NeurIPS), 2021
Chenjia Bai
Lingxiao Wang
Lei Han
Animesh Garg
Jianye Hao
Peng Liu
Zhaoran Wang
163
36
0
20 Oct 2021
TiKick: Towards Playing Multi-agent Football Full Games from
  Single-agent Demonstrations
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
Shiyu Huang
Wenze Chen
Longfei Zhang
Shizhen Xu
Ziyang Li
Fengming Zhu
Deheng Ye
Tingling Chen
Jun Zhu
OffRL
349
28
0
09 Oct 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning
  Research
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
503
105
0
27 Sep 2021
Generalization in Text-based Games via Hierarchical Reinforcement
  Learning
Generalization in Text-based Games via Hierarchical Reinforcement Learning
Yunqiu Xu
Meng Fang
Ling Chen
Yali Du
Chengqi Zhang
AI4CE
189
22
0
21 Sep 2021
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Roberto Bigazzi
Federico Landi
S. Cascianelli
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
OffRL
259
21
0
14 Sep 2021
Explore and Control with Adversarial Surprise
Explore and Control with Adversarial Surprise
Arnaud Fickinger
Natasha Jaques
Samyak Parajuli
Michael Chang
Nicholas Rhinehart
Glen Berseth
Stuart J. Russell
Sergey Levine
268
8
0
12 Jul 2021
MADE: Exploration via Maximizing Deviation from Explored Regions
MADE: Exploration via Maximizing Deviation from Explored RegionsNeural Information Processing Systems (NeurIPS), 2021
Tianjun Zhang
Paria Rashidinejad
Jiantao Jiao
Yuandong Tian
Joseph E. Gonzalez
Stuart J. Russell
OffRL
237
50
0
18 Jun 2021
Don't Do What Doesn't Matter: Intrinsic Motivation with Action
  Usefulness
Don't Do What Doesn't Matter: Intrinsic Motivation with Action UsefulnessInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Mathieu Seurin
Florian Strub
Philippe Preux
Olivier Pietquin
212
10
0
20 May 2021
Co-Imitation Learning without Expert Demonstration
Co-Imitation Learning without Expert Demonstration
Hai-Jian Ke
Hu Xu
Kun Zhu
Sheng-Jun Huang
OffRL
326
4
0
27 Mar 2021
Prioritized Level Replay
Prioritized Level Replay
Minqi Jiang
Edward Grefenstette
Tim Rocktaschel
OffRL
571
203
0
08 Oct 2020
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcík
Martin Schmid
Neil Burch
Viliam Lisý
Dustin Morrill
Nolan Bard
Trevor Davis
Kevin Waugh
Michael Bradley Johanson
Michael Bowling
BDL
601
976
0
06 Jan 2017
1
Page 1 of 1