Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.08090
Cited By
Count-Based Exploration in Feature Space for Reinforcement Learning
25 June 2017
Jarryd Martin
S. N. Sasikumar
Tom Everitt
Marcus Hutter
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Count-Based Exploration in Feature Space for Reinforcement Learning"
26 / 26 papers shown
Title
Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Seungjae Lee
Daniel Ekpo
Haowen Liu
Furong Huang
Abhinav Shrivastava
Jia-Bin Huang
LM&Ro
40
0
0
12 May 2025
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
29
5
0
06 Aug 2024
Robust Q-Learning for finite ambiguity sets
Cécile Decker
Julian Sester
30
0
0
05 Jul 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
34
0
0
29 May 2024
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
Matej Pecháč
M. Chovanec
Igor Farkaš
24
3
0
22 Feb 2023
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
B. Kveton
A. Rangi
OffRL
59
0
0
01 Feb 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
24
7
0
26 Jan 2023
Foundation Models for Semantic Novelty in Reinforcement Learning
Tarun Gupta
Peter Karkus
Tong Che
Danfei Xu
Marco Pavone
VLM
OffRL
LRM
39
7
0
09 Nov 2022
Exploration via Elliptical Episodic Bonuses
Mikael Henaff
Roberta Raileanu
Minqi Jiang
Tim Rocktaschel
OffRL
29
39
0
11 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
31
35
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
26
12
0
19 Sep 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
24
11
0
12 Jul 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses
Eliza Kosoy
David M. Chan
Adrian Liu
Jasmine Collins
Bryanna Kaufmann
Sandy Han Huang
Jessica B. Hamrick
John F. Canny
Nan Rosemary Ke
Alison Gopnik
CML
AI4CE
26
18
0
16 Jun 2022
Nuclear Norm Maximization Based Curiosity-Driven Learning
Chao Chen
Zijian Gao
Kele Xu
Sen Yang
Yiying Li
Bo Ding
Dawei Feng
Huaimin Wang
131
5
0
21 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
23
323
0
02 May 2022
Semantic Exploration from Language Abstractions and Pretrained Representations
Allison C. Tam
Neil C. Rabinowitz
Andrew Kyle Lampinen
Nicholas A. Roy
Stephanie C. Y. Chan
D. Strouse
Jane X. Wang
Andrea Banino
Felix Hill
LM&Ro
30
67
0
08 Apr 2022
Learning Causal Overhypotheses through Exploration in Children and Computational Models
Eliza Kosoy
Adrian Liu
Jasmine Collins
David M. Chan
Jessica B. Hamrick
Nan Rosemary Ke
Sandy H Huang
Bryanna Kaufmann
John F. Canny
Alison Gopnik
CML
22
9
0
21 Feb 2022
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Jordan T. Ash
Cyril Zhang
Surbhi Goel
A. Krishnamurthy
Sham Kakade
35
6
0
21 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
33
92
0
14 Sep 2021
A Survey of Exploration Methods in Reinforcement Learning
Susan Amin
Maziar Gomrokchi
Harsh Satija
H. V. Hoof
Doina Precup
OffRL
16
80
0
01 Sep 2021
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
39
89
0
14 Jul 2021
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
38
348
0
27 Apr 2020
Deep Policies for Width-Based Planning in Pixel Domains
Miquel Junyent
Anders Jonsson
Vicencc Gómez
28
10
0
12 Apr 2019
Malthusian Reinforcement Learning
Joel Z. Leibo
Julien Perolat
Edward Hughes
S. Wheelwright
Adam H. Marblestone
Edgar A. Duénez-Guzmán
P. Sunehag
Iain Dunning
T. Graepel
AI4CE
14
37
0
17 Dec 2018
Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes
Jian Qian
Ronan Fruit
Matteo Pirotta
A. Lazaric
6
10
0
11 Dec 2018
Universal Reinforcement Learning Algorithms: Survey and Experiments
John Aslanides
Jan Leike
Marcus Hutter
OffRL
26
19
0
30 May 2017
1