ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.11593
  4. Cited By
Observe and Look Further: Achieving Consistent Performance on Atari

Observe and Look Further: Achieving Consistent Performance on Atari

29 May 2018
Tobias Pohlen
Bilal Piot
Todd Hester
M. G. Azar
Dan Horgan
David Budden
Gabriel Barth-Maron
H. V. Hasselt
John Quan
Mel Vecerík
Matteo Hessel
Rémi Munos
Olivier Pietquin
ArXivPDFHTML

Papers citing "Observe and Look Further: Achieving Consistent Performance on Atari"

31 / 31 papers shown
Title
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
65
0
0
03 Jan 2025
Enhancing Reinforcement Learning Agents with Local Guides
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
Policy-Based Self-Competition for Planning Problems
Policy-Based Self-Competition for Planning Problems
Jonathan Pirnay
Q. Göttl
Jakob Burger
D. G. Grimm
34
3
0
07 Jun 2023
Robust Scheduling with GFlowNets
Robust Scheduling with GFlowNets
David W. Zhang
Corrado Rainone
M. Peschl
Roberto Bondesan
29
49
0
17 Jan 2023
MAN: Multi-Action Networks Learning
MAN: Multi-Action Networks Learning
Keqin Wang
Alison Bartsch
A. Farimani
16
3
0
19 Sep 2022
Deep Q-learning: a robust control approach
Deep Q-learning: a robust control approach
B. Varga
Balázs Kulcsár
M. Chehreghani
OOD
22
9
0
21 Jan 2022
DR3: Value-Based Deep Reinforcement Learning Requires Explicit
  Regularization
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Aviral Kumar
Rishabh Agarwal
Tengyu Ma
Aaron Courville
George Tucker
Sergey Levine
OffRL
31
65
0
09 Dec 2021
Self-Consistent Models and Values
Self-Consistent Models and Values
Roy Miles
Kate Baumli
Zita Marinho
Angelos Filos
Matteo Hessel
Hado van Hasselt
David Silver
38
8
0
25 Oct 2021
On Bonus-Based Exploration Methods in the Arcade Learning Environment
On Bonus-Based Exploration Methods in the Arcade Learning Environment
Adrien Ali Taïga
W. Fedus
Marlos C. Machado
Aaron Courville
Marc G. Bellemare
16
58
0
22 Sep 2021
Convergent and Efficient Deep Q Network Algorithm
Convergent and Efficient Deep Q Network Algorithm
Zhikang T. Wang
Masahito Ueda
14
12
0
29 Jun 2021
Taylor Expansion of Discount Factors
Taylor Expansion of Discount Factors
Yunhao Tang
Mark Rowland
Rémi Munos
Michal Valko
OffRL
26
5
0
11 Jun 2021
Regularized Behavior Value Estimation
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
23
37
0
17 Mar 2021
A Survey on Deep Reinforcement Learning for Audio-Based Applications
A Survey on Deep Reinforcement Learning for Audio-Based Applications
S. Latif
Heriberto Cuayáhuitl
Farrukh Pervez
Fahad Shamshad
Hafiz Shehbaz Ali
Erik Cambria
OffRL
44
73
0
01 Jan 2021
Offline Learning from Demonstrations and Unlabeled Experience
Offline Learning from Demonstrations and Unlabeled Experience
Konrad Zolna
Alexander Novikov
Ksenia Konyushkova
Çağlar Gülçehre
Ziyun Wang
Y. Aytar
Misha Denil
Nando de Freitas
Scott E. Reed
SSL
OffRL
32
66
0
27 Nov 2020
Learning Abstract Models for Strategic Exploration and Fast Reward
  Transfer
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
E. Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
19
5
0
12 Jul 2020
Rinascimento: using event-value functions for playing Splendor
Rinascimento: using event-value functions for playing Splendor
Ivan Bravi
Simon Lucas
22
2
0
10 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
60
225
0
01 Jun 2020
Agent57: Outperforming the Atari Human Benchmark
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
13
509
0
30 Mar 2020
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
37
188
0
23 Dec 2019
Integrating Behavior Cloning and Reinforcement Learning for Improved
  Performance in Dense and Sparse Reward Environments
Integrating Behavior Cloning and Reinforcement Learning for Improved Performance in Dense and Sparse Reward Environments
Vinicius G. Goecks
Gregory M. Gremillion
Vernon J. Lawhern
J. Valasek
Nicholas R. Waytowich
OffRL
11
31
0
09 Oct 2019
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from
  forbidden action
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action
Mathieu Seurin
Philippe Preux
Olivier Pietquin
16
12
0
04 Oct 2019
Task-Relevant Adversarial Imitation Learning
Task-Relevant Adversarial Imitation Learning
Konrad Zolna
Scott E. Reed
Alexander Novikov
Sergio Gomez Colmenarejo
David Budden
Serkan Cabi
Misha Denil
Nando de Freitas
Ziyun Wang
GAN
20
61
0
02 Oct 2019
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the
  playing field
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
22
24
0
13 Aug 2019
Attentive Multi-Task Deep Reinforcement Learning
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
15
18
0
05 Jul 2019
Learning and Exploiting Multiple Subgoals for Fast Exploration in
  Hierarchical Reinforcement Learning
Learning and Exploiting Multiple Subgoals for Fast Exploration in Hierarchical Reinforcement Learning
Libo Xing
11
4
0
13 May 2019
World Discovery Models
World Discovery Models
M. G. Azar
Bilal Piot
Bernardo Avila-Pires
Jean-Bastien Grill
Florent Altché
Rémi Munos
21
26
0
20 Feb 2019
Learning Montezuma's Revenge from a Single Demonstration
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
20
136
0
08 Dec 2018
Sample Efficient Adaptive Text-to-Speech
Sample Efficient Adaptive Text-to-Speech
Yutian Chen
Yannis Assael
Brendan Shillingford
David Budden
Scott E. Reed
...
Ben Laurie
Çağlar Gülçehre
Aaron van den Oord
Oriol Vinyals
Nando de Freitas
30
149
0
27 Sep 2018
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Expert-augmented actor-critic for ViZDoom and Montezumas Revenge
Michal Garmulewicz
Henryk Michalewski
Piotr Milos
14
8
0
10 Sep 2018
RUDDER: Return Decomposition for Delayed Rewards
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
24
212
0
20 Jun 2018
Constrained Policy Improvement for Safe and Efficient Reinforcement
  Learning
Constrained Policy Improvement for Safe and Efficient Reinforcement Learning
Elad Sarafian
Aviv Tamar
Sarit Kraus
OffRL
26
11
0
20 May 2018
1