ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.02298
  4. Cited By
Rainbow: Combining Improvements in Deep Reinforcement Learning

Rainbow: Combining Improvements in Deep Reinforcement Learning

6 October 2017
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
    OffRL
ArXivPDFHTML

Papers citing "Rainbow: Combining Improvements in Deep Reinforcement Learning"

50 / 362 papers shown
Title
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
19
77
0
16 Jul 2020
Revisiting Fundamentals of Experience Replay
Revisiting Fundamentals of Experience Replay
W. Fedus
Prajit Ramachandran
Rishabh Agarwal
Yoshua Bengio
Hugo Larochelle
Mark Rowland
Will Dabney
KELM
OffRL
30
234
0
13 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive
  Representations
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
41
310
0
12 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward
  Transfer
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
E. Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
27
5
0
12 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep
  Reinforcement Learning
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
25
199
0
09 Jul 2020
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement
  Learning
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng Meng
R. Gorbet
Dana Kulić
OffRL
27
27
0
23 Jun 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free
  learning
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
36
53
0
18 Jun 2020
Reinforcement Learning with Supervision from Noisy Demonstrations
Reinforcement Learning with Supervision from Noisy Demonstrations
Kun-Peng Ning
Sheng-Jun Huang
14
7
0
14 Jun 2020
Bootstrap your own latent: A new approach to self-supervised Learning
Bootstrap your own latent: A new approach to self-supervised Learning
Jean-Bastien Grill
Florian Strub
Florent Altché
Corentin Tallec
Pierre Harvey Richemond
...
M. G. Azar
Bilal Piot
Koray Kavukcuoglu
Rémi Munos
Michal Valko
SSL
137
6,665
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
SAMBA: Safe Model-Based & Active Reinforcement Learning
SAMBA: Safe Model-Based & Active Reinforcement Learning
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
23
44
0
12 Jun 2020
Temporally-Extended ε-Greedy Exploration
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
22
33
0
02 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Adversarial Attacks on Reinforcement Learning based Energy Management
  Systems of Extended Range Electric Delivery Vehicles
Adversarial Attacks on Reinforcement Learning based Energy Management Systems of Extended Range Electric Delivery Vehicles
Pengyue Wang
Y. Li
Shashi Shekhar
W. Northrop
AAML
13
8
0
01 Jun 2020
The Adversarial Resilience Learning Architecture for AI-based Modelling,
  Exploration, and Operation of Complex Cyber-Physical Systems
The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems
Eric M. S. P. Veith
Nils Wenninghoff
Emilie Frost
23
5
0
27 May 2020
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement
  Learning: An In Silico Validation
Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation
Taiyu Zhu
Kezhi Li
P. Herrero
Pantelis Georgiou
24
80
0
18 May 2020
Learning Adaptive Exploration Strategies in Dynamic Environments Through
  Informed Policy Regularization
Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Pierre-Alexandre Kamienny
Matteo Pirotta
A. Lazaric
Thibault Lavril
Nicolas Usunier
Ludovic Denoyer
6
19
0
06 May 2020
Evaluating the Rainbow DQN Agent in Hanabi with Unseen Partners
Evaluating the Rainbow DQN Agent in Hanabi with Unseen Partners
Rodrigo Canaan
Xianbo Gao
Youjin Chung
Julian Togelius
Andy Nealen
Stefan Menzel
19
4
0
28 Apr 2020
CURL: Contrastive Unsupervised Representations for Reinforcement
  Learning
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
A. Srinivas
Michael Laskin
Pieter Abbeel
SSL
DRL
OffRL
49
1,061
0
08 Apr 2020
Controlling Rayleigh-Bénard convection via Reinforcement Learning
Controlling Rayleigh-Bénard convection via Reinforcement Learning
Gerben Beintema
Alessandro Corbetta
Luca Biferale
F. Toschi
AI4CE
27
79
0
31 Mar 2020
Incorporating Relational Background Knowledge into Reinforcement
  Learning via Differentiable Inductive Logic Programming
Incorporating Relational Background Knowledge into Reinforcement Learning via Differentiable Inductive Logic Programming
Ali Payani
Faramarz Fekri
21
18
0
23 Mar 2020
Robust Deep Reinforcement Learning against Adversarial Perturbations on
  State Observations
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
Huan Zhang
Hongge Chen
Chaowei Xiao
Bo-wen Li
Mingyan D. Liu
Duane S. Boning
Cho-Jui Hsieh
AAML
44
261
0
19 Mar 2020
Simultaneous Navigation and Radio Mapping for Cellular-Connected UAV
  with Deep Reinforcement Learning
Simultaneous Navigation and Radio Mapping for Cellular-Connected UAV with Deep Reinforcement Learning
Yong Zeng
Xiaoli Xu
Shi Jin
Rui Zhang
9
164
0
17 Mar 2020
Sample Efficient Reinforcement Learning through Learning from
  Demonstrations in Minecraft
Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft
Christian Scheller
Yanick Schraner
Manfred Vogel
18
27
0
12 Mar 2020
Safe Imitation Learning via Fast Bayesian Reward Inference from
  Preferences
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences
Daniel S. Brown
Russell Coleman
R. Srinivasan
S. Niekum
BDL
30
101
0
21 Feb 2020
BADGR: An Autonomous Self-Supervised Learning-Based Navigation System
BADGR: An Autonomous Self-Supervised Learning-Based Navigation System
G. Kahn
Pieter Abbeel
Sergey Levine
SSL
19
260
0
13 Feb 2020
LaProp: Separating Momentum and Adaptivity in Adam
LaProp: Separating Momentum and Adaptivity in Adam
Liu Ziyin
Zhikang T.Wang
Masahito Ueda
ODL
13
18
0
12 Feb 2020
Data-driven control of micro-climate in buildings: an event-triggered
  reinforcement learning approach
Data-driven control of micro-climate in buildings: an event-triggered reinforcement learning approach
A. H. Hosseinloo
Alexander Ryzhov
A. Bischi
H. Ouerdane
K. Turitsyn
M. Dahleh
AI4CE
14
41
0
28 Jan 2020
Discrete and Continuous Action Representation for Practical RL in Video
  Games
Discrete and Continuous Action Representation for Practical RL in Video Games
Olivier Delalleau
Maxim Peter
Eloi Alonso
Adrien Logut
22
52
0
23 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Monte-Carlo Tree Search for Policy Optimization
Monte-Carlo Tree Search for Policy Optimization
Xiaobai Ma
Katherine Driggs-Campbell
Zongzhang Zhang
Mykel J. Kochenderfer
20
6
0
23 Dec 2019
Direct and indirect reinforcement learning
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
38
34
0
23 Dec 2019
Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention
  Networks
Learning to Dynamically Coordinate Multi-Robot Teams in Graph Attention Networks
Zheyuan Wang
Matthew C. Gombolay
18
7
0
04 Dec 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
45
541
0
03 Dec 2019
End-to-End Model-Free Reinforcement Learning for Urban Driving using
  Implicit Affordances
End-to-End Model-Free Reinforcement Learning for Urban Driving using Implicit Affordances
Marin Toromanoff
É. Wirbel
Fabien Moutarde
OffRL
44
205
0
25 Nov 2019
Worst Cases Policy Gradients
Worst Cases Policy Gradients
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
24
75
0
09 Nov 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
22
31
0
01 Nov 2019
Quantum enhancements for deep reinforcement learning in large spaces
Quantum enhancements for deep reinforcement learning in large spaces
Sofiene Jerbi
Lea M. Trenkwalder
Hendrik Poulsen Nautrup
H. Briegel
Vedran Dunjko
27
5
0
28 Oct 2019
Collision Avoidance in Pedestrian-Rich Environments with Deep
  Reinforcement Learning
Collision Avoidance in Pedestrian-Rich Environments with Deep Reinforcement Learning
Michael Everett
Yu Fan Chen
Jonathan P. How
OffRL
17
169
0
24 Oct 2019
Dealing with Sparse Rewards in Reinforcement Learning
Dealing with Sparse Rewards in Reinforcement Learning
J. Hare
21
77
0
21 Oct 2019
Adaptive Trade-Offs in Off-Policy Learning
Adaptive Trade-Offs in Off-Policy Learning
Mark Rowland
Will Dabney
Rémi Munos
OffRL
25
22
0
16 Oct 2019
On Empirical Comparisons of Optimizers for Deep Learning
On Empirical Comparisons of Optimizers for Deep Learning
Dami Choi
Christopher J. Shallue
Zachary Nado
Jaehoon Lee
Chris J. Maddison
George E. Dahl
14
256
0
11 Oct 2019
Deep Q-Network for Angry Birds
Deep Q-Network for Angry Birds
L. Sy
S. Redmond
16
5
0
04 Oct 2019
Benchmarking Batch Deep Reinforcement Learning Algorithms
Benchmarking Batch Deep Reinforcement Learning Algorithms
Shih-Han Chou
Wen-Yen Chang
W. Hsu
Jianlong Fu
OffRL
18
181
0
03 Oct 2019
Off-Policy Actor-Critic with Shared Experience Replay
Off-Policy Actor-Critic with Shared Experience Replay
Simon Schmitt
Matteo Hessel
Karen Simonyan
OffRL
27
68
0
25 Sep 2019
Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?
Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?
Ofir Nachum
Haoran Tang
Xingyu Lu
S. Gu
Honglak Lee
Sergey Levine
29
100
0
23 Sep 2019
MACS: Deep Reinforcement Learning based SDN Controller Synchronization
  Policy Design
MACS: Deep Reinforcement Learning based SDN Controller Synchronization Policy Design
Ziyao Zhang
Liang Ma
Konstantinos Poularakis
K. Leung
J. Tucker
A. Swami
16
14
0
19 Sep 2019
Automated Lane Change Decision Making using Deep Reinforcement Learning
  in Dynamic and Uncertain Highway Environment
Automated Lane Change Decision Making using Deep Reinforcement Learning in Dynamic and Uncertain Highway Environment
Ali Alizadeh
Majid Moghadam
Yunus Bicer
N. K. Üre
M. U. Yavas
C. Kurtulus
13
97
0
18 Sep 2019
Discovery of Useful Questions as Auxiliary Tasks
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah
Matteo Hessel
Zhongwen Xu
Richard L. Lewis
Janarthanan Rajendran
Junhyuk Oh
H. V. Hasselt
David Silver
Satinder Singh
LLMAG
22
86
0
10 Sep 2019
Mature GAIL: Imitation Learning for Low-level and High-dimensional Input
  using Global Encoder and Cost Transformation
Mature GAIL: Imitation Learning for Low-level and High-dimensional Input using Global Encoder and Cost Transformation
Wonsup Shin
Hyolim Kang
Sunghoon Hong
11
0
0
07 Sep 2019
Previous
12345678
Next