ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1710.02298
  4. Cited By
Rainbow: Combining Improvements in Deep Reinforcement Learning

Rainbow: Combining Improvements in Deep Reinforcement Learning

6 October 2017
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
    OffRL
ArXivPDFHTML

Papers citing "Rainbow: Combining Improvements in Deep Reinforcement Learning"

50 / 362 papers shown
Title
Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy
  Environments Based on Deep Reinforcement Learning
Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning
Hao Qin
Zhaozhou Wu
Xingqi Zhang
16
0
0
31 Aug 2023
Data-Efficient Online Learning of Ball Placement in Robot Table Tennis
Data-Efficient Online Learning of Ball Placement in Robot Table Tennis
Philip Tobuschat
Hao Ma
Le Chen
Bernhard Schölkopf
Michael Muehlebach
36
1
0
28 Aug 2023
Learning Cyber Defence Tactics from Scratch with Multi-Agent
  Reinforcement Learning
Learning Cyber Defence Tactics from Scratch with Multi-Agent Reinforcement Learning
Jacob Wiebe
Ranwa Al Mallah
Li Li
AAML
36
3
0
25 Aug 2023
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning
  Agents
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
S. Ramesh
27
5
0
03 Aug 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
On-Robot Bayesian Reinforcement Learning for POMDPs
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
21
1
0
22 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
40
5
0
20 Jul 2023
Meta-Value Learning: a General Framework for Learning with Learning
  Awareness
Meta-Value Learning: a General Framework for Learning with Learning Awareness
Tim Cooijmans
Milad Aghajohari
Rameswar Panda
27
6
0
17 Jul 2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Jinyi Liu
Yi Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
47
2
0
27 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large
  Language Models
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
41
7
0
14 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
85
0
30 May 2023
VA-learning as a more efficient alternative to Q-learning
VA-learning as a more efficient alternative to Q-learning
Yunhao Tang
Rémi Munos
Mark Rowland
Michal Valko
OffRL
21
6
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
Accelerating Value Iteration with Anchoring
Accelerating Value Iteration with Anchoring
Jongmin Lee
Ernest K. Ryu
24
7
0
26 May 2023
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Matteo Biagiola
Paolo Tonella
44
19
0
22 May 2023
Augmenting Autotelic Agents with Large Language Models
Augmenting Autotelic Agents with Large Language Models
Cédric Colas
Laetitia Teodorescu
Pierre-Yves Oudeyer
Xingdi Yuan
Marc-Alexandre Côté
LLMAG
LM&Ro
30
22
0
21 May 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
32
31
0
20 Apr 2023
Autonomous Agent for Beyond Visual Range Air Combat: A Deep
  Reinforcement Learning Approach
Autonomous Agent for Beyond Visual Range Air Combat: A Deep Reinforcement Learning Approach
Joao P. A. Dantas
Marcos R. O. A. Máximo
Takashi Yoneyama
32
4
0
19 Apr 2023
Deep reinforcement learning applied to an assembly sequence planning
  problem with user preferences
Deep reinforcement learning applied to an assembly sequence planning problem with user preferences
M. Neves
Pedro Neto
OffRL
24
17
0
13 Apr 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End
  Policy and Optimistic Smooth Fictitious Play
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
50
8
0
07 Mar 2023
Graph Decision Transformer
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
36
15
0
07 Mar 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
51
90
0
24 Feb 2023
Selective experience replay compression using coresets for lifelong deep
  reinforcement learning in medical imaging
Selective experience replay compression using coresets for lifelong deep reinforcement learning in medical imaging
Guangyao Zheng
Samson Zhou
Vladimir Braverman
M. Jacobs
V. Parekh
OffRL
CLL
24
3
0
22 Feb 2023
Understanding the effect of varying amounts of replay per step
Understanding the effect of varying amounts of replay per step
A. Paul
Videh Raj Nema
8
0
0
20 Feb 2023
Robot path planning using deep reinforcement learning
Robot path planning using deep reinforcement learning
Miguel Quinones-Ramirez
Jorge Ríos-Martínez
Víctor Uc Cetina
SSL
25
5
0
17 Feb 2023
Improving robot navigation in crowded environments using intrinsic rewards
Improving robot navigation in crowded environments using intrinsic rewards
Diego Martínez Baselga
L. Riazuelo
Luis Montano
45
13
0
13 Feb 2023
Distributional GFlowNets with Quantile Flows
Distributional GFlowNets with Quantile Flows
Dinghuai Zhang
L. Pan
Ricky T. Q. Chen
Aaron Courville
Yoshua Bengio
34
25
0
11 Feb 2023
Neural Episodic Control with State Abstraction
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
Lei Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
26
14
0
27 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
26
19
0
26 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
29
0
0
26 Jan 2023
Multi-agent Reinforcement Learning with Graph Q-Networks for Antenna
  Tuning
Multi-agent Reinforcement Learning with Graph Q-Networks for Antenna Tuning
Maxime Bouton
Jaeseong Jeong
José Outes Carnero
Adriano Mendo
Alexandros Nikou
24
1
0
20 Jan 2023
Learning to Control and Coordinate Mixed Traffic Through Robot Vehicles
  at Complex and Unsignalized Intersections
Learning to Control and Coordinate Mixed Traffic Through Robot Vehicles at Complex and Unsignalized Intersections
Dawei Wang
Weizi Li
Lei Zhu
Jia-Yu Pan
48
16
0
12 Jan 2023
Predictive World Models from Real-World Partial Observations
Predictive World Models from Real-World Partial Observations
Robin Karlsson
Alexander Carballo
Keisuke Fujii
Kento Ohtani
K. Takeda
44
5
0
12 Jan 2023
Symbolic Visual Reinforcement Learning: A Scalable Framework with
  Object-Level Abstraction and Differentiable Expression Search
Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search
Wenqing Zheng
S. Sharan
Zhiwen Fan
Kevin Wang
Yihan Xi
Zhangyang Wang
58
9
0
30 Dec 2022
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
26
0
0
10 Dec 2022
Learning Physically Realizable Skills for Online Packing of General 3D
  Shapes
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao
Zherong Pan
Yang Yu
Kai Xu
OffRL
37
14
0
05 Dec 2022
A Critical Review of Traffic Signal Control and A Novel Unified View of
  Reinforcement Learning and Model Predictive Control Approaches for Adaptive
  Traffic Signal Control
A Critical Review of Traffic Signal Control and A Novel Unified View of Reinforcement Learning and Model Predictive Control Approaches for Adaptive Traffic Signal Control
Xiaoyu Wang
Scott Sanner
Baher Abdulhai
22
5
0
26 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
41
0
0
23 Nov 2022
Representation Learning for Continuous Action Spaces is Beneficial for
  Efficient Policy Learning
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Credit-cognisant reinforcement learning for multi-agent cooperation
Credit-cognisant reinforcement learning for multi-agent cooperation
F. Bredell
S. M. I. H. A. Engelbrecht
M. I. J. C. Schoeman
13
0
0
18 Nov 2022
Active Example Selection for In-Context Learning
Active Example Selection for In-Context Learning
Yiming Zhang
Shi Feng
Chenhao Tan
SILM
LRM
32
187
0
08 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making
  in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
49
8
0
06 Nov 2022
Achieving mouse-level strategic evasion performance using real-time
  computational planning
Achieving mouse-level strategic evasion performance using real-time computational planning
German Espinosa
Gabrielle E. Wink
Alexander T. Lai
D. Dombeck
Malcolm A. MacIver
16
3
0
04 Nov 2022
Multi-Agent Reinforcement Learning for Adaptive Mesh Refinement
Multi-Agent Reinforcement Learning for Adaptive Mesh Refinement
Jiachen Yang
K. Mittal
T. Dzanic
S. Petrides
B. Keith
Brenden K. Petersen
Daniel Faissol
R. Anderson
31
8
0
02 Nov 2022
A Bibliometric Analysis and Review on Reinforcement Learning for
  Transportation Applications
A Bibliometric Analysis and Review on Reinforcement Learning for Transportation Applications
Can Li
Lei Bai
L. Yao
S. Waller
Wei Liu
35
14
0
26 Oct 2022
Climate Change Policy Exploration using Reinforcement Learning
Climate Change Policy Exploration using Reinforcement Learning
Theodore Wolf
26
0
0
23 Oct 2022
Solving Continuous Control via Q-learning
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
37
22
0
22 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
39
6
0
22 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
14
1
0
21 Oct 2022
Deep Reinforcement Learning for Inverse Inorganic Materials Design
Deep Reinforcement Learning for Inverse Inorganic Materials Design
Elton Pan
Christopher Karpovich
E. Olivetti
AI4CE
32
11
0
21 Oct 2022
Towards Trustworthy Automatic Diagnosis Systems by Emulating Doctors'
  Reasoning with Deep Reinforcement Learning
Towards Trustworthy Automatic Diagnosis Systems by Emulating Doctors' Reasoning with Deep Reinforcement Learning
Arsène Fansi Tchango
Rishab Goel
Julien Martel
Zhi Wen
G. Caron
J. Ghosn
32
11
0
13 Oct 2022
Previous
12345678
Next