ResearchTrend.AI
Rainbow: Combining Improvements in Deep Reinforcement Learning


6 October 2017
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
    OffRL

Papers citing "Rainbow: Combining Improvements in Deep Reinforcement Learning"

50 / 355 papers shown
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
7
0
0
17 May 2025
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning
Simo Alami C.
Rim Kaddah
Jesse Read
Marie-Paule Cani
51
0
0
07 May 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCV
BDL
203
0
0
14 Mar 2025
Reinforcement Learning-based Threat Assessment
Wuzhou Sun
Siyi Li
Qingxiang Zou
Zixing Liao
AAML
78
0
0
04 Mar 2025
Reinforcement learning with combinatorial actions for coupled restless bandits
Lily Xu
Bryan Wilder
Elias B. Khalil
Milind Tambe
75
1
0
01 Mar 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
Günter Klambauer
Razvan Pascanu
Sepp Hochreiter
75
5
0
21 Feb 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
93
2
0
22 Jan 2025
A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
Jacob Adkins
Michael Bowling
Adam White
80
1
0
10 Dec 2024
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
Zhihong Liu
Xin Xu
Peng Qiao
Dongsheng Li
OffRL
27
2
0
08 Nov 2024
Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC
Tyler Clark
Mark Towers
Christine Evers
Jonathon Hare
OffRL
38
0
0
06 Nov 2024
Entity-based Reinforcement Learning for Autonomous Cyber Defence
Isaac Symes Thompson
Alberto Caron
Chris Hicks
V. Mavroudis
AAML
51
3
0
23 Oct 2024
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
36
0
0
22 Oct 2024
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
Hikaru Shindo
Quentin Delfosse
Devendra Singh Dhami
Kristian Kersting
43
3
0
15 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
43
1
0
07 Oct 2024
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Hanyang Zhao
Genta Indra Winata
Anirban Das
Shi-Xiong Zhang
D. Yao
Wenpin Tang
Sambit Sahu
62
7
0
05 Oct 2024
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
Ghada Sokar
J. Obando-Ceron
Rameswar Panda
Hugo Larochelle
Pablo Samuel Castro
MoE
150
2
0
02 Oct 2024
Integrating Reinforcement Learning and Model Predictive Control with Applications to Microgrids
Caio Fabio Oliveira da Silva
Azita Dabiri
B. de Schutter
50
4
0
17 Sep 2024
The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov
Anja Surina
Çağlar Gülçehre
OffRL
AI4CE
68
1
0
11 Sep 2024
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning
Shreyas S R
OffRL
OnRL
31
0
0
10 Sep 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
40
1
0
26 Jul 2024
Functional Acceleration for Policy Mirror Descent
Veronica Chelu
Doina Precup
30
0
0
23 Jul 2024
Simplifying Deep Temporal Difference Learning
Matteo Gallici
Mattie Fellows
Benjamin Ellis
B. Pou
Ivan Masmitja
Jakob Foerster
Mario Martin
OffRL
62
16
0
05 Jul 2024
Normalization and effective learning rates in reinforcement learning
Clare Lyle
Zeyu Zheng
Khimya Khetarpal
James Martens
H. V. Hasselt
Razvan Pascanu
Will Dabney
24
7
0
01 Jul 2024
Towards shutdownable agents via stochastic choice
Elliott Thornley
Alexander Roman
Christos Ziakas
Leyton Ho
Louis Thomson
46
0
0
30 Jun 2024
World Models with Hints of Large Language Models for Goal Achieving
Zeyuan Liu
Ziyu Huan
Xiyao Wang
Jiafei Lyu
Jian Tao
Xiu Li
Furong Huang
Huazhe Xu
LM&Ro
LRM
AI4CE
46
1
0
11 Jun 2024
Mimicry and the Emergence of Cooperative Communication
Dylan R. Cope
Peter McBurney
35
0
0
26 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
40
3
0
20 May 2024
Dominion: A New Frontier for AI Research
Danny Halawi
Aron Sarmasi
Siena Saltzen
Joshua McCoy
OffRL
22
0
0
10 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Kyle Stachowicz
Sergey Levine
22
6
0
07 May 2024
RUMOR: Reinforcement learning for Understanding a Model of the Real World for Navigation in Dynamic Environments
Diego Martínez Baselga
L. Riazuelo
Luis Montano
92
1
0
25 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
50
6
0
09 Apr 2024
An Improved Strategy for Blood Glucose Control Using Multi-Step Deep Reinforcement Learning
Weiwei Gu
Senquan Wang
45
5
0
12 Mar 2024
Koopman-Assisted Reinforcement Learning
Preston Rozwood
Edward Mehrez
Ludger Paehler
Wen Sun
Steven L. Brunton
40
6
0
04 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
42
3
0
28 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks using Adaptive Potential Functions
Yifei Chen
Lambert Schomaker
Francisco Cruz
38
0
0
07 Feb 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
26
5
0
05 Feb 2024
A Strategy for Preparing Quantum Squeezed States Using Reinforcement Learning
Xiaolong Zhao
Yiming Zhao
Ming Li
Tingting Li
Qian Liu
Shuai Guo
Xuexi Yi
18
1
0
29 Jan 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
26
0
0
24 Dec 2023
Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning
Md Saiful Islam
Srijita Das
S. Gottipati
William Duguay
Clodéric Mars
Jalal Arabneydi
Antoine Fagette
Matthew J. Guzdial
Matthew E. Taylor
41
1
0
23 Dec 2023
Multi-agent Reinforcement Learning: A Comprehensive Survey
Dom Huh
Prasant Mohapatra
AI4CE
36
8
0
15 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering
Yuanyuan Guo
Zehua Zang
Hang Gao
Xiao Xu
Rui Wang
Lixiang Liu
Jiangmeng Li
36
5
0
08 Dec 2023
Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications
Jun Wang
Hosein Hasanbeig
Kaiyuan Tan
Zihe Sun
Y. Kantaros
35
3
0
28 Nov 2023
From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?
Yannik Keller
Jannis Blüml
Gopika Sudhakaran
Kristian Kersting
GNN
27
0
0
22 Nov 2023
Analyzing Emissions and Energy Efficiency at Unsignalized Real-world Intersections Under Mixed Traffic Control
Michael Villarreal
Dawei Wang
Jia Pan
Weizi Li
10
2
0
20 Nov 2023
Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning
M. Gerstgrasser
Tom Danino
Sarah Keren
29
5
0
01 Nov 2023
A Kernel Perspective on Behavioural Metrics for Markov Decision Processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
41
4
0
05 Oct 2023
Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning
Hao Qin
Zhaozhou Wu
Xingqi Zhang
16
0
0
31 Aug 2023
Data-Efficient Online Learning of Ball Placement in Robot Table Tennis
Philip Tobuschat
Hao Ma
Le Chen
Bernhard Schölkopf
Michael Muehlebach
33
1
0
28 Aug 2023