Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.02298
Cited By
Rainbow: Combining Improvements in Deep Reinforcement Learning
6 October 2017
Matteo Hessel
Joseph Modayil
H. V. Hasselt
Tom Schaul
Georg Ostrovski
Will Dabney
Dan Horgan
Bilal Piot
M. G. Azar
David Silver
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Rainbow: Combining Improvements in Deep Reinforcement Learning"
50 / 362 papers shown
Title
Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning
Hao Qin
Zhaozhou Wu
Xingqi Zhang
16
0
0
31 Aug 2023
Data-Efficient Online Learning of Ball Placement in Robot Table Tennis
Philip Tobuschat
Hao Ma
Le Chen
Bernhard Schölkopf
Michael Muehlebach
36
1
0
28 Aug 2023
Learning Cyber Defence Tactics from Scratch with Multi-Agent Reinforcement Learning
Jacob Wiebe
Ranwa Al Mallah
Li Li
AAML
36
3
0
25 Aug 2023
SMARLA: A Safety Monitoring Approach for Deep Reinforcement Learning Agents
Amirhossein Zolfagharian
Manel Abdellatif
Lionel C. Briand
S. Ramesh
27
5
0
03 Aug 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
21
1
0
22 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
40
5
0
20 Jul 2023
Meta-Value Learning: a General Framework for Learning with Learning Awareness
Tim Cooijmans
Milad Aghajohari
Rameswar Panda
27
6
0
17 Jul 2023
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning
Jinyi Liu
Yi Ma
Jianye Hao
Yujing Hu
Yan Zheng
Tangjie Lv
Changjie Fan
OffRL
47
2
0
27 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
41
7
0
14 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Rameswar Panda
Marc G. Bellemare
Rishabh Agarwal
Pablo Samuel Castro
OffRL
54
85
0
30 May 2023
VA-learning as a more efficient alternative to Q-learning
Yunhao Tang
Rémi Munos
Mark Rowland
Michal Valko
OffRL
21
6
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
36
9
0
29 May 2023
Accelerating Value Iteration with Anchoring
Jongmin Lee
Ernest K. Ryu
24
7
0
26 May 2023
Testing of Deep Reinforcement Learning Agents with Surrogate Models
Matteo Biagiola
Paolo Tonella
44
19
0
22 May 2023
Augmenting Autotelic Agents with Large Language Models
Cédric Colas
Laetitia Teodorescu
Pierre-Yves Oudeyer
Xingdi Yuan
Marc-Alexandre Côté
LLMAG
LM&Ro
30
22
0
21 May 2023
Efficient Deep Reinforcement Learning Requires Regulating Overfitting
Qiyang Li
Aviral Kumar
Ilya Kostrikov
Sergey Levine
OffRL
32
31
0
20 Apr 2023
Autonomous Agent for Beyond Visual Range Air Combat: A Deep Reinforcement Learning Approach
Joao P. A. Dantas
Marcos R. O. A. Máximo
Takashi Yoneyama
32
4
0
19 Apr 2023
Deep reinforcement learning applied to an assembly sequence planning problem with user preferences
M. Neves
Pedro Neto
OffRL
24
17
0
13 Apr 2023
Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play
Wei Xi
Yongxin Zhang
Changnan Xiao
Xuefeng Huang
Shihong Deng
Haowei Liang
Jie Chen
Peng Sun
OffRL
50
8
0
07 Mar 2023
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
36
15
0
07 Mar 2023
The Dormant Neuron Phenomenon in Deep Reinforcement Learning
Ghada Sokar
Rishabh Agarwal
Pablo Samuel Castro
Utku Evci
CLL
51
90
0
24 Feb 2023
Selective experience replay compression using coresets for lifelong deep reinforcement learning in medical imaging
Guangyao Zheng
Samson Zhou
Vladimir Braverman
M. Jacobs
V. Parekh
OffRL
CLL
24
3
0
22 Feb 2023
Understanding the effect of varying amounts of replay per step
A. Paul
Videh Raj Nema
8
0
0
20 Feb 2023
Robot path planning using deep reinforcement learning
Miguel Quinones-Ramirez
Jorge Ríos-Martínez
Víctor Uc Cetina
SSL
25
5
0
17 Feb 2023
Improving robot navigation in crowded environments using intrinsic rewards
Diego Martínez Baselga
L. Riazuelo
Luis Montano
45
13
0
13 Feb 2023
Distributional GFlowNets with Quantile Flows
Dinghuai Zhang
L. Pan
Ricky T. Q. Chen
Aaron Courville
Yoshua Bengio
34
25
0
11 Feb 2023
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
Lei Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
26
14
0
27 Jan 2023
Deep Laplacian-based Options for Temporally-Extended Exploration
Martin Klissarov
Marlos C. Machado
OffRL
26
19
0
26 Jan 2023
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout
Takuya Hiraoka
Takashi Onishi
Yoshimasa Tsuruoka
OffRL
29
0
0
26 Jan 2023
Multi-agent Reinforcement Learning with Graph Q-Networks for Antenna Tuning
Maxime Bouton
Jaeseong Jeong
José Outes Carnero
Adriano Mendo
Alexandros Nikou
24
1
0
20 Jan 2023
Learning to Control and Coordinate Mixed Traffic Through Robot Vehicles at Complex and Unsignalized Intersections
Dawei Wang
Weizi Li
Lei Zhu
Jia-Yu Pan
48
16
0
12 Jan 2023
Predictive World Models from Real-World Partial Observations
Robin Karlsson
Alexander Carballo
Keisuke Fujii
Kento Ohtani
K. Takeda
44
5
0
12 Jan 2023
Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search
Wenqing Zheng
S. Sharan
Zhiwen Fan
Kevin Wang
Yihan Xi
Zhangyang Wang
58
9
0
30 Dec 2022
Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees
Hsin-En Su
Yen-Ju Chen
Ping-Chun Hsieh
Xi Liu
OffRL
26
0
0
10 Dec 2022
Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Hang Zhao
Zherong Pan
Yang Yu
Kai Xu
OffRL
37
14
0
05 Dec 2022
A Critical Review of Traffic Signal Control and A Novel Unified View of Reinforcement Learning and Model Predictive Control Approaches for Adaptive Traffic Signal Control
Xiaoyu Wang
Scott Sanner
Baher Abdulhai
22
5
0
26 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
41
0
0
23 Nov 2022
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
19
1
0
23 Nov 2022
Credit-cognisant reinforcement learning for multi-agent cooperation
F. Bredell
S. M. I. H. A. Engelbrecht
M. I. J. C. Schoeman
13
0
0
18 Nov 2022
Active Example Selection for In-Context Learning
Yiming Zhang
Shi Feng
Chenhao Tan
SILM
LRM
32
187
0
08 Nov 2022
Graph Reinforcement Learning Application to Co-operative Decision-Making in Mixed Autonomy Traffic: Framework, Survey, and Challenges
Qi Liu
Xueyuan Li
Zirui Li
Jingda Wu
Guodong Du
Xinlu Gao
Fan Yang
Shihua Yuan
49
8
0
06 Nov 2022
Achieving mouse-level strategic evasion performance using real-time computational planning
German Espinosa
Gabrielle E. Wink
Alexander T. Lai
D. Dombeck
Malcolm A. MacIver
16
3
0
04 Nov 2022
Multi-Agent Reinforcement Learning for Adaptive Mesh Refinement
Jiachen Yang
K. Mittal
T. Dzanic
S. Petrides
B. Keith
Brenden K. Petersen
Daniel Faissol
R. Anderson
31
8
0
02 Nov 2022
A Bibliometric Analysis and Review on Reinforcement Learning for Transportation Applications
Can Li
Lei Bai
L. Yao
S. Waller
Wei Liu
35
14
0
26 Oct 2022
Climate Change Policy Exploration using Reinforcement Learning
Theodore Wolf
26
0
0
23 Oct 2022
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
37
22
0
22 Oct 2022
Probing Transfer in Deep Reinforcement Learning without Task Engineering
Andrei A. Rusu
Sebastian Flennerhag
Dushyant Rao
Razvan Pascanu
R. Hadsell
39
6
0
22 Oct 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
14
1
0
21 Oct 2022
Deep Reinforcement Learning for Inverse Inorganic Materials Design
Elton Pan
Christopher Karpovich
E. Olivetti
AI4CE
32
11
0
21 Oct 2022
Towards Trustworthy Automatic Diagnosis Systems by Emulating Doctors' Reasoning with Deep Reinforcement Learning
Arsène Fansi Tchango
Rishab Goel
Julien Martel
Zhi Wen
G. Caron
J. Ghosn
32
11
0
13 Oct 2022
Previous
1
2
3
4
5
6
7
8
Next