Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.06560
Cited By
Deep Reinforcement Learning that Matters
19 September 2017
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
D. Meger
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Reinforcement Learning that Matters"
50 / 316 papers shown
Title
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
37
5
0
20 Jul 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
31
9
0
29 May 2023
C-MCTS: Safe Planning with Monte Carlo Tree Search
Dinesh Parthasarathy
G. Kontes
Axel Plinge
Christopher Mutschler
37
3
0
25 May 2023
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy
Yameng Zhang
Long Bai
Li Liu
Hongliang Ren
Max Q.-H. Meng
11
9
0
18 May 2023
Goal-Conditioned Supervised Learning with Sub-Goal Prediction
Tom Jurgenson
Aviv Tamar
29
1
0
17 May 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
33
36
0
16 May 2023
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Prakash Panangaden
S. Rezaei-Shoshtari
Rosie Zhao
D. Meger
Doina Precup
25
2
0
09 May 2023
The e-Bike Motor Assembly: Towards Advanced Robotic Manipulation for Flexible Manufacturing
Leonel Rozo
A. Kupcsik
Philipp Schillinger
Meng Guo
R. Krug
...
Patrick Kesper
Sabrina Hoppe
Hanna Ziesche
M. Burger
Kai O. Arras
38
5
0
20 Apr 2023
PED-ANOVA: Efficiently Quantifying Hyperparameter Importance in Arbitrary Subspaces
Shuhei Watanabe
Archit Bansal
Frank Hutter
32
12
0
20 Apr 2023
Learning policies for resource allocation in business processes
J. Middelhuis
R. Bianco
E. Scherzer
Z. A. Bukhsh
I. Adan
R. Dijkman
11
6
0
19 Apr 2023
Convex Dual Theory Analysis of Two-Layer Convolutional Neural Networks with Soft-Thresholding
Chunyan Xiong
Meng Lu
Xiaotong Yu
JIAN-PENG Cao
Zhong Chen
D. Guo
X. Qu
MLT
40
0
0
14 Apr 2023
Deep reinforcement learning applied to an assembly sequence planning problem with user preferences
M. Neves
Pedro Neto
OffRL
16
17
0
13 Apr 2023
Automatic Gradient Descent: Deep Learning without Hyperparameters
Jeremy Bernstein
Chris Mingard
Kevin Huang
Navid Azizan
Yisong Yue
ODL
16
17
0
11 Apr 2023
Deep Occupancy-Predictive Representations for Autonomous Driving
Eivind Meyer
Lars Frederik Peiss
Matthias Althoff
37
3
0
07 Mar 2023
Using Automated Algorithm Configuration for Parameter Control
D. Chen
M. Buzdalov
Carola Doerr
Nguyen Dang
23
3
0
23 Feb 2023
A Reinforcement Learning Framework for Online Speaker Diarization
Baihan Lin
Xinxin Zhang
OffRL
33
2
0
21 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
32
163
0
06 Feb 2023
A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Megan M. Baker
Alexander New
Mario Aguilar-Simon
Ziad Al-Halah
Sébastien M. R. Arnold
...
Zifan Xu
A. Yanguas-Gil
Harel Yedidsion
Shangqun Yu
Gautam K. Vallabha
30
15
0
18 Jan 2023
Mutation Testing of Deep Reinforcement Learning Based on Real Faults
Florian Tambon
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
G. Antoniol
28
7
0
13 Jan 2023
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRL
MU
AI4CE
37
55
0
08 Jan 2023
Offline Policy Optimization in RL with Variance Regularizaton
Riashat Islam
Samarth Sinha
Homanga Bharadhwaj
Samin Yeasar Arnob
Zhuoran Yang
Animesh Garg
Zhaoran Wang
Lihong Li
Doina Precup
OffRL
23
0
0
29 Dec 2022
Speeding Up Multi-Objective Hyperparameter Optimization by Task Similarity-Based Meta-Learning for the Tree-Structured Parzen Estimator
Shuhei Watanabe
Noor H. Awad
Masaki Onishi
Frank Hutter
31
8
0
13 Dec 2022
Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction
Kaiyuan Yang
Houjing Huang
Olafs Vandans
A. Murali
Fujia Tian
R. Yap
Liang Dai
17
10
0
27 Nov 2022
CACTO: Continuous Actor-Critic with Trajectory Optimization -- Towards global optimality
Gianluigi Grandesso
Elisa Alboni
G. P. R. Papini
Patrick M. Wensing
Andrea Del Prete
22
15
0
12 Nov 2022
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
24
7
0
11 Nov 2022
Safe Policy Improvement in Constrained Markov Decision Processes
Luigi Berducci
Radu Grosu
OffRL
36
2
0
20 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
18
1
0
20 Oct 2022
Long N-step Surrogate Stage Reward to Reduce Variances of Deep Reinforcement Learning in Complex Problems
Junmin Zhong
Ruofan Wu
J. Si
LRM
13
0
0
10 Oct 2022
Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI
Baturay Saglam
Doğa Gürgünoğlu
Suleyman Serdar Kozat
16
12
0
10 Oct 2022
Deep Intrinsically Motivated Exploration in Continuous Control
Baturay Saglam
Suleyman Serdar Kozat
18
4
0
01 Oct 2022
Scaling Laws for a Multi-Agent Reinforcement Learning Model
Oren Neumann
C. Gros
24
26
0
29 Sep 2022
Modern Machine Learning Tools for Monitoring and Control of Industrial Processes: A Survey
R. Bhushan Gopaluni
Aditya Tulsyan
Benoît Chachuat
Biao Huang
J. M. Lee
Faraz Amjad
S. Damarla
Jong Woo Kim
Nathan P. Lawrence
AI4CE
16
38
0
22 Sep 2022
Measuring Interventional Robustness in Reinforcement Learning
Katherine Avery
Jack Kenney
Pracheta Amaranath
Erica Cai
David D. Jensen
21
0
0
19 Sep 2022
Learn the Time to Learn: Replay Scheduling in Continual Learning
Marcus Klasson
Hedvig Kjellström
Chen Zhang
CLL
24
9
0
18 Sep 2022
Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning
David Bertoin
Adil Zouitine
Mehdi Zouitine
Emmanuel Rachelson
36
30
0
16 Sep 2022
Deep Reinforcement Learning for Cryptocurrency Trading: Practical Approach to Address Backtest Overfitting
Berend Gort
Xiao-Yang Liu
Xinghang Sun
Jiechao Gao
Shuai Chen
Chris Wang
24
13
0
12 Sep 2022
Unifying Generative Models with GFlowNets and Beyond
Dinghuai Zhang
Ricky T. Q. Chen
Nikolay Malkin
Yoshua Bengio
BDL
AI4CE
54
25
0
06 Sep 2022
Project proposal: A modular reinforcement learning based automated theorem prover
Boris Shminke
18
1
0
06 Sep 2022
Distributed Ensembles of Reinforcement Learning Agents for Electricity Control
Pierrick Pochelu
S. Petiton
B. Conche
AI4CE
28
2
0
30 Aug 2022
Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action Space
Yining Chen
Ke Wang
Guang-hua Song
Xiaohong Jiang
25
3
0
23 Aug 2022
Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter
Aleksandar Stanić
Yujin Tang
David R Ha
Jürgen Schmidhuber
ELM
29
13
0
05 Aug 2022
Towards Augmented Microscopy with Reinforcement Learning-Enhanced Workflows
Michael Xu
Abinash Kumar
J. Lebeau
18
7
0
04 Aug 2022
Robust Knowledge Adaptation for Dynamic Graph Neural Networks
Han Li
Changsheng Li
Kaituo Feng
Ye Yuan
Guoren Wang
H. Zha
32
13
0
22 Jul 2022
Bayesian Generational Population-Based Training
Xingchen Wan
Cong Lu
Jack Parker-Holder
Philip J. Ball
Vu-Linh Nguyen
Binxin Ru
Michael A. Osborne
OffRL
31
15
0
19 Jul 2022
Neural Color Operators for Sequential Image Retouching
Yili Wang
Xin Li
K. Xu
Dongliang He
Qi Zhang
Fu Li
Errui Ding
23
14
0
17 Jul 2022
Meta Reinforcement Learning with Finite Training Tasks -- a Density Estimation Approach
Zohar Rimon
Aviv Tamar
Gilad Adler
OOD
OffRL
34
8
0
21 Jun 2022
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Byungchan Ko
Jungseul Ok
OnRL
15
5
0
01 Jun 2022
Comparing interpretation methods in mental state decoding analyses with deep learning models
A. Thomas
Christopher Ré
R. Poldrack
AI4CE
13
2
0
31 May 2022
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning
Yinglun Xu
Qi Zeng
Gagandeep Singh
AAML
35
5
0
30 May 2022
Automated Dynamic Algorithm Configuration
Steven Adriaensen
André Biedenkapp
Gresa Shala
Noor H. Awad
Theresa Eimer
Marius Lindauer
Frank Hutter
32
36
0
27 May 2022
Previous
1
2
3
4
5
6
7
Next