ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1709.06560
  4. Cited By
Deep Reinforcement Learning that Matters

Deep Reinforcement Learning that Matters

19 September 2017
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
D. Meger
    OffRL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning that Matters"

50 / 316 papers shown
Title
Applicability and Challenges of Deep Reinforcement Learning for
  Satellite Frequency Plan Design
Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design
J. Luis
E. Crawley
B. Cameron
24
6
0
15 Oct 2020
Reformulating Unsupervised Style Transfer as Paraphrase Generation
Reformulating Unsupervised Style Transfer as Paraphrase Generation
Kalpesh Krishna
John Wieting
Mohit Iyyer
21
237
0
12 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and
  Transfer Learning
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Ossama Ahmed
Frederik Trauble
Anirudh Goyal
Alexander Neitz
Yoshua Bengio
Bernhard Schölkopf
M. Wuthrich
Stefan Bauer
CML
34
120
0
08 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning
Honghao Wei
Lei Ying
14
7
0
04 Oct 2020
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
Yuke Zhu
J. Wong
Ajay Mandlekar
Roberto Martín-Martín
Abhishek Joshi
Soroush Nasiriany
Yifeng Zhu
Soroush Nasiriany
Yifeng Zhu
37
430
0
25 Sep 2020
Revisiting Design Choices in Proximal Policy Optimization
Revisiting Design Choices in Proximal Policy Optimization
Chloe Ching-Yun Hsu
Celestine Mendler-Dünner
Moritz Hardt
17
53
0
23 Sep 2020
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control
Ian Fox
Joyce M. Lee
R. Pop-Busui
Jenna Wiens
BDL
OffRL
19
50
0
18 Sep 2020
TriFinger: An Open-Source Robot for Learning Dexterity
TriFinger: An Open-Source Robot for Learning Dexterity
Manuel Wüthrich
Felix Widmaier
F. Grimminger
J. Akpo
S. Joshi
...
Julian Viereck
M. Naveau
Ludovic Righetti
Bernhard Schölkopf
Stefan Bauer
27
72
0
08 Aug 2020
A Survey on Device Behavior Fingerprinting: Data Sources, Techniques,
  Application Scenarios, and Datasets
A Survey on Device Behavior Fingerprinting: Data Sources, Techniques, Application Scenarios, and Datasets
Pedro Miguel Sánchez Sánchez
José María Jorquera Valero
Alberto Huertas Celdrán
Gérome Bovet
M. Pérez
Gregorio Martínez Pérez
27
95
0
07 Aug 2020
On the Effectiveness of Image Rotation for Open Set Domain Adaptation
On the Effectiveness of Image Rotation for Open Set Domain Adaptation
S. Bucci
Mohammad Reza Loghmani
Tatiana Tommasi
52
142
0
24 Jul 2020
A Differential Game Theoretic Neural Optimizer for Training Residual
  Networks
A Differential Game Theoretic Neural Optimizer for Training Residual Networks
Guan-Horng Liu
T. Chen
Evangelos A. Theodorou
24
2
0
17 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward
  Transfer
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
E. Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
19
5
0
12 Jul 2020
Long-Term Planning with Deep Reinforcement Learning on Autonomous Drones
Long-Term Planning with Deep Reinforcement Learning on Autonomous Drones
Ugurkan Ates
18
10
0
11 Jul 2020
One Policy to Control Them All: Shared Modular Policies for
  Agent-Agnostic Control
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Wenlong Huang
Igor Mordatch
Deepak Pathak
51
164
0
09 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep
  Reinforcement Learning
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
11
199
0
09 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
17
71
0
04 Jul 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
25
3
0
18 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using
  Deep Reinforcement Learning
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
30
74
0
16 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
26
220
0
14 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale
  Empirical Study
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
M. Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
31
213
0
10 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning
  Machines
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines
Callum Wilson
A. Riccardi
E. Minisci
11
4
0
04 Jun 2020
Novel Policy Seeking with Constrained Optimization
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
13
13
0
21 May 2020
Decentralized Deep Reinforcement Learning for a Distributed and Adaptive
  Locomotion Controller of a Hexapod Robot
Decentralized Deep Reinforcement Learning for a Distributed and Adaptive Locomotion Controller of a Hexapod Robot
M. Schilling
Kai Konen
F. Ohl
Timo Korthals
11
18
0
21 May 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
12
82
0
20 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular
  Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Abdulelah S. Alshehri
R. Gani
Fengqi You
AI4CE
30
83
0
18 May 2020
Context-aware Dynamics Model for Generalization in Model-Based
  Reinforcement Learning
Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning
Kimin Lee
Younggyo Seo
Seunghyun Lee
Honglak Lee
Jinwoo Shin
40
124
0
14 May 2020
Guaranteeing Reproducibility in Deep Learning Competitions
Guaranteeing Reproducibility in Deep Learning Competitions
Brandon Houghton
Stephanie Milani
Nicholay Topin
William H. Guss
Katja Hofmann
Diego Perez-Liebana
Manuela Veloso
Ruslan Salakhutdinov
OOD
27
8
0
12 May 2020
Improving Reproducibility in Machine Learning Research (A Report from
  the NeurIPS 2019 Reproducibility Program)
Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)
Joelle Pineau
Philippe Vincent-Lamarre
Koustuv Sinha
V. Larivière
A. Beygelzimer
Florence dÁlché-Buc
E. Fox
Hugo Larochelle
19
358
0
27 Mar 2020
An empirical investigation of the challenges of real-world reinforcement
  learning
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
31
120
0
24 Mar 2020
Explore and Exploit with Heterotic Line Bundle Models
Explore and Exploit with Heterotic Line Bundle Models
Magdalena Larfors
Robin Schneider
36
38
0
10 Mar 2020
Scalable Multi-Task Imitation Learning with Autonomous Improvement
Scalable Multi-Task Imitation Learning with Autonomous Improvement
Avi Singh
Eric Jang
A. Irpan
Daniel Kappler
Murtaza Dalal
Sergey Levine
Mohi Khansari
Chelsea Finn
48
35
0
25 Feb 2020
The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence
The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence
G. Marcus
VLM
32
353
0
14 Feb 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump
  Linear Systems
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
25
35
0
10 Feb 2020
Ready Policy One: World Building Through Active Learning
Ready Policy One: World Building Through Active Learning
Philip J. Ball
Jack Parker-Holder
Aldo Pacchiano
K. Choromanski
Stephen J. Roberts
OffRL
27
49
0
07 Feb 2020
Stacked Auto Encoder Based Deep Reinforcement Learning for Online
  Resource Scheduling in Large-Scale MEC Networks
Stacked Auto Encoder Based Deep Reinforcement Learning for Online Resource Scheduling in Large-Scale MEC Networks
Feibo Jiang
Kezhi Wang
Li Dong
Cunhua Pan
Kun Yang
OffRL
18
39
0
24 Jan 2020
Lyceum: An efficient and scalable ecosystem for robot learning
Lyceum: An efficient and scalable ecosystem for robot learning
Colin Summers
Kendall Lowrey
Aravind Rajeswaran
S. Srinivasa
E. Todorov
21
18
0
21 Jan 2020
Towards GAN Benchmarks Which Require Generalization
Towards GAN Benchmarks Which Require Generalization
Ishaan Gulrajani
Colin Raffel
Luke Metz
24
57
0
10 Jan 2020
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for
  Reproducible Deep Reinforcement Learning
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning
Keng Wah Loon
L. Graesser
Milan Cvitkovic
OffRL
16
13
0
28 Dec 2019
Convolutional Neural Network-based Topology Optimization (CNN-TO) By
  Estimating Sensitivity of Compliance from Material Distribution
Convolutional Neural Network-based Topology Optimization (CNN-TO) By Estimating Sensitivity of Compliance from Material Distribution
Yusuke Takahashi
Yoshiro Suzuki
A. Todoroki
10
6
0
23 Dec 2019
Taming an autonomous surface vehicle for path following and collision
  avoidance using deep reinforcement learning
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
25
65
0
18 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning
Shuai Lu
Shuai Han
Wenbo Zhou
Junwei Zhang
13
26
0
13 Dec 2019
Learning to Reach Goals via Iterated Supervised Learning
Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh
Abhishek Gupta
Ashwin Reddy
Justin Fu
Coline Devin
Benjamin Eysenbach
Sergey Levine
24
33
0
12 Dec 2019
Policy Optimization Reinforcement Learning with Entropy Regularization
Policy Optimization Reinforcement Learning with Entropy Regularization
Jingbin Liu
Xinyang Gu
Shuai Liu
17
4
0
02 Dec 2019
Empirical Study of Off-Policy Policy Evaluation for Reinforcement
  Learning
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
30
152
0
15 Nov 2019
Multi-Path Policy Optimization
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
18
2
0
11 Nov 2019
Experience Sharing Between Cooperative Reinforcement Learning Agents
Experience Sharing Between Cooperative Reinforcement Learning Agents
Lucas O. Souza
G. Ramos
C. Ralha
16
9
0
06 Nov 2019
Paths Explored, Paths Omitted, Paths Obscured: Decision Points &
  Selective Reporting in End-to-End Data Analysis
Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis
Yang Liu
Tim Althoff
Jeffrey Heer
8
51
0
30 Oct 2019
Improving Sample Efficiency in Model-Free Reinforcement Learning from
  Images
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images
Denis Yarats
Amy Zhang
Ilya Kostrikov
Brandon Amos
Joelle Pineau
Rob Fergus
DRL
42
436
0
02 Oct 2019
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping
Cristian Bodnar
A. Li
Karol Hausman
P. Pastor
Mrinal Kalakrishnan
OffRL
20
50
0
01 Oct 2019
Meta-Q-Learning
Meta-Q-Learning
Rasool Fakoor
Pratik Chaudhari
Stefano Soatto
Alex Smola
OffRL
19
145
0
30 Sep 2019
Previous
1234567
Next