Deep Reinforcement Learning that Matters

19 September 2017

Papers citing "Deep Reinforcement Learning that Matters"

50 / 316 papers shown

Title
Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design J. Luis E. Crawley B. Cameron 24 6 0 15 Oct 2020
Reformulating Unsupervised Style Transfer as Paraphrase Generation Kalpesh Krishna John Wieting Mohit Iyyer 21 237 0 12 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning Ossama Ahmed Frederik Trauble Anirudh Goyal Alexander Neitz Yoshua Bengio Bernhard Schölkopf M. Wuthrich Stefan Bauer CML 34 120 0 08 Oct 2020
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning Honghao Wei Lei Ying 14 7 0 04 Oct 2020
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning Yuke Zhu J. Wong Ajay Mandlekar Roberto Martín-Martín Abhishek Joshi Soroush Nasiriany Yifeng Zhu Soroush Nasiriany Yifeng Zhu 37 430 0 25 Sep 2020
Revisiting Design Choices in Proximal Policy Optimization Chloe Ching-Yun Hsu Celestine Mendler-Dünner Moritz Hardt 17 53 0 23 Sep 2020
Deep Reinforcement Learning for Closed-Loop Blood Glucose Control Ian Fox Joyce M. Lee R. Pop-Busui Jenna Wiens BDL OffRL 19 50 0 18 Sep 2020
TriFinger: An Open-Source Robot for Learning Dexterity Manuel Wüthrich Felix Widmaier F. Grimminger J. Akpo S. Joshi ... Julian Viereck M. Naveau Ludovic Righetti Bernhard Schölkopf Stefan Bauer 27 72 0 08 Aug 2020
A Survey on Device Behavior Fingerprinting: Data Sources, Techniques, Application Scenarios, and Datasets Pedro Miguel Sánchez Sánchez José María Jorquera Valero Alberto Huertas Celdrán Gérome Bovet M. Pérez Gregorio Martínez Pérez 27 95 0 07 Aug 2020
On the Effectiveness of Image Rotation for Open Set Domain Adaptation S. Bucci Mohammad Reza Loghmani Tatiana Tommasi 52 142 0 24 Jul 2020
A Differential Game Theoretic Neural Optimizer for Training Residual Networks Guan-Horng Liu T. Chen Evangelos A. Theodorou 24 2 0 17 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer E. Liu Ramtin Keramati Sudarshan Seshadri Kelvin Guu Panupong Pasupat Emma Brunskill Percy Liang OffRL 19 5 0 12 Jul 2020
Long-Term Planning with Deep Reinforcement Learning on Autonomous Drones Ugurkan Ates 18 10 0 11 Jul 2020
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control Wenlong Huang Igor Mordatch Deepak Pathak 51 164 0 09 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning Kimin Lee Michael Laskin A. Srinivas Pieter Abbeel OffRL 11 199 0 09 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning Ron Amit Ron Meir K. Ciosek OffRL 17 71 0 04 Jul 2020
Reparameterized Variational Divergence Minimization for Stable Imitation Dilip Arumugam Debadeepta Dey Alekh Agarwal Asli Celikyilmaz E. Nouri W. Dolan 25 3 0 18 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning Eivind Meyer Amalie Heiberg Adil Rasheed Omer San 30 74 0 16 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks Georgios Papoudakis Filippos Christianos Lukas Schafer Stefano V. Albrecht OffRL 26 220 0 14 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study Marcin Andrychowicz Anton Raichuk Piotr Stańczyk Manu Orsini Sertan Girgin ... M. Geist Olivier Pietquin Marcin Michalski Sylvain Gelly Olivier Bachem OffRL 31 213 0 10 Jun 2020
A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines Callum Wilson A. Riccardi E. Minisci 11 4 0 04 Jun 2020
Novel Policy Seeking with Constrained Optimization Hao Sun Zhenghao Peng Bo Dai Jian Guo Dahua Lin Bolei Zhou 13 13 0 21 May 2020
Decentralized Deep Reinforcement Learning for a Distributed and Adaptive Locomotion Controller of a Hexapod Robot M. Schilling Kai Konen F. Ohl Timo Korthals 11 18 0 21 May 2020
Mirror Descent Policy Optimization Manan Tomar Lior Shani Yonathan Efroni Mohammad Ghavamzadeh 12 82 0 20 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions Abdulelah S. Alshehri R. Gani Fengqi You AI4CE 30 83 0 18 May 2020
Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning Kimin Lee Younggyo Seo Seunghyun Lee Honglak Lee Jinwoo Shin 40 124 0 14 May 2020
Guaranteeing Reproducibility in Deep Learning Competitions Brandon Houghton Stephanie Milani Nicholay Topin William H. Guss Katja Hofmann Diego Perez-Liebana Manuela Veloso Ruslan Salakhutdinov OOD 27 8 0 12 May 2020
Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program) Joelle Pineau Philippe Vincent-Lamarre Koustuv Sinha V. Larivière A. Beygelzimer Florence dÁlché-Buc E. Fox Hugo Larochelle 19 358 0 27 Mar 2020
An empirical investigation of the challenges of real-world reinforcement learning Gabriel Dulac-Arnold Nir Levine D. Mankowitz Jerry Li Cosmin Paduraru Sven Gowal Todd Hester OffRL 31 120 0 24 Mar 2020
Explore and Exploit with Heterotic Line Bundle Models Magdalena Larfors Robin Schneider 36 38 0 10 Mar 2020
Scalable Multi-Task Imitation Learning with Autonomous Improvement Avi Singh Eric Jang A. Irpan Daniel Kappler Murtaza Dalal Sergey Levine Mohi Khansari Chelsea Finn 48 35 0 25 Feb 2020
The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence G. Marcus VLM 32 353 0 14 Feb 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems Joao Paulo Jansch-Porto Bin Hu Geir Dullerud 25 35 0 10 Feb 2020
Ready Policy One: World Building Through Active Learning Philip J. Ball Jack Parker-Holder Aldo Pacchiano K. Choromanski Stephen J. Roberts OffRL 27 49 0 07 Feb 2020
Stacked Auto Encoder Based Deep Reinforcement Learning for Online Resource Scheduling in Large-Scale MEC Networks Feibo Jiang Kezhi Wang Li Dong Cunhua Pan Kun Yang OffRL 18 39 0 24 Jan 2020
Lyceum: An efficient and scalable ecosystem for robot learning Colin Summers Kendall Lowrey Aravind Rajeswaran S. Srinivasa E. Todorov 21 18 0 21 Jan 2020
Towards GAN Benchmarks Which Require Generalization Ishaan Gulrajani Colin Raffel Luke Metz 24 57 0 10 Jan 2020
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning Keng Wah Loon L. Graesser Milan Cvitkovic OffRL 16 13 0 28 Dec 2019
Convolutional Neural Network-based Topology Optimization (CNN-TO) By Estimating Sensitivity of Compliance from Material Distribution Yusuke Takahashi Yoshiro Suzuki A. Todoroki 10 6 0 23 Dec 2019
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning Eivind Meyer Haakon Robinson Adil Rasheed Omer San 25 65 0 18 Dec 2019
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning Shuai Lu Shuai Han Wenbo Zhou Junwei Zhang 13 26 0 13 Dec 2019
Learning to Reach Goals via Iterated Supervised Learning Dibya Ghosh Abhishek Gupta Ashwin Reddy Justin Fu Coline Devin Benjamin Eysenbach Sergey Levine 24 33 0 12 Dec 2019
Policy Optimization Reinforcement Learning with Entropy Regularization Jingbin Liu Xinyang Gu Shuai Liu 17 4 0 02 Dec 2019
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning Cameron Voloshin Hoang Minh Le Nan Jiang Yisong Yue OffRL 30 152 0 15 Nov 2019
Multi-Path Policy Optimization L. Pan Qingpeng Cai Longbo Huang 18 2 0 11 Nov 2019
Experience Sharing Between Cooperative Reinforcement Learning Agents Lucas O. Souza G. Ramos C. Ralha 16 9 0 06 Nov 2019
Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis Yang Liu Tim Althoff Jeffrey Heer 8 51 0 30 Oct 2019
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images Denis Yarats Amy Zhang Ilya Kostrikov Brandon Amos Joelle Pineau Rob Fergus DRL 42 436 0 02 Oct 2019
Quantile QT-Opt for Risk-Aware Vision-Based Robotic Grasping Cristian Bodnar A. Li Karol Hausman P. Pastor Mrinal Kalakrishnan OffRL 20 50 0 01 Oct 2019
Meta-Q-Learning Rasool Fakoor Pratik Chaudhari Stefano Soatto Alex Smola OffRL 19 145 0 30 Sep 2019