v1v2v3v4v5v6 (latest)

Continuous control with deep reinforcement learning

9 September 2015

Alexander Pritzel

David Silver

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,796 papers shown

Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial

Amal Feriani

Ekram Hossain

374

315

06 Nov 2020

Adversarial Skill Learning for Robust Manipulation

148

06 Nov 2020

Sample-efficient Reinforcement Learning in Robotic Table Tennis

302

06 Nov 2020

Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments

285

05 Nov 2020

Learning a Decentralized Multi-arm Motion Planner

Huy Ha

Jingxi Xu

Shuran Song

258

05 Nov 2020

Learning Trajectories for Visual-Inertial System Calibration via Model-based Heuristic Deep Reinforcement Learning

Roland Siegwart

111

04 Nov 2020

Generative Inverse Deep Reinforcement Learning for Online Recommendation

Liming Zhu

125

04 Nov 2020

A Study of Policy Gradient on a Class of Exactly Solvable Models

Gavin McCracken

Colin Daniels

Rosie Zhao

Anna M. Brandenberger

Prakash Panangaden

Doina Precup

143

03 Nov 2020

Representation Matters: Improving Perception and Exploration for Robotics

Markus Wulfmeier

...

Martin Riedmiller

307

03 Nov 2020

Amortized Variational Deep Q Network

140

03 Nov 2020

Episodic Linear Quadratic Regulators with Low-rank Transitions

Tianyu Wang

Lin F. Yang

180

03 Nov 2020

Reinforcement Learning with Efficient Active Feature Acquisition

Sebastian Tschiatschek

OffRL

166

02 Nov 2020

Observation Space Matters: Benchmark and Optimization AlgorithmIEEE International Conference on Robotics and Automation (ICRA), 2020

J. Kim

Sehoon Ha

OOD OffRL

216

02 Nov 2020

Optimizing Mixed Autonomy Traffic Flow With Decentralized Autonomous Vehicles and Multi-Agent RL

125

30 Oct 2020

Recovery RL: Safe Reinforcement Learning with Learned Recovery ZonesIEEE Robotics and Automation Letters (RA-L), 2020

299

268

29 Oct 2020

DeepQ Stepper: A framework for reactive dynamic walking on uneven terrainIEEE International Conference on Robotics and Automation (ICRA), 2020

Avadesh Meduri

Majid Khadiv

Ludovic Righetti

134

28 Oct 2020

Learning to Represent Action Values as a Hypergraph on the Action VerticesInternational Conference on Learning Representations (ICLR), 2020

Arash Tavakoli

Mehdi Fatemi

Petar Kormushev

158

28 Oct 2020

Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model EnsemblesConference on Robot Learning (CoRL), 2020

Tim Seyde

Wilko Schwarting

S. Karaman

Daniela Rus

264

27 Oct 2020

Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle ApplicationsIEEE Transactions on Transportation Electrification (TE), 2020

27 Oct 2020

Batch Reinforcement Learning with a Nonparametric Off-Policy Policy GradientIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020

Samuele Tosatto

João Carvalho

Jan Peters

OffRL

234

27 Oct 2020

Behavior Priors for Efficient Reinforcement LearningJournal of machine learning research (JMLR), 2020

...

Wojciech M. Czarnecki

Arun Ahuja

Yee Whye Teh

N. Heess

235

27 Oct 2020

Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous ControlsJournal of machine learning research (JMLR), 2020

Jeongho Kim

Jaeuk Shin

Insoon Yang

162

27 Oct 2020

Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation SkillsIEEE International Conference on Robotics and Automation (ICRA), 2020

Samuele Tosatto

Georgia Chalvatzaki

Jan Peters

212

26 Oct 2020

Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-NetworkInternational Conference on Control, Automation, Robotics and Vision (ICARCV), 2020

Niranjan Deshpande

Dominique Vaufreydaz

A. Spalanzani

120

26 Oct 2020

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2020

Pieter Abbeel

234

26 Oct 2020

How to Make Deep RL Work in Practice

188

25 Oct 2020

Dynamic Adversarial Patch for Evading Object Detection Models

171

25 Oct 2020

Improving the Exploration of Deep Reinforcement Learning in Continuous Domains using Planning for Policy Search

24 Oct 2020

Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning

Xiyao Wang

Junge Zhang

Wenzhen Huang

Qiyue Yin

150

24 Oct 2020

Stabilizing Transformer-Based Action Sequence Generation For Q-Learning

200

23 Oct 2020

Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning

304

23 Oct 2020

Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments

158

22 Oct 2020

Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking

Hongda Shen

Eren Kurshan

209

21 Oct 2020

Improving Generalization in Reinforcement Learning with Mixture RegularizationNeural Information Processing Systems (NeurIPS), 2020

364

129

21 Oct 2020

Iterative Amortized Policy OptimizationNeural Information Processing Systems (NeurIPS), 2020

Joseph Marino

Alexandre Piché

Alessandro Davide Ialongo

Yisong Yue

OffRL

256

20 Oct 2020

Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification

Martin Riedmiller

316

20 Oct 2020

Quality of service based radar resource management using deep reinforcement learningInternational Radar Conference (RADAR), 2020

S. Durst

S. Brüggenwirth

20 Oct 2020

Survivable Hyper-Redundant Robotic Arm with Bayesian Policy Morphing

Sayyed Jaffar Ali Raza

Apan Dastider

Mingjie Lin

20 Oct 2020

Proximal Policy Gradient: PPO with Policy Gradient

114

20 Oct 2020

Dream and Search to Control: Latent Space Planning for Continuous Control

165

19 Oct 2020

Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous ControlConference on Robot Learning (CoRL), 2020

Guangzhi Tang

Neelesh Kumar

Raymond Yoo

Konstantinos Michmizos

205

101

19 Oct 2020

What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator

Jianye Hao

...

260

19 Oct 2020

Chance-Constrained Control with Lexicographic Deep Reinforcement LearningIEEE Control Systems Letters (L-CSS), 2020

Alessandro Giuseppi

A. Pietrabissa

19 Oct 2020

Softmax Deep Double Deterministic Policy GradientsNeural Information Processing Systems (NeurIPS), 2020

Ling Pan

Qingpeng Cai

Longbo Huang

233

116

19 Oct 2020

Belief-Grounded Networks for Accelerated Robot Learning under Partial ObservabilityConference on Robot Learning (CoRL), 2020

305

19 Oct 2020

DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware Obfuscator for the Enhancement of IDS

Mohit Sewak

S. K. Sahay

Hemant Rathore

114

16 Oct 2020

On the Guaranteed Almost Equivalence between Imitation Learning from Observation and DemonstrationIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020

136

16 Oct 2020

Decentralized Multi-Agent Pursuit using Deep Reinforcement LearningIEEE Robotics and Automation Letters (RA-L), 2020

246

120

16 Oct 2020

A Learning Approach to Robot-Agnostic Force-Guided High Precision Assembly

Jieliang Luo

Hui Li

249

15 Oct 2020

Deep Learning of Koopman Representation for Control

101

131

15 Oct 2020