v1v2v3v4v5v6 (latest)

Continuous control with deep reinforcement learning

9 September 2015

Alexander Pritzel

David Silver

Papers citing "Continuous control with deep reinforcement learning"

50 / 4,795 papers shown

Analyzing the Hidden Activations of Deep Policy Networks: Why Representation Matters

11 Mar 2021

Maximum Entropy RL (Provably) Solves Some Robust RL ProblemsInternational Conference on Learning Representations (ICLR), 2021

Benjamin Eysenbach

Sergey Levine

OOD

270

220

10 Mar 2021

Decentralized Circle Formation Control for Fish-like Robots in the Real-world via Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2021

123

09 Mar 2021

Learning to Play Soccer From Scratch: Sample-Efficient Emergent Coordination through Curriculum-Learning and CompetitionIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021

Pavan Samtani

Francisco Leiva

Javier Ruiz-del-Solar

09 Mar 2021

Model-free Policy Learning with Reward GradientsInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2021

282

09 Mar 2021

Domain-Robust Visual Imitation Learning with Mutual Information ConstraintsInternational Conference on Learning Representations (ICLR), 2021

Edoardo Cetin

Oya Celiktutan

OOD DRL

196

08 Mar 2021

Instabilities of Offline RL with Pre-Trained Neural RepresentationInternational Conference on Machine Learning (ICML), 2021

270

08 Mar 2021

A Crash Course on Reinforcement Learning

F. Yaghmaie

L. Ljung

143

08 Mar 2021

Learning a State Representation and Navigation in Cluttered and Dynamic EnvironmentsIEEE Robotics and Automation Letters (RA-L), 2021

Marco Hutter

220

07 Mar 2021

Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement LearningIEEE International Joint Conference on Neural Network (IJCNN), 2021

165

06 Mar 2021

Can You Fix My Neural Network? Real-Time Adaptive Waveform Synthesis for Resilient Wireless Signal Classification

Salvatore D’oro

Francesco Restuccia

Tommaso Melodia

130

05 Mar 2021

Deep reinforcement learning in medical imaging: A literature review

157

168

05 Mar 2021

Neuromechanics-based Deep Reinforcement Learning of Neurostimulation Control in FES cyclingInternational IEEE/EMBS Conference on Neural Engineering (NER), 2021

Nat Wannawas

Mahendran Subramanian

A. Faisal

171

04 Mar 2021

Improving Computational Efficiency in Visual Reinforcement Learning via Stored EmbeddingsNeural Information Processing Systems (NeurIPS), 2021

Pieter Abbeel

212

04 Mar 2021

An RL-Based Adaptive Detection Strategy to Secure Cyber-Physical Systems

Ipsita Koley

Sunandan Adhikary

Soumyajit Dey

212

04 Mar 2021

Reinforcement Learning for Orientation Estimation Using Inertial Sensors with Performance GuaranteeIEEE International Conference on Robotics and Automation (ICRA), 2021

Liang Hu

Yujie Tang

Zhipeng Zhou

Wei Pan

211

03 Mar 2021

Addressing Action Oscillations through Learning Policy InertiaAAAI Conference on Artificial Intelligence (AAAI), 2021

Jianye Hao

124

03 Mar 2021

Foresee then Evaluate: Decomposing Value Estimation with Latent Future PredictionAAAI Conference on Artificial Intelligence (AAAI), 2021

Jianye Hao

195

03 Mar 2021

Design of an Affordable Prosthetic Arm Equipped with Deep Learning Vision-Based Manipulation

A. Imran

William Escobar

F. Barez

141

03 Mar 2021

Offline Reinforcement Learning with Pseudometric LearningInternational Conference on Machine Learning (ICML), 2021

Nino Vieillard

Olivier Pietquin

196

02 Mar 2021

Mind Mappings: Enabling Efficient Algorithm-Accelerator Mapping Space SearchInternational Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2021

Christopher W. Fletcher

197

108

02 Mar 2021

Safe Learning of Uncertain Environments

149

02 Mar 2021

Sample Complexity and Overparameterization Bounds for Temporal Difference Learning with Neural Network ApproximationIEEE Transactions on Automatic Control (IEEE TAC), 2021

187

02 Mar 2021

Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational GraphComputer Vision and Pattern Recognition (CVPR), 2021

Xin Ye

Yezhou Yang

275

01 Mar 2021

Decision Making in Monopoly using a Hybrid Deep Reinforcement Learning ApproachIEEE Transactions on Emerging Topics in Computational Intelligence (IEEE TETCI), 2021

407

01 Mar 2021

Sim-to-Real Transfer for Robotic Manipulation with Tactile SensoryIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021

197

28 Feb 2021

Revisiting Peng's Q(

λ

) for Modern Reinforcement LearningInternational Conference on Machine Learning (ICML), 2021

146

27 Feb 2021

Reducing Conservativeness Oriented Offline Reinforcement Learning

217

27 Feb 2021

Multi-Agent Path Planning based on MPC and DDPG

148

26 Feb 2021

Off-Policy Imitation Learning from ObservationsNeural Information Processing Systems (NeurIPS), 2021

210

25 Feb 2021

Bias-reduced Multi-step Hindsight Experience Replay for Efficient Multi-goal Reinforcement Learning

178

25 Feb 2021

Improved Regret Bound and Experience Replay in Regularized Policy IterationInternational Conference on Machine Learning (ICML), 2021

124

25 Feb 2021

$Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret$

Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with

\sqrt{T}

RegretInternational Conference on Machine Learning (ICML), 2021

Asaf B. Cassel

Tomer Koren

OffRL

195

25 Feb 2021

Deep Reinforcement Learning for Safe Landing Site Selection with Concurrent Consideration of Divert Maneuvers

24 Feb 2021

Hybrid Car-Following Strategy based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise ControlIEEE Transactions on Automation Science and Engineering (T-ASE), 2021

197

24 Feb 2021

Memory-based Deep Reinforcement Learning for POMDPsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021

Lingheng Meng

R. Gorbet

Dana Kulic

367

122

24 Feb 2021

Combining Off and On-Policy Training in Model-Based Reinforcement Learning

Alexandre Borges

Arlindo L. Oliveira

172

24 Feb 2021

FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive ParallelismDesign Automation Conference (DAC), 2021

Jenny Yang

Seongmin Hong

Joo-Young Kim

112

24 Feb 2021

Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal LogicIEEE Robotics and Automation Letters (RA-L), 2021

Mingyu Cai

Mohammadhosein Hasanbeig

Shaoping Xiao

Alessandro Abate

Z. Kan

736

24 Feb 2021

Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL

255

23 Feb 2021

Doubly Robust Off-Policy Actor-Critic: Convergence and OptimalityInternational Conference on Machine Learning (ICML), 2021

271

23 Feb 2021

Differentiable Logic Machines

Jianyi Zhang

304

23 Feb 2021

Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and model

Yang Guan

Jingliang Duan

Shengbo Eben Li

Jie Li

Jianyu Chen

B. Cheng

OffRL

160

23 Feb 2021

DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2021

343

23 Feb 2021

Exploring Supervised and Unsupervised Rewards in Machine TranslationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2021

123

22 Feb 2021

Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement LearningAdaptive Agents and Multi-Agent Systems (AAMAS), 2021

104

22 Feb 2021

Reinforcement Learning with Prototypical RepresentationsInternational Conference on Machine Learning (ICML), 2021

322

246

22 Feb 2021

Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy OptimizationConference on Uncertainty in Artificial Intelligence (UAI), 2021

252

22 Feb 2021

Improved Learning of Robot Manipulation Tasks via Tactile Intrinsic MotivationIEEE Robotics and Automation Letters (RA-L), 2021

Nikola Vulin

Sammy Christen

Stefan Stevšić

Otmar Hilliges

123

22 Feb 2021

Dealing with Non-Stationarity in MARL via Trust-Region DecompositionInternational Conference on Learning Representations (ICLR), 2021

364

21 Feb 2021