Learning Continuous Control Policies by Stochastic Value Gradients

30 October 2015

David Silver

Papers citing "Learning Continuous Control Policies by Stochastic Value Gradients"

50 / 337 papers shown

Reinforcement Learning in Robotic Motion Planning by Combined Experience-based Planning and Self-Imitation Learning

Sha Luo

Lambert Schomaker

233

11 Jun 2023

PACER: A Fully Push-forward-based Distributional Reinforcement Learning AlgorithmNeurocomputing (Neurocomputing), 2023

Chao Zhang

190

11 Jun 2023

Self-Supervised Reinforcement Learning that Transfers using Random FeaturesNeural Information Processing Systems (NeurIPS), 2023

Abhishek Gupta

247

26 May 2023

Decision-Aware Actor-Critic with Function Approximation and Theoretical GuaranteesNeural Information Processing Systems (NeurIPS), 2023

Nicolas Le Roux

322

24 May 2023

A Generalist Dynamics Model for Control

Sarah Bechtle

Martin Riedmiller

Jost Tobias Springenberg

205

18 May 2023

Safe MDP Planning by Learning Temporal Patterns of Undesirable Trajectories and Averting Negative Side EffectsInternational Conference on Automated Planning and Scheduling (ICAPS), 2023

Siow Meng Low

Akshat Kumar

Scott Sanner

127

06 Apr 2023

Diminishing Return of Value Expansion Methods in Model-Based Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023

Daniel Palenicek

M. Lutter

João Carvalho

Jan Peters

182

07 Mar 2023

Taylor TD-learningNeural Information Processing Systems (NeurIPS), 2023

238

27 Feb 2023

Leveraging Jumpy Models for Planning and Fast Learning in Robotic Domains

Jingwei Zhang

Jost Tobias Springenberg

Dushyant Rao

Martin Riedmiller

157

24 Feb 2023

Stochastic Generative Flow NetworksConference on Uncertainty in Artificial Intelligence (UAI), 2023

Moksh Jain

254

19 Feb 2023

Predictable MDP Abstraction for Unsupervised Model-Based RLInternational Conference on Machine Learning (ICML), 2023

Seohong Park

Sergey Levine

214

08 Feb 2023

DiSProD: Differentiable Symbolic Propagation of Distributions for PlanningInternational Joint Conference on Artificial Intelligence (IJCAI), 2023

271

03 Feb 2023

Extreme Q-Learning: MaxEnt RL without EntropyInternational Conference on Learning Representations (ICLR), 2023

270

103

05 Jan 2023

Latent Variable Representation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2022

Zhaolin Ren

Chenjun Xiao

Tianjun Zhang

Na Li

Sujay Sanghavi

227

17 Dec 2022

Physics-Informed Model-Based Reinforcement LearningConference on Learning for Dynamics & Control (L4DC), 2022

Adithya Ramesh

Balaraman Ravindran

188

05 Dec 2022

Predictive Sampling: Real-time Behaviour Synthesis with MuJoCo

289

119

01 Dec 2022

The Benefits of Model-Based Generalization in Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022

347

04 Nov 2022

Scalable Multi-Agent Reinforcement Learning through Intelligent Information AggregationInternational Conference on Machine Learning (ICML), 2022

Karthik Gopalakrishnan

H. Balakrishnan

212

03 Nov 2022

Integrated Decision and Control for High-Level Automated Vehicles by Mixed Policy Gradient and Its Experiment Verification

Shengbo Eben Li

120

19 Oct 2022

Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization AlgorithmNeural Information Processing Systems (NeurIPS), 2022

Ashish Kumar Jayant

S. Bhatnagar

OffRL

165

14 Oct 2022

ControlVAE: Model-Based Learning of Generative Controllers for Physics-Based CharactersACM Transactions on Graphics (TOG), 2022

169

12 Oct 2022

Training Efficient Controllers via Analytic Policy GradientIEEE International Conference on Robotics and Automation (ICRA), 2022

280

26 Sep 2022

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One ObjectiveInternational Conference on Learning Representations (ICLR), 2022

Homanga Bharadhwaj

336

18 Sep 2022

Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

Shen Zhang

161

16 Sep 2022

A model-based approach to meta-Reinforcement Learning: Transformers and tree searchThe European Symposium on Artificial Neural Networks (ESANN), 2022

Brieuc Pinon

Jean-Charles Delvenne

Raphaël Jungers

OffRL

220

24 Aug 2022

Entropy Enhanced Multi-Agent Coordination Based on Hierarchical Graph Learning for Continuous Action SpaceIEEE Transactions on Cognitive and Developmental Systems (IEEE TCDS), 2022

132

23 Aug 2022

Efficient Planning in a Compact Latent Action SpaceInternational Conference on Learning Representations (ICLR), 2022

Tianjun Zhang

289

22 Aug 2022

MPC-based Imitation Learning for Safe and Human-like Autonomous Driving

F. S. Acerbo

Jan Swevers

Tinne Tuytelaars

Tong Duy Son

24 Jun 2022

Auto-Encoding Adversarial Imitation Learning

223

22 Jun 2022

A Survey on Model-based Reinforcement LearningScience China Information Sciences (Sci. China Inf. Sci.), 2022

346

152

19 Jun 2022

Autonomous Platoon Control with Integrated Deep Reinforcement Learning and Dynamic ProgrammingIEEE Internet of Things Journal (IEEE IoT J.), 2022

298

15 Jun 2022

Open-Ended Learning Strategies for Learning Complex Locomotion Skills

Fangqin Zhou

Joaquin Vanschoren

237

14 Jun 2022

Reinforcement Learning for Vision-based Object Manipulation with Non-parametric Policy and Action PrimitivesIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2021

167

12 Jun 2022

A Meta Reinforcement Learning Approach for Predictive Autoscaling in the CloudKnowledge Discovery and Data Mining (KDD), 2022

...

224

31 May 2022

Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation

499

22 May 2022

Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022

274

20 Apr 2022

Accelerated Policy Learning with Parallel Differentiable SimulationInternational Conference on Learning Representations (ICLR), 2022

248

126

14 Apr 2022

Revisiting Model-based Value Expansion

Daniel Palenicek

M. Lutter

Jan Peters

191

28 Mar 2022

Investigating Compounding Prediction Errors in Learned Dynamics Models

222

17 Mar 2022

Strategic Maneuver and Disruption with Reinforcement Learning Approaches for Multi-Agent CoordinationThe Journal of Defence Modeling and Simulation: Applications, Methodology, Technology (JDMS), 2022

...

165

17 Mar 2022

Retrieval-Augmented Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022

...

406

17 Feb 2022

GrASP: Gradient-Based Affordance Selection for Planning

178

08 Feb 2022

A Temporal-Difference Approach to Policy Gradient EstimationInternational Conference on Machine Learning (ICML), 2022

405

04 Feb 2022

Tutorial on amortized optimization

Brandon Amos

OffRL

812

01 Feb 2022

Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point ProcessesAAAI Conference on Artificial Intelligence (AAAI), 2022

242

29 Jan 2022

Joint Differentiable Optimization and Verification for Certified Reinforcement LearningInternational Conference on Cyber-Physical Systems (ICCPS), 2022

188

28 Jan 2022

Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective

Ryan K. Tan

Yang Liu

Lei Xie

293

21 Jan 2022

Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic

192

16 Dec 2021

Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation

Markus Wulfmeier

220

01 Dec 2021

Generalized Decision Transformer for Offline Hindsight Information MatchingInternational Conference on Learning Representations (ICLR), 2021

Hiroki Furuta

Y. Matsuo

S. Gu

OffRL

261

118

19 Nov 2021