Learning Continuous Control Policies by Stochastic Value Gradients

30 October 2015

David Silver

Papers citing "Learning Continuous Control Policies by Stochastic Value Gradients"

50 / 337 papers shown

D2 Actor Critic: Diffusion Actor Meets Distributional Critic

265

03 Oct 2025

First Order Model-Based RL through Decoupled Backpropagation

Joseph Amigo

Rooholla Khorrambakht

Elliot Chane-Sane

Nicolas Mansard

Ludovic Righetti

161

29 Aug 2025

Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI

240

28 Aug 2025

Reparameterization Proximal Policy Optimization

184

08 Aug 2025

Test-time Offline Reinforcement Learning on Goal-related Experience

216

24 Jul 2025

Relative Entropy Pathwise Policy Optimization

Amir-massoud Farahmand

Igor Gilitschenski

369

15 Jul 2025

Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces

Jiamin He

A. Rupam Mahmood

Martha White

103

19 Jun 2025

AMOR: Adaptive Character Control through Multi-Objective Reinforcement Learning

269

29 May 2025

Wasserstein Policy Optimization

385

01 May 2025

Differentiable Information Enhanced Model-Based Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2025

251

03 Mar 2025

Accelerating Model-Based Reinforcement Learning with State-Space World Models

268

27 Feb 2025

Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps

Linfeng Zhao

Lawson L. S. Wong

355

16 Dec 2024

Stabilizing Reinforcement Learning in Differentiable Multiphysics SimulationInternational Conference on Learning Representations (ICLR), 2024

Eliot Xing

Vernon Luk

Jean Oh

418

16 Dec 2024

Guiding Reinforcement Learning with Incomplete System DynamicsIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024

267

22 Oct 2024

Distribution Guided Active Feature Acquisition

Yang Li

Junier Oliva

283

04 Oct 2024

Online Control-Informed Learning

367

04 Oct 2024

Grounded Answers for Multi-agent Decision-making Problem through Generative World ModelNeural Information Processing Systems (NeurIPS), 2024

359

03 Oct 2024

Pessimistic Iterative Planning with RNNs for Robust POMDPs

Maris F. L. Galesloot

424

16 Aug 2024

A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or SubgoalsInternational Conference on Learning Representations (ICLR), 2024

400

11 Aug 2024

Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

375

01 Aug 2024

Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning

Zakariae El Asri

Olivier Sigaud

Nicolas Thome

213

02 Jul 2024

Diffusion Spectral Representation for Reinforcement Learning

Bo Dai

329

23 Jun 2024

Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice

239

19 May 2024

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

282

06 May 2024

Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration

Yibo Wang

Jiang Zhao

OffRL OnRL

244

31 Mar 2024

$Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control$

Robust Model Based Reinforcement Learning Using

\mathcal{L}_1

222

21 Mar 2024

SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning

311

14 Mar 2024

Generalizing Cooperative Eco-driving via Multi-residual Task Learning

163

07 Mar 2024

Do Transformer World Models Give Better Policy Gradients?

Pierre-Luc Bacon

272

07 Feb 2024

Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence

300

05 Feb 2024

Stochastic Amortization: A Unified Approach to Accelerate Feature and Data AttributionNeural Information Processing Systems (NeurIPS), 2024

Tatsunori Hashimoto

325

29 Jan 2024

Bridging State and History Representations: Understanding Self-Predictive RLInternational Conference on Learning Representations (ICLR), 2024

Pierre-Luc Bacon

408

17 Jan 2024

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots

Thomas Lampe

A. Abdolmaleki

Sarah Bechtle

Sandy H. Huang

Jost Tobias Springenberg

...

Markus Wulfmeier

Martin Riedmiller

199

18 Dec 2023

A Tractable Inference Perspective of Offline RLNeural Information Processing Systems (NeurIPS), 2023

509

31 Oct 2023

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical AlgorithmsNeural Information Processing Systems (NeurIPS), 2023

275

30 Oct 2023

On Representation Complexity of Model-based and Model-free Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2023

374

03 Oct 2023

Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned

Han Bao

Raphaël Jungers

Jean-Charles Delvenne

OffRL

193

28 Sep 2023

Deep Learning in Deterministic Computational Mechanics

L. Herrmann

Stefan Kollmannsberger

AI4CE PINN

313

27 Sep 2023

How to Fine-tune the Model: Unified Model Shift and Model Bias Policy OptimizationNeural Information Processing Systems (NeurIPS), 2023

305

22 Sep 2023

A Review on Robot Manipulation Methods in Human-Robot Interactions

176

09 Sep 2023

Thinker: Learning to Plan and ActNeural Information Processing Systems (NeurIPS), 2023

294

27 Jul 2023

Meta-Value Learning: a General Framework for Learning with Learning Awareness

Tim Cooijmans

Milad Aghajohari

Rameswar Panda

235

17 Jul 2023

Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based ModelsConference on Robot Learning (CoRL), 2023

T. Westenbroek

Jacob Levy

David Fridovich-Keil

234

16 Jul 2023

Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement LearningIEEE/CAA Journal of Automatica Sinica (IEEE/CAA JAS), 2023

339

16 Jul 2023

Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning

248

06 Jul 2023

λ

-models: Effective Decision-Aware Reinforcement Learning with Latent Models

Amir-massoud Farahmand

357

30 Jun 2023

Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysisNeural Information Processing Systems (NeurIPS), 2023

365

29 Jun 2023

Provably Convergent Policy Optimization via Metric-aware Trust Region Methods

220

25 Jun 2023

Simplified Temporal Consistency Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023

258

15 Jun 2023

Deep Generative Models for Decision-Making and Control

Michael Janner

295

15 Jun 2023