ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.09142
  4. Cited By
Learning Continuous Control Policies by Stochastic Value Gradients

Learning Continuous Control Policies by Stochastic Value Gradients

30 October 2015
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
ArXivPDFHTML

Papers citing "Learning Continuous Control Policies by Stochastic Value Gradients"

50 / 329 papers shown
Title
Differentiable Information Enhanced Model-Based Reinforcement Learning
Xiaoyuan Zhang
Xinyan Cai
Bo Liu
Weidong Huang
Song-Chun Zhu
Siyuan Qi
Y. Yang
53
0
0
03 Mar 2025
Accelerating Model-Based Reinforcement Learning with State-Space World Models
Accelerating Model-Based Reinforcement Learning with State-Space World Models
Maria Krinner
Elie Aljalbout
Angel Romero
Davide Scaramuzza
OffRL
76
1
0
27 Feb 2025
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down
  Maps
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
Linfeng Zhao
Lawson L. S. Wong
84
1
0
16 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
89
0
0
16 Dec 2024
Guiding Reinforcement Learning with Incomplete System Dynamics
Guiding Reinforcement Learning with Incomplete System Dynamics
Shuyuan Wang
Jingliang Duan
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
Lixian Zhang
24
0
0
22 Oct 2024
Distribution Guided Active Feature Acquisition
Distribution Guided Active Feature Acquisition
Yang Li
Junier Oliva
29
0
0
04 Oct 2024
Online Control-Informed Learning
Online Control-Informed Learning
Zihao Liang
Tianyu Zhou
Zehui Lu
Shaoshuai Mou
33
1
0
04 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through
  Generative World Model
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
31
3
0
03 Oct 2024
Pessimistic Iterative Planning for Robust POMDPs
Pessimistic Iterative Planning for Robust POMDPs
Maris F. L. Galesloot
Marnix Suilen
T. D. Simão
Steven Carr
M. Spaan
Ufuk Topcu
Nils Jansen
53
2
0
16 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from
  Contrastive RL without Rewards, Demonstrations, or Subgoals
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
48
1
0
11 Aug 2024
Discretizing Continuous Action Space with Unimodal Probability
  Distributions for On-Policy Reinforcement Learning
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
Yuanyang Zhu
Zhi Wang
Yuanheng Zhu
Chunlin Chen
Dongbin Zhao
28
0
0
01 Aug 2024
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style
  Reinforcement Learning
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Zakariae El Asri
Olivier Sigaud
Nicolas Thome
45
0
0
02 Jul 2024
Diffusion Spectral Representation for Reinforcement Learning
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
29
3
0
23 Jun 2024
Deep Dive into Model-free Reinforcement Learning for Biological and
  Robotic Systems: Theory and Practice
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
39
0
0
19 May 2024
Sequence Compression Speeds Up Credit Assignment in Reinforcement
  Learning
Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Aditya A. Ramesh
Kenny Young
Louis Kirsch
Jürgen Schmidhuber
34
1
0
06 May 2024
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning
Minh Nguyen
Chandrajit Bajaj
25
0
0
24 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active
  Online Exploration
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
32
0
0
31 Mar 2024
Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive
  Control
Robust Model Based Reinforcement Learning Using L1\mathcal{L}_1L1​ Adaptive Control
Minjun Sung
Sambhu H. Karumanchi
Aditya Gahlawat
N. Hovakimyan
30
1
0
21 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
30
11
0
14 Mar 2024
Generalizing Cooperative Eco-driving via Multi-residual Task Learning
Generalizing Cooperative Eco-driving via Multi-residual Task Learning
Vindula Jayawardana
Sirui Li
Cathy Wu
Y. Farid
Kentaro Oguchi
30
3
0
07 Mar 2024
Do Transformer World Models Give Better Policy Gradients?
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. DÓro
Pierre-Luc Bacon
42
4
0
07 Feb 2024
Understanding What Affects Generalization Gap in Visual Reinforcement
  Learning: Theory and Empirical Evidence
Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence
Jiafei Lyu
Le Wan
Xiu Li
Zongqing Lu
CML
OffRL
43
4
0
05 Feb 2024
Stochastic Amortization: A Unified Approach to Accelerate Feature and
  Data Attribution
Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution
Ian Covert
Chanwoo Kim
Su-In Lee
James Zou
Tatsunori Hashimoto
TDI
35
9
0
29 Jan 2024
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
24
21
0
17 Jan 2024
Mastering Stacking of Diverse Shapes with Large-Scale Iterative
  Reinforcement Learning on Real Robots
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Thomas Lampe
A. Abdolmaleki
Sarah Bechtle
Sandy H. Huang
Jost Tobias Springenberg
...
Markus Wulfmeier
Jingwei Zhang
Francesco Nori
N. Heess
Martin Riedmiller
OffRL
40
9
0
18 Dec 2023
A Tractable Inference Perspective of Offline RL
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Guy Van den Broeck
Mathias Niepert
Yitao Liang
OffRL
36
1
0
31 Oct 2023
Model-Based Reparameterization Policy Gradient Methods: Theory and
  Practical Algorithms
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Shenao Zhang
Boyi Liu
Zhaoran Wang
Tuo Zhao
35
2
0
30 Oct 2023
On Representation Complexity of Model-based and Model-free Reinforcement
  Learning
On Representation Complexity of Model-based and Model-free Reinforcement Learning
Hanlin Zhu
Baihe Huang
Stuart Russell
OffRL
33
3
0
03 Oct 2023
Efficiency Separation between RL Methods: Model-Free, Model-Based and
  Goal-Conditioned
Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned
Han Bao
Raphaël Jungers
Jean-Charles Delvenne
OffRL
21
1
0
28 Sep 2023
Deep Learning in Deterministic Computational Mechanics
Deep Learning in Deterministic Computational Mechanics
L. Herrmann
Stefan Kollmannsberger
AI4CE
PINN
43
0
0
27 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy
  Optimization
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Hai Zhang
Hang Yu
Junqiao Zhao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
19
9
0
22 Sep 2023
A Review on Robot Manipulation Methods in Human-Robot Interactions
A Review on Robot Manipulation Methods in Human-Robot Interactions
Haoxu Zhang
P. Kebria
Shady M. K. Mohamed
Samson Yu
Saeid Nahavandi
29
0
0
09 Sep 2023
Thinker: Learning to Plan and Act
Thinker: Learning to Plan and Act
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAG
OffRL
LRM
30
5
0
27 Jul 2023
Meta-Value Learning: a General Framework for Learning with Learning
  Awareness
Meta-Value Learning: a General Framework for Learning with Learning Awareness
Tim Cooijmans
Milad Aghajohari
Rameswar Panda
27
6
0
17 Jul 2023
Enabling Efficient, Reliable Real-World Reinforcement Learning with
  Approximate Physics-Based Models
Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models
T. Westenbroek
Jacob Levy
David Fridovich-Keil
38
0
0
16 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement
  Learning
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
40
4
0
16 Jul 2023
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill
  Learning
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning
Andrew Levy
Sreehari Rammohan
A. Allievi
S. Niekum
George Konidaris
36
5
0
06 Jul 2023
$λ$-models: Effective Decision-Aware Reinforcement Learning with
  Latent Models
λλλ-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
59
0
0
30 Jun 2023
Would I have gotten that reward? Long-term credit assignment by
  counterfactual contribution analysis
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
29
3
0
29 Jun 2023
Provably Convergent Policy Optimization via Metric-aware Trust Region
  Methods
Provably Convergent Policy Optimization via Metric-aware Trust Region Methods
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
39
3
0
25 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
30
13
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Deep Generative Models for Decision-Making and Control
Michael Janner
34
1
0
15 Jun 2023
Reinforcement Learning in Robotic Motion Planning by Combined
  Experience-based Planning and Self-Imitation Learning
Reinforcement Learning in Robotic Motion Planning by Combined Experience-based Planning and Self-Imitation Learning
Sha Luo
Lambert Schomaker
27
9
0
11 Jun 2023
PACER: A Fully Push-forward-based Distributional Reinforcement Learning
  Algorithm
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm
Wensong Bai
Chao Zhang
Yichao Fu
Lingwei Peng
Hui Qian
Bin Dai
32
1
0
11 Jun 2023
Self-Supervised Reinforcement Learning that Transfers using Random
  Features
Self-Supervised Reinforcement Learning that Transfers using Random Features
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Kaipeng Zhang
Abhishek Gupta
OffRL
SSL
36
6
0
26 May 2023
Decision-Aware Actor-Critic with Function Approximation and Theoretical
  Guarantees
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees
Sharan Vaswani
A. Kazemi
Reza Babanezhad
Nicolas Le Roux
OffRL
32
3
0
24 May 2023
A Generalist Dynamics Model for Control
A Generalist Dynamics Model for Control
Ingmar Schubert
Jingwei Zhang
Jake Bruce
Sarah Bechtle
Emilio Parisotto
Martin Riedmiller
Jost Tobias Springenberg
Arunkumar Byravan
Leonard Hasenclever
N. Heess
AI4CE
41
30
0
18 May 2023
Safe MDP Planning by Learning Temporal Patterns of Undesirable
  Trajectories and Averting Negative Side Effects
Safe MDP Planning by Learning Temporal Patterns of Undesirable Trajectories and Averting Negative Side Effects
Siow Meng Low
Akshat Kumar
Scott Sanner
11
2
0
06 Apr 2023
Diminishing Return of Value Expansion Methods in Model-Based
  Reinforcement Learning
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning
Daniel Palenicek
M. Lutter
João Carvalho
Jan Peters
29
4
0
07 Mar 2023
Taylor TD-learning
Taylor TD-learning
Michele Garibbo
Maxime Robeyns
Laurence Aitchison
OffRL
23
1
0
27 Feb 2023
1234567
Next