1510.09142
Cited By
Learning Continuous Control Policies by Stochastic Value Gradients
30 October 2015
N. Heess
Greg Wayne
David Silver
Timothy Lillicrap
Yuval Tassa
Tom Erez
Papers citing
"Learning Continuous Control Policies by Stochastic Value Gradients"
50 / 337 papers shown
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
Lunjun Zhang
Shuo Han
Hanrui Lyu
Bradly C. Stadie
OffRL
265
1
0
03 Oct 2025
First Order Model-Based RL through Decoupled Backpropagation
Joseph Amigo
Rooholla Khorrambakht
Elliot Chane-Sane
Nicolas Mansard
Ludovic Righetti
161
0
0
29 Aug 2025
Beyond Prediction: Reinforcement Learning as the Defining Leap in Healthcare AI
Dilruk Perera
Gousia Habib
Qianyi Xu
Daniel J. Tan
Kai He
Erik Cambria
Mengling Feng
OffRL
AI4TS
240
0
0
28 Aug 2025
Reparameterization Proximal Policy Optimization
Hai Zhong
Xun Wang
Zhuoran Li
Longbo Huang
184
0
0
08 Aug 2025
Test-time Offline Reinforcement Learning on Goal-related Experience
Marco Bagatella
Mert Albaba
Jonas Hübotter
Georg Martius
Andreas Krause
OffRL
216
4
0
24 Jul 2025
Relative Entropy Pathwise Policy Optimization
C. Voelcker
Axel Brunnbauer
Marcel Hussing
Michal Nauman
Pieter Abbeel
Eric Eaton
Radu Grosu
Amir-massoud Farahmand
Igor Gilitschenski
369
0
0
15 Jul 2025
Distribution Parameter Actor-Critic: Shifting the Agent-Environment Boundary for Diverse Action Spaces
Jiamin He
A. Rupam Mahmood
Martha White
103
0
0
19 Jun 2025
AMOR: Adaptive Character Control through Multi-Objective Reinforcement Learning
Lucas N. Alegre
Agon Serifi
Ruben Grandia
David Müller
Espen Knoop
Moritz Bächer
268
1
0
29 May 2025
Wasserstein Policy Optimization
David Pfau
Ian Davies
Diana Borsa
Joao G. M. Araujo
Brendan D. Tracey
H. V. Hasselt
385
3
0
01 May 2025
Differentiable Information Enhanced Model-Based Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2025
Xiaoyuan Zhang
Xinyan Cai
Bo Liu
Weidong Huang
Song-Chun Zhu
Siyuan Qi
Y. Yang
248
3
0
03 Mar 2025
Accelerating Model-Based Reinforcement Learning with State-Space World Models
Maria Krinner
Elie Aljalbout
Angel Romero
Davide Scaramuzza
OffRL
268
8
0
27 Feb 2025
Learning to Navigate in Mazes with Novel Layouts using Abstract Top-down Maps
Linfeng Zhao
Lawson L. S. Wong
355
2
0
16 Dec 2024
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
International Conference on Learning Representations (ICLR), 2024
Eliot Xing
Vernon Luk
Jean Oh
418
11
0
16 Dec 2024
Guiding Reinforcement Learning with Incomplete System Dynamics
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
Shuyuan Wang
Jingliang Duan
Nathan P. Lawrence
Philip D. Loewen
M. Forbes
R. Bhushan Gopaluni
Lixian Zhang
267
3
0
22 Oct 2024
Distribution Guided Active Feature Acquisition
Yang Li
Junier Oliva
282
1
0
04 Oct 2024
Online Control-Informed Learning
Zihao Liang
Tianyu Zhou
Zehui Lu
Shaoshuai Mou
367
5
0
04 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Neural Information Processing Systems (NeurIPS), 2024
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
358
5
0
03 Oct 2024
Pessimistic Iterative Planning with RNNs for Robust POMDPs
Maris F. L. Galesloot
Marnix Suilen
T. D. Simão
Steven Carr
M. Spaan
Ufuk Topcu
Nils Jansen
424
2
0
16 Aug 2024
A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals
International Conference on Learning Representations (ICLR), 2024
Grace Liu
Michael Tang
Benjamin Eysenbach
OffRL
400
9
0
11 Aug 2024
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Yuanyang Zhu
Zhi Wang
Yuanheng Zhu
Chunlin Chen
Dongbin Zhao
375
3
0
01 Aug 2024
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Zakariae El Asri
Olivier Sigaud
Nicolas Thome
210
1
0
02 Jul 2024
Diffusion Spectral Representation for Reinforcement Learning
Dmitry Shribak
Chen-Xiao Gao
Yitong Li
Chenjun Xiao
Bo Dai
DiffM
329
8
0
23 Jun 2024
Deep Dive into Model-free Reinforcement Learning for Biological and Robotic Systems: Theory and Practice
Yusheng Jiao
Feng Ling
Sina Heydari
N. Heess
J. Merel
Eva Kanso
239
3
0
19 May 2024
Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
Aditya A. Ramesh
Kenny Young
Louis Kirsch
Jürgen Schmidhuber
282
1
0
06 May 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
244
0
0
31 Mar 2024
Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control
Minjun Sung
Sambhu H. Karumanchi
Aditya Gahlawat
N. Hovakimyan
222
1
0
21 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Christian Lagemann
Urban Fasel
J. Nathan Kutz
Steven Brunton
AI4CE
311
19
0
14 Mar 2024
Generalizing Cooperative Eco-driving via Multi-residual Task Learning
Vindula Jayawardana
Sirui Li
Cathy Wu
Y. Farid
Kentaro Oguchi
163
4
0
07 Mar 2024
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. D'Oro
Pierre-Luc Bacon
272
5
0
07 Feb 2024
Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence
Jiafei Lyu
Le Wan
Xiu Li
Zongqing Lu
CML
OffRL
300
2
0
05 Feb 2024
Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution
Neural Information Processing Systems (NeurIPS), 2024
Ian Covert
Chanwoo Kim
Su-In Lee
James Zou
Tatsunori Hashimoto
TDI
325
15
0
29 Jan 2024
Bridging State and History Representations: Understanding Self-Predictive RL
International Conference on Learning Representations (ICLR), 2024
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
408
41
0
17 Jan 2024
Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots
Thomas Lampe
A. Abdolmaleki
Sarah Bechtle
Sandy H. Huang
Jost Tobias Springenberg
...
Markus Wulfmeier
Jingwei Zhang
Francesco Nori
N. Heess
Martin Riedmiller
OffRL
199
15
0
18 Dec 2023
A Tractable Inference Perspective of Offline RL
Neural Information Processing Systems (NeurIPS), 2023
Xuejie Liu
Hoang Trung-Dung
Karen Ullrich
Yitao Liang
OffRL
509
1
0
31 Oct 2023
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Neural Information Processing Systems (NeurIPS), 2023
Shenao Zhang
Boyi Liu
Zhaoran Wang
Tuo Zhao
275
4
0
30 Oct 2023
On Representation Complexity of Model-based and Model-free Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Hanlin Zhu
Baihe Huang
Stuart Russell
OffRL
374
5
0
03 Oct 2023
Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned
Han Bao
Raphaël Jungers
Jean-Charles Delvenne
OffRL
193
1
0
28 Sep 2023
Deep Learning in Deterministic Computational Mechanics
L. Herrmann
Stefan Kollmannsberger
AI4CE
PINN
313
1
0
27 Sep 2023
How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization
Neural Information Processing Systems (NeurIPS), 2023
Hai Zhang
Hang Yu
Siyue Tao
Di Zhang
Chang Huang
Hongtu Zhou
Xiao Zhang
Chen Ye
305
12
0
22 Sep 2023
A Review on Robot Manipulation Methods in Human-Robot Interactions
Haoxu Zhang
P. Kebria
Shady M. K. Mohamed
Samson Yu
Saeid Nahavandi
173
1
0
09 Sep 2023
Thinker: Learning to Plan and Act
Neural Information Processing Systems (NeurIPS), 2023
Stephen Chung
Ivan Anokhin
David M. Krueger
LLMAG
OffRL
LRM
294
12
0
27 Jul 2023
Meta-Value Learning: a General Framework for Learning with Learning Awareness
Tim Cooijmans
Milad Aghajohari
Rameswar Panda
234
6
0
17 Jul 2023
Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models
Conference on Robot Learning (CoRL), 2023
T. Westenbroek
Jacob Levy
David Fridovich-Keil
234
0
0
16 Jul 2023
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning
IEEE/CAA Journal of Automatica Sinica (IEEE/CAA JAS), 2023
Hongyu Ding
Yuan-Yan Tang
Qing Wu
Bo Wang
Chunlin Chen
Zhi Wang
339
7
0
16 Jul 2023
Hierarchical Empowerment: Towards Tractable Empowerment-Based Skill Learning
Andrew Levy
Sreehari Rammohan
A. Allievi
S. Niekum
George Konidaris
248
6
0
06 Jul 2023
λ-models: Effective Decision-Aware Reinforcement Learning with Latent Models
C. Voelcker
Arash Ahmadian
Romina Abachi
Igor Gilitschenski
Amir-massoud Farahmand
357
0
0
30 Jun 2023
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Neural Information Processing Systems (NeurIPS), 2023
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
365
6
0
29 Jun 2023
Provably Convergent Policy Optimization via Metric-aware Trust Region Methods
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
220
4
0
25 Jun 2023
Simplified Temporal Consistency Reinforcement Learning
International Conference on Machine Learning (ICML), 2023
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
258
16
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
294
3
0
15 Jun 2023