arXiv: 2005.08068
Model-Augmented Actor-Critic: Backpropagating through Paths

16 May 2020
I. Clavera
Yao Fu
Pieter Abbeel

Papers citing "Model-Augmented Actor-Critic: Backpropagating through Paths"

50 / 58 papers shown
Differentiable Information Enhanced Model-Based Reinforcement Learning
Xiaoyuan Zhang
Xinyan Cai
Bo Liu
Weidong Huang
Song-Chun Zhu
Siyuan Qi
Y. Yang
48
0
0
03 Mar 2025
CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving
Dongkun Zhang
Jiaming Liang
Ke Guo
Sha Lu
Qi Wang
R. Xiong
Zhenwei Miao
Yue Wang
65
1
0
27 Feb 2025
Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation
Eliot Xing
Vernon Luk
Jean Oh
84
0
0
16 Dec 2024
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation
Juntao Dai
Yaodong Yang
Qian Zheng
Gang Pan
OffRL
76
2
0
15 Dec 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
22
2
0
03 Oct 2024
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning
Zakariae El Asri
Olivier Sigaud
Nicolas Thome
31
0
0
02 Jul 2024
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. D'Oro
Pierre-Luc Bacon
34
4
0
07 Feb 2024
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
OffRL
35
2
0
11 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
20
9
0
06 Jan 2024
An introduction to reinforcement learning for neuroscience
Kristopher T. Jensen
OOD
OffRL
AI4CE
23
1
0
13 Nov 2023
Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Shenao Zhang
Boyi Liu
Zhaoran Wang
Tuo Zhao
16
2
0
30 Oct 2023
Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Sohrab Andaz
Carson Eisenach
Dhruv Madeka
Kari Torkkola
Randy Jia
Dean Phillips Foster
Sham Kakade
25
2
0
26 Oct 2023
Deep Learning in Deterministic Computational Mechanics
L. Herrmann
Stefan Kollmannsberger
AI4CE
PINN
40
0
0
27 Sep 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Juho Kannala
J. Pajarinen
OffRL
30
12
0
15 Jun 2023
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Rohan Chitnis
Yingchen Xu
B. Hashemi
Lucas Lehnert
Ürün Dogan
Zheqing Zhu
Olivier Delalleau
OffRL
23
9
0
01 Jun 2023
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning
Daniel Palenicek
M. Lutter
João Carvalho
Jan Peters
18
4
0
07 Mar 2023
Taylor TD-learning
Michele Garibbo
Maxime Robeyns
Laurence Aitchison
OffRL
11
1
0
27 Feb 2023
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
Ruijie Zheng
Xiyao Wang
Huazhe Xu
Furong Huang
33
13
0
02 Feb 2023
Hint assisted reinforcement learning: an application in radio astronomy
S. Yatawatta
14
1
0
10 Jan 2023
Physics-Informed Model-Based Reinforcement Learning
Adithya Ramesh
Balaraman Ravindran
11
10
0
05 Dec 2022
Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation
Linfeng Zhao
Huazhe Xu
Lawson L. S. Wong
29
6
0
24 Oct 2022
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
14
0
0
24 Oct 2022
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare
Homanga Bharadhwaj
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
40
25
0
18 Sep 2022
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
Shen Zhang
15
6
0
16 Sep 2022
Model-based Reinforcement Learning with Multi-step Plan Value Estimation
Hao-Chu Lin
Yihao Sun
Jiajin Zhang
Yang Yu
OffRL
24
7
0
12 Sep 2022
What deep reinforcement learning tells us about human motor learning and vice-versa
Michele Garibbo
Casimir J. H. Ludwig
Nathan Lepora
Laurence Aitchison
16
0
0
23 Aug 2022
Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts
Yuxin Pan
Fangzhen Lin
OffRL
17
3
0
04 Aug 2022
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
11
19
0
25 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL
AI4CE
22
31
0
13 Jul 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
39
101
0
19 Jun 2022
Imitation Learning via Differentiable Physics
Siwei Chen
Xiao Ma
Zhongwen Xu
PINN
AI4CE
14
4
0
10 Jun 2022
Integrating Symmetry into Differentiable Planning with Steerable Convolutions
Linfeng Zhao
Xu Zhu
Lingzhi Kong
Robin G. Walters
Lawson L. S. Wong
20
7
0
08 Jun 2022
Accelerated Policy Learning with Parallel Differentiable Simulation
Jie Xu
Viktor Makoviychuk
Yashraj S. Narang
Fabio Ramos
Wojciech Matusik
Animesh Garg
Miles Macklin
11
84
0
14 Apr 2022
Deep Interactive Motion Prediction and Planning: Playing Games with Motion Prediction Models
J. Vázquez
Alexander Liniger
Wilko Schwarting
Daniela Rus
Luc Van Gool
21
45
0
05 Apr 2022
Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio
Taisuke Kobayashi
17
5
0
18 Mar 2022
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINN
MU
36
219
0
09 Mar 2022
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Che Wang
Xufang Luo
Keith Ross
Dongsheng Li
OffRL
24
49
0
17 Feb 2022
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
9
30
0
16 Dec 2021
On Effective Scheduling of Model-based Reinforcement Learning
Hang Lai
Jian Shen
Weinan Zhang
Yimin Huang
Xingzhi Zhang
Ruiming Tang
Yong Yu
Zhenguo Li
20
18
0
16 Nov 2021
Gradients are Not All You Need
Luke Metz
C. Freeman
S. Schoenholz
Tal Kachman
28
92
0
10 Nov 2021
Contrastive Active Inference
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
9
25
0
19 Oct 2021
On-Policy Model Errors in Reinforcement Learning
Lukas P. Fröhlich
Maksym Lefarov
M. Zeilinger
Felix Berkenkamp
OnRL
49
6
0
15 Oct 2021
Optimistic Reinforcement Learning by Forward Kullback-Leibler Divergence Optimization
Taisuke Kobayashi
20
13
0
27 May 2021
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts
Weinan Zhang
Xihuai Wang
Jian Shen
Ming Zhou
19
35
0
07 May 2021
Optimization Algorithm for Feedback and Feedforward Policies towards Robot Control Robust to Sensing Failures
Taisuke Kobayashi
Kenta Yoshizawa
13
3
0
01 Apr 2021
Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning
A. S. Morgan
Daljeet Nandha
Georgia Chalvatzaki
Carlo D'Eramo
A. Dollar
Jan Peters
32
43
0
25 Mar 2021
Cloth Manipulation Planning on Basis of Mesh Representations with Incomplete Domain Knowledge and Voxel-to-Mesh Estimation
S. Arnold
Daisuke Tanaka
Kimitoshi Yamazaki
16
4
0
15 Mar 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
217
413
0
16 Feb 2021
Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature
Kefan Dong
Jiaqi Yang
Tengyu Ma
24
32
0
08 Feb 2021
OffCon$^3$: What is state of the art anyway?
Philip J. Ball
Stephen J. Roberts
OffRL
13
8
0
27 Jan 2021