ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.08068
  4. Cited By
Model-Augmented Actor-Critic: Backpropagating through Paths

Model-Augmented Actor-Critic: Backpropagating through Paths

16 May 2020
I. Clavera
Yao Fu
Pieter Abbeel
ArXivPDFHTML

Papers citing "Model-Augmented Actor-Critic: Backpropagating through Paths"

8 / 58 papers shown
Title
Iterative Amortized Policy Optimization
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
54
22
0
20 Oct 2020
Episodic Memory for Learning Subjective-Timescale Models
Episodic Memory for Learning Subjective-Timescale Models
Alexey Zakharov
Matthew Crosby
Z. Fountas
6
4
0
03 Oct 2020
On the model-based stochastic value gradient for continuous
  reinforcement learning
On the model-based stochastic value gradient for continuous reinforcement learning
Brandon Amos
Samuel Stanton
Denis Yarats
A. Wilson
10
71
0
28 Aug 2020
Learning Off-Policy with Online Planning
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
24
45
0
23 Aug 2020
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep
  Reinforcement Learning
Adaptive and Multiple Time-scale Eligibility Traces for Online Deep Reinforcement Learning
Taisuke Kobayashi
OffRL
11
7
0
23 Aug 2020
Model-based Reinforcement Learning for Semi-Markov Decision Processes
  with Neural ODEs
Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs
Jianzhun Du
Joseph D. Futoma
Finale Doshi-Velez
25
49
0
29 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy
  Search and Planning
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
25
82
0
15 Jun 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator
  Policy Optimization
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
P. DÓro
Wojciech Ja'skowski
OffRL
14
27
0
29 Apr 2020
Previous
12