Reinforcement Learning by Value Gradients

25 March 2008
Michael Fairbank
    SSL

Papers citing "Reinforcement Learning by Value Gradients"

14 / 14 papers shown
Deep Generative Models for Decision-Making and Control
Michael Janner
333
3
0
15 Jun 2023
Efficient Planning in a Compact Latent Action Space
International Conference on Learning Representations (ICLR), 2022
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktäschel
Edward Grefenstette
Yuandong Tian
OffRL
360
58
0
22 Aug 2022
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Zhiwei Xu
Dapeng Li
Bin Zhang
Yuan Zhan
Yunru Bai
Guoliang Fan
OffRL
304
12
0
20 Apr 2022
Model Based Meta Learning of Critics for Policy Gradients
Sarah Bechtle
Ludovic Righetti
Franziska Meier
OffRL
122
0
0
05 Apr 2022
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Neural Information Processing Systems (NeurIPS), 2021
Michael Janner
Qiyang Li
Sergey Levine
OffRL
856
823
0
03 Jun 2021
On the role of planning in model-based deep reinforcement learning
Jessica B. Hamrick
A. Friesen
Feryal M. P. Behbahani
A. Guez
Fabio Viola
Sims Witherspoon
Thomas W. Anthony
Lars Buesing
Petar Veličković
T. Weber
OffRL
452
74
0
08 Nov 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
336
21
0
23 Oct 2020
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization
Neural Information Processing Systems (NeurIPS), 2020
P. D'Oro
Wojciech Jaśkowski
OffRL
261
30
0
29 Apr 2020
The Gambler's Problem and Beyond
International Conference on Learning Representations (ICLR), 2019
Baoxiang Wang
Shuai Li
Jiajin Li
S. Chan
348
0
0
31 Dec 2019
Deterministic Value-Policy Gradients
AAAI Conference on Artificial Intelligence (AAAI), 2019
Qingpeng Cai
L. Pan
Pingzhong Tang
205
1
0
09 Sep 2019
Deterministic Policy Gradients With General State Transitions
Qingpeng Cai
Ling Pan
Pingzhong Tang
OffRL
175
2
0
10 Jul 2018
The Importance of Clipping in Neurocontrol by Direct Gradient Descent on the Cost-to-Go Function and in Adaptive Dynamic Programming
Michael Fairbank
136
1
0
22 Feb 2013
The Divergence of Reinforcement Learning Algorithms with Value-Iteration and Function Approximation
IEEE International Joint Conference on Neural Networks (IJCNN), 2011
Michael Fairbank
Eduardo Alonso
198
35
0
22 Jul 2011
The Local Optimality of Reinforcement Learning by Value Gradients, and its Relationship to Policy Gradient Learning
Michael Fairbank
Eduardo Alonso
200
12
0
02 Jan 2011