ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.05147
  4. Cited By
Model-free Policy Learning with Reward Gradients
v1v2v3v4 (latest)

Model-free Policy Learning with Reward Gradients

International Conference on Artificial Intelligence and Statistics (AISTATS), 2021
9 March 2021
Qingfeng Lan
Samuele Tosatto
Homayoon Farrahi
Rupam Mahmood
ArXiv (abs)PDFHTML

Papers citing "Model-free Policy Learning with Reward Gradients"

6 / 6 papers shown
Title
Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization
Learning Intractable Multimodal Policies with Reparameterization and Diversity Regularization
Ziqi Wang
Jiashun Liu
L. Pan
167
0
0
03 Nov 2025
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Revisiting Sparse Rewards for Goal-Reaching Reinforcement Learning
Gautham Vasan
Yan Wang
Fahim Shahriar
James Bergstra
Martin Jägersand
A. R. Mahmood
213
11
0
29 Jun 2024
Revisiting Scalable Hessian Diagonal Approximations for Applications in
  Reinforcement Learning
Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning
Mohamed Elsayed
Homayoon Farrahi
Felix Dangel
A. Rupam Mahmood
256
6
0
05 Jun 2024
Learning to Optimize for Reinforcement Learning
Learning to Optimize for Reinforcement Learning
Qingfeng Lan
Rupam Mahmood
Shuicheng Yan
Zhongwen Xu
OffRL
330
11
0
03 Feb 2023
Asynchronous Reinforcement Learning for Real-Time Control of Physical
  Robots
Asynchronous Reinforcement Learning for Real-Time Control of Physical RobotsIEEE International Conference on Robotics and Automation (ICRA), 2022
Yufeng Yuan
Rupam Mahmood
OffRL
283
24
0
23 Mar 2022
A Temporal-Difference Approach to Policy Gradient Estimation
A Temporal-Difference Approach to Policy Gradient EstimationInternational Conference on Machine Learning (ICML), 2022
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
308
2
0
04 Feb 2022
1