ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2001.02435
  4. Cited By
A Nonparametric Off-Policy Policy Gradient
v1v2v3 (latest)

A Nonparametric Off-Policy Policy Gradient

International Conference on Artificial Intelligence and Statistics (AISTATS), 2020
8 January 2020
Samuele Tosatto
João Carvalho
Hany Abdulsamad
Jan Peters
    OffRL
ArXiv (abs)PDFHTML

Papers citing "A Nonparametric Off-Policy Policy Gradient"

6 / 6 papers shown
A Temporal-Difference Approach to Policy Gradient Estimation
A Temporal-Difference Approach to Policy Gradient EstimationInternational Conference on Machine Learning (ICML), 2022
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
491
3
0
04 Feb 2022
Optimal Estimation of Off-Policy Policy Gradient via Double Fitted
  Iteration
Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Chengzhuo Ni
Ruiqi Zhang
Xiang Ji
Xuezhou Zhang
Mengdi Wang
OffRL
394
1
0
31 Jan 2022
Contextual Latent-Movements Off-Policy Optimization for Robotic
  Manipulation Skills
Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation SkillsIEEE International Conference on Robotics and Automation (ICRA), 2020
Samuele Tosatto
Georgia Chalvatzaki
Jan Peters
256
13
0
26 Oct 2020
Dimensionality Reduction of Movement Primitives in Parameter Space
Dimensionality Reduction of Movement Primitives in Parameter Space
Samuele Tosatto
Jonas Stadtmueller
Jan Peters
66
0
0
26 Feb 2020
Statistically Efficient Off-Policy Policy Gradients
Statistically Efficient Off-Policy Policy GradientsInternational Conference on Machine Learning (ICML), 2020
Nathan Kallus
Masatoshi Uehara
OffRL
347
42
0
10 Feb 2020
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under
  Lipschitz Assumptions
An Upper Bound of the Bias of Nadaraya-Watson Kernel Regression under Lipschitz Assumptions
Samuele Tosatto
R. Akrour
Jan Peters
268
4
0
29 Jan 2020
1
Page 1 of 1