ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.03469
  4. Cited By
Data-Efficient Policy Evaluation Through Behavior Policy Search

Data-Efficient Policy Evaluation Through Behavior Policy Search

International Conference on Machine Learning (ICML), 2017
12 June 2017
Josiah P. Hanna
Philip S. Thomas
Peter Stone
S. Niekum
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Data-Efficient Policy Evaluation Through Behavior Policy Search"

16 / 16 papers shown
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive
  Approach
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive ApproachNeural Information Processing Systems (NeurIPS), 2024
Riccardo Poiani
Nicole Nobili
Alberto Maria Metelli
Marcello Restelli
190
3
0
17 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
917
1
0
08 Oct 2024
Doubly Optimal Policy Evaluation for Reinforcement Learning
Doubly Optimal Policy Evaluation for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2024
Shuze Liu
Claire Chen
Shangtong Zhang
OffRL
491
5
0
03 Oct 2024
Adaptive Exploration for Data-Efficient General Value Function
  Evaluations
Adaptive Exploration for Data-Efficient General Value Function EvaluationsNeural Information Processing Systems (NeurIPS), 2024
Arushi Jain
Josiah P. Hanna
Doina Precup
260
2
0
13 May 2024
Policy Gradient with Active Importance Sampling
Policy Gradient with Active Importance Sampling
Matteo Papini
Giorgio Manganini
Alberto Maria Metelli
Marcello Restelli
OffRL
204
7
0
09 May 2024
Efficient Open-world Reinforcement Learning via Knowledge Distillation
  and Autonomous Rule Discovery
Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery
Ekaterina Nikonova
Cheng Xue
Jochen Renz
CLL
260
1
0
24 Nov 2023
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling
ReVar: Strengthening Policy Evaluation via Reduced Variance SamplingConference on Uncertainty in Artificial Intelligence (UAI), 2022
Subhojyoti Mukherjee
Josiah P. Hanna
Robert D. Nowak
OffRL
384
18
0
09 Mar 2022
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in
  Reinforcement Learning
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2021
Rujie Zhong
Duohan Zhang
Lukas Schafer
Stefano V. Albrecht
Josiah P. Hanna
OODOffRL
298
16
0
29 Nov 2021
Deep Reinforcement Learning for the Control of Robotic Manipulation: A
  Focussed Mini-Review
Deep Reinforcement Learning for the Control of Robotic Manipulation: A Focussed Mini-Review
Rongrong Liu
F. Nageotte
P. Zanne
M. de Mathelin
Birgitta Dresp
224
190
0
08 Feb 2021
Beyond variance reduction: Understanding the true impact of baselines on
  policy optimization
Beyond variance reduction: Understanding the true impact of baselines on policy optimizationInternational Conference on Machine Learning (ICML), 2020
Wesley Chung
Valentin Thomas
Marlos C. Machado
Nicolas Le Roux
OffRL
523
35
0
31 Aug 2020
Causality and Batch Reinforcement Learning: Complementary Approaches To
  Planning In Unknown Domains
Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains
James Bannon
Bradford T. Windsor
Wenbo Song
Tao Li
CMLOODOffRL
196
22
0
03 Jun 2020
Reinforcement Learning Architectures: SAC, TAC, and ESAC
Reinforcement Learning Architectures: SAC, TAC, and ESAC
Ala’eddin Masadeh
Zhengdao Wang
A. Kamal
83
0
0
05 Apr 2020
Curriculum Learning for Reinforcement Learning Domains: A Framework and
  Survey
Curriculum Learning for Reinforcement Learning Domains: A Framework and SurveyJournal of machine learning research (JMLR), 2020
Sanmit Narvekar
Bei Peng
Matteo Leonetti
Jivko Sinapov
Matthew E. Taylor
Peter Stone
ODL
579
666
0
10 Mar 2020
TuneNet: One-Shot Residual Tuning for System Identification and
  Sim-to-Real Robot Task Transfer
TuneNet: One-Shot Residual Tuning for System Identification and Sim-to-Real Robot Task TransferConference on Robot Learning (CoRL), 2019
Adam Allevato
Elaine Schaertl Short
Mitch Pryor
A. Thomaz
496
61
0
25 Jul 2019
Provably Efficient Q-Learning with Low Switching Cost
Provably Efficient Q-Learning with Low Switching CostNeural Information Processing Systems (NeurIPS), 2019
Yu Bai
Tengyang Xie
Nan Jiang
Yu Wang
321
102
0
30 May 2019
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Josiah P. Hanna
S. Niekum
Peter Stone
OffRL
371
71
0
04 Jun 2018
1
Page 1 of 1