Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction. AAAI Conference on Artificial Intelligence (AAAI), 2022.
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation. International Conference on Learning Representations (ICLR), 2019.
Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems. American Control Conference (ACC), 2018.
Duality-free Methods for Stochastic Composition Optimization. IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2017.