Batch Stationary Distribution Estimation

International Conference on Machine Learning (ICML), 2020
2 March 2020
Junfeng Wen
Bo Dai
Lihong Li
Dale Schuurmans
    OffRL

Papers citing "Batch Stationary Distribution Estimation"

20 / 20 papers shown
One-Step Flow Policy Mirror Descent
Tianyi Chen
Haitong Ma
Na Li
Kai Wang
Bo Dai
352
5
0
31 Jul 2025
Scalable Offline Reinforcement Learning for Mean Field Games
Axel Brunnbauer
Julian Lemmel
Z. Babaiee
Sophie A. Neubauer
Radu Grosu
OffRL
282
0
0
23 Oct 2024
A Comprehensive Survey on Rare Event Prediction
ACM Computing Surveys (ACM Comput. Surv.), 2023
Chathurangi Shyalika
Ruwan Wickramarachchi
A. Sheth
AI4TS
307
51
0
20 Sep 2023
Model-based Offline Policy Optimization with Adversarial Network
European Conference on Artificial Intelligence (ECAI), 2023
Junming Yang
Xingguo Chen
Shengyuan Wang
Bolei Zhang
OffRL
216
4
0
05 Sep 2023
Inexact iterative numerical linear algebra for neural network-based spectral estimation and rare-event prediction
Journal of Chemical Physics (JCP), 2023
J. Strahan
Spencer C. Guo
Chatipat Lorpaiboon
Aaron R Dinner
Jonathan Weare
383
15
0
22 Mar 2023
Nonparametric Density Estimation under Distribution Drift
International Conference on Machine Learning (ICML), 2023
Alessio Mazzetto
E. Upfal
350
5
0
05 Feb 2023
Variational Latent Branching Model for Off-Policy Evaluation
International Conference on Learning Representations (ICLR), 2023
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
402
7
0
28 Jan 2023
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State Abstraction
AAAI Conference on Artificial Intelligence (AAAI), 2022
Brahma S. Pavse
Josiah P. Hanna
OffRL
227
9
0
14 Dec 2022
A Unified Framework for Alternating Offline Model Training and Policy Learning
Neural Information Processing Systems (NeurIPS), 2022
Shentao Yang
Shujian Zhang
Yihao Feng
Mi Zhou
OffRL
321
17
0
12 Oct 2022
Continual Learning In Environments With Polynomial Mixing Times
Matthew D Riemer
Sharath Chandra Raparthy
Ignacio Cases
G. Subbaraj
M. P. Touzel
Irina Rish
CLL
245
16
0
13 Dec 2021
GFlowNet Foundations
Yoshua Bengio
Salem Lahlou
T. Deleu
J. E. Hu
Mo Tiwari
Emmanuel Bengio
512
294
0
17 Nov 2021
Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning
Lingwei Zhu
Toshinori Kitamura
Takamitsu Matsubara
OffRL
205
1
0
13 Jul 2021
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
International Conference on Learning Representations (ICLR), 2021
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
300
52
0
28 Apr 2021
Benchmarks for Deep Off-Policy Evaluation
International Conference on Learning Representations (ICLR), 2021
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
280
111
0
30 Mar 2021
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds
International Conference on Learning Representations (ICLR), 2021
Yihao Feng
Ziyang Tang
Na Zhang
Qiang Liu
OffRL
295
13
0
09 Mar 2021
Provably Good Batch Reinforcement Learning Without Great Exploration
Yao Liu
Adith Swaminathan
Alekh Agarwal
Emma Brunskill
OffRL
406
109
0
16 Jul 2020
Learning and Planning in Average-Reward Markov Decision Processes
Yi Wan
A. Naik
R. Sutton
OffRL
314
81
0
29 Jun 2020
A maximum-entropy approach to off-policy evaluation in average-reward MDPs
N. Lazić
Dong Yin
Mehrdad Farajtabar
Nir Levine
Dilan Görür
Chris Harris
Dale Schuurmans
OffRL
230
13
0
17 Jun 2020
Deep Reinforcement and InfoMax Learning
Neural Information Processing Systems (NeurIPS), 2020
Bogdan Mazoure
Rémi Tachet des Combes
T. Doan
Philip Bachman
R. Devon Hjelm
AI4CE
349
114
0
12 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
1.3K
2,492
0
04 May 2020