ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.09072
  4. Cited By
GenDICE: Generalized Offline Estimation of Stationary Values

GenDICE: Generalized Offline Estimation of Stationary Values

International Conference on Learning Representations (ICLR), 2020
21 February 2020
Ruiyi Zhang
Bo Dai
Lihong Li
Dale Schuurmans
    OffRL
ArXiv (abs)PDFHTML

Papers citing "GenDICE: Generalized Offline Estimation of Stationary Values"

50 / 127 papers shown
Density-Ratio Weighted Behavioral Cloning: Learning Control Policies from Corrupted Datasets
Density-Ratio Weighted Behavioral Cloning: Learning Control Policies from Corrupted Datasets
Shriram Karpoora Sundara Pandian
Ali Baheri
OffRL
216
0
0
01 Oct 2025
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Is Exploration or Optimization the Problem for Deep Reinforcement Learning?
Glen Berseth
OffRL
196
1
0
02 Aug 2025
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
Hao Hu
Xinqi Wang
Simon S. Du
OffRL
386
0
0
10 Jun 2025
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation
Hossein Goli
Michael Gimelfarb
Nathan Samuel de Lara
Haruki Nishimura
Masha Itkina
Florian Shkurti
OffRL
392
2
0
27 May 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
455
3
0
17 Apr 2025
Average-DICE: Stationary Distribution Correction by Regression
Average-DICE: Stationary Distribution Correction by Regression
Fengdi Che
Bryan Chan
Chen Ma
A. R. Mahmood
OffRL
237
0
0
03 Mar 2025
SimuDICE: Offline Policy Optimization Through World Model Updates and
  DICE Estimation
SimuDICE: Offline Policy Optimization Through World Model Updates and DICE Estimation
Catalin E. Brita
Stephan Bongers
F. Oliehoek
OffRL
336
0
0
09 Dec 2024
Concept-driven Off Policy Evaluation
Concept-driven Off Policy Evaluation
Ritam Majumdar
Jack Teversham
Sonali Parbhoo
OffRL
345
0
0
28 Nov 2024
Scalable Offline Reinforcement Learning for Mean Field Games
Scalable Offline Reinforcement Learning for Mean Field Games
Axel Brunnbauer
Julian Lemmel
Z. Babaiee
Sophie A. Neubauer
Radu Grosu
OffRL
275
0
0
23 Oct 2024
Primal-Dual Spectral Representation for Off-policy Evaluation
Primal-Dual Spectral Representation for Off-policy EvaluationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Yang Hu
Tianyi Chen
Na Li
Kai Wang
Bo Dai
OffRL
320
4
0
23 Oct 2024
Solving Continual Offline RL through Selective Weights Activation on
  Aligned Spaces
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
Hechang Chen
Yi Chang
Dacheng Tao
Lichao Sun
OffRL
250
2
0
21 Oct 2024
Abstract Reward Processes: Leveraging State Abstraction for Consistent
  Off-Policy Evaluation
Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy EvaluationNeural Information Processing Systems (NeurIPS), 2024
Shreyas Chaudhari
Ameet Deshpande
Bruno Castro da Silva
Philip S. Thomas
OffRL
265
1
0
03 Oct 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of
  Value and Policy Churn
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy ChurnNeural Information Processing Systems (NeurIPS), 2024
Hongyao Tang
Glen Berseth
OffRL
348
11
0
07 Sep 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement
  Learning
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
420
37
0
29 Jul 2024
A Dual Approach to Imitation Learning from Observations with Offline
  Datasets
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
306
9
0
13 Jun 2024
Towards Provable Log Density Policy Gradient
Towards Provable Log Density Policy Gradient
Pulkit Katdare
Anant Joshi
Katherine Driggs-Campbell
311
0
0
03 Mar 2024
Offline Multi-task Transfer RL with Representational Penalization
Offline Multi-task Transfer RL with Representational Penalization
Avinandan Bose
S. S. Du
Maryam Fazel
OffRL
341
13
0
19 Feb 2024
The Virtues of Pessimism in Inverse Reinforcement Learning
David Wu
Gokul Swamy
J. Andrew Bagnell
Zhiwei Steven Wu
Sanjiban Choudhury
354
0
0
04 Feb 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via
  Orthogonal-gradient Update
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
380
23
0
01 Feb 2024
Learning from Sparse Offline Datasets via Conservative Density
  Estimation
Learning from Sparse Offline Datasets via Conservative Density EstimationInternational Conference on Learning Representations (ICLR), 2024
Zhepeng Cen
Zuxin Liu
Zitong Wang
Yi-Fan Yao
Henry Lam
Ding Zhao
OffRL
307
12
0
16 Jan 2024
Conservative Exploration for Policy Optimization via Off-Policy Policy
  Evaluation
Conservative Exploration for Policy Optimization via Off-Policy Policy Evaluation
Paul Daoudi
Mathias Formoso
Othman Gaizi
Achraf Azize
Evrard Garcelon
OffRL
233
0
0
24 Dec 2023
SCOPE-RL: A Python Library for Offline Reinforcement Learning and
  Off-Policy Evaluation
SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRLELM
535
5
0
30 Nov 2023
When is Off-Policy Evaluation Useful? A Data-Centric Perspective
When is Off-Policy Evaluation Useful? A Data-Centric Perspective
Hao Sun
Alex J. Chan
Nabeel Seedat
Alihan Huyuk
M. Schaar
ELMOffRL
331
0
0
23 Nov 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online
  Reinforcement Learning
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
386
30
0
27 Oct 2023
Off-Policy Evaluation for Human Feedback
Off-Policy Evaluation for Human FeedbackNeural Information Processing Systems (NeurIPS), 2023
Qitong Gao
Ge Gao
Juncheng Dong
Vahid Tarokh
Min Chi
Miroslav Pajic
OffRL
383
9
0
11 Oct 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced
  Datasets
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsNeural Information Processing Systems (NeurIPS), 2023
Zhang-Wei Hong
Aviral Kumar
Sathwik Karnik
Abhishek Bhandwaldar
Akash Srivastava
Joni Pajarinen
Romain Laroche
Abhishek Gupta
Pulkit Agrawal
OffRL
265
29
0
06 Oct 2023
Robust Offline Reinforcement Learning -- Certify the Confidence Interval
Aayush Mishra
Simon S. Du
OffRL
380
0
0
28 Sep 2023
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified
  Error Quantification Framework
Distributional Shift-Aware Off-Policy Interval Estimation: A Unified Error Quantification Framework
Wenzhuo Zhou
Yuhan Li
Ruoqing Zhu
Annie Qu
OffRL
327
7
0
23 Sep 2023
Marginalized Importance Sampling for Off-Environment Policy Evaluation
Marginalized Importance Sampling for Off-Environment Policy EvaluationConference on Robot Learning (CoRL), 2023
Pulkit Katdare
Nan Jiang
Katherine Driggs-Campbell
OffRL
394
4
0
04 Sep 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing
  Posterior Predictability
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior PredictabilityNeural Information Processing Systems (NeurIPS), 2023
Chuning Zhu
Max Simchowitz
Siri Gadipudi
Abhishek Gupta
375
16
0
31 Aug 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Benchmarking Offline Reinforcement Learning on Real-Robot HardwareInternational Conference on Learning Representations (ICLR), 2023
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
339
38
0
28 Jul 2023
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning
Akash Velu
Skanda Vaidyanath
Dilip Arumugam
OffRL
340
4
0
21 Jul 2023
Value-aware Importance Weighting for Off-policy Reinforcement Learning
Value-aware Importance Weighting for Off-policy Reinforcement Learning
Kristopher De Asis
Eric Graves
R. Sutton
OffRL
268
3
0
27 Jun 2023
Self-Supervised Reinforcement Learning that Transfers using Random
  Features
Self-Supervised Reinforcement Learning that Transfers using Random FeaturesNeural Information Processing Systems (NeurIPS), 2023
Boyuan Chen
Chuning Zhu
Pulkit Agrawal
Jianchao Tan
Abhishek Gupta
OffRLSSL
293
13
0
26 May 2023
A Survey of Demonstration Learning
A Survey of Demonstration Learning
André Rosa de Sousa Porfírio Correia
Luís A. Alexandre
OffRL
259
36
0
20 Mar 2023
Uncertainty-Aware Instance Reweighting for Off-Policy Learning
Uncertainty-Aware Instance Reweighting for Off-Policy LearningNeural Information Processing Systems (NeurIPS), 2023
Xiaoying Zhang
Junpu Chen
Hongning Wang
Hong Xie
Yang Liu
John C. S. Lui
Hang Li
OffRL
303
4
0
11 Mar 2023
Offline Imitation Learning with Suboptimal Demonstrations via Relaxed
  Distribution Matching
Offline Imitation Learning with Suboptimal Demonstrations via Relaxed Distribution MatchingAAAI Conference on Artificial Intelligence (AAAI), 2023
Lantao Yu
Tianhe Yu
Jiaming Song
Willie Neiswanger
Stefano Ermon
OffRL
283
28
0
05 Mar 2023
Hallucinated Adversarial Control for Conservative Offline Policy
  Evaluation
Hallucinated Adversarial Control for Conservative Offline Policy EvaluationConference on Uncertainty in Artificial Intelligence (UAI), 2023
Jonas Rothfuss
Bhavya Sukhija
Tobias Birchler
Parnian Kassraie
Andreas Krause
OffRL
274
14
0
02 Mar 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation
  Learning
Dual RL: Unification and New Methods for Reinforcement and Imitation LearningInternational Conference on Learning Representations (ICLR), 2023
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
409
45
0
16 Feb 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Constrained Decision Transformer for Offline Safe Reinforcement LearningInternational Conference on Machine Learning (ICML), 2023
Zuxin Liu
Zijian Guo
Yi-Fan Yao
Zhepeng Cen
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
341
79
0
14 Feb 2023
A Reinforcement Learning Framework for Dynamic Mediation Analysis
A Reinforcement Learning Framework for Dynamic Mediation AnalysisInternational Conference on Machine Learning (ICML), 2023
Linjuan Ge
Jitao Wang
C. Shi
Zhanghua Wu
Rui Song
393
6
0
31 Jan 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline
  Reinforcement Learning
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
523
20
0
30 Jan 2023
Variational Latent Branching Model for Off-Policy Evaluation
Variational Latent Branching Model for Off-Policy EvaluationInternational Conference on Learning Representations (ICLR), 2023
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
402
7
0
28 Jan 2023
Generalized Munchausen Reinforcement Learning using Tsallis KL
  Divergence
Generalized Munchausen Reinforcement Learning using Tsallis KL DivergenceNeural Information Processing Systems (NeurIPS), 2023
Lingwei Zhu
Zheng Chen
Takamitsu Matsubara
Martha White
307
3
0
27 Jan 2023
Model-based Offline Reinforcement Learning with Local Misspecification
Model-based Offline Reinforcement Learning with Local MisspecificationAAAI Conference on Artificial Intelligence (AAAI), 2023
Kefan Dong
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
258
6
0
26 Jan 2023
An Instrumental Variable Approach to Confounded Off-Policy Evaluation
An Instrumental Variable Approach to Confounded Off-Policy EvaluationInternational Conference on Machine Learning (ICML), 2022
Yang Xu
Jin Zhu
C. Shi
Shuang Luo
R. Song
OffRL
361
24
0
29 Dec 2022
Offline Policy Optimization in RL with Variance Regularizaton
Offline Policy Optimization in RL with Variance Regularizaton
Riashat Islam
Samarth Sinha
Homanga Bharadhwaj
Samin Yeasar Arnob
Zhuoran Yang
Animesh Garg
Zhaoran Wang
Lihong Li
Doina Precup
OffRL
168
0
0
29 Dec 2022
Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality
Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality
Ying Jin
Zhimei Ren
Zhuoran Yang
Zhaoran Wang
OffRL
567
31
0
19 Dec 2022
Scaling Marginalized Importance Sampling to High-Dimensional
  State-Spaces via State Abstraction
Scaling Marginalized Importance Sampling to High-Dimensional State-Spaces via State AbstractionAAAI Conference on Artificial Intelligence (AAAI), 2022
Brahma S. Pavse
Josiah P. Hanna
OffRL
224
9
0
14 Dec 2022
When is Realizability Sufficient for Off-Policy Reinforcement Learning?
When is Realizability Sufficient for Off-Policy Reinforcement Learning?International Conference on Machine Learning (ICML), 2022
Andrea Zanette
OffRL
362
16
0
10 Nov 2022
123
Next
Page 1 of 3