Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.10783
Cited By
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
21 June 2021
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
J. Pineau
Kee-Eung Kim
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation"
10 / 10 papers shown
Title
Dual Alignment Maximin Optimization for Offline Model-based RL
Chi Zhou
Wang Luo
Haoran Li
Congying Han
Tiande Guo
Zicheng Zhang
OffRL
64
0
0
02 Feb 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
94
2
0
28 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
86
0
0
31 Dec 2024
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
OffRL
33
1
0
20 May 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
16
0
0
18 Jan 2024
Stackelberg Batch Policy Learning
Wenzhuo Zhou
Annie Qu
OffRL
27
0
0
28 Sep 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
30
15
0
30 Jan 2023
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
Yecheng Jason Ma
Andrew Shen
Dinesh Jayaraman
Osbert Bastani
OffRL
19
32
0
04 Feb 2022
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
13
31
0
14 Oct 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,949
0
04 May 2020
1