Rethinking ValueDice: Does It Really Improve Performance?

5 February 2022
Ziniu Li
Tian Xu
Yang Yu
Zhi-Quan Luo
    OffRL

Papers citing "Rethinking ValueDice: Does It Really Improve Performance?"

15 / 15 papers shown
Hierarchical Imitation Learning of Team Behavior from Heterogeneous Demonstrations
Adaptive Agents and Multi-Agent Systems (AAMAS), 2025
Sangwon Seo
Vaibhav Unhelkar
215
3
0
24 Feb 2025
IDIL: Imitation Learning of Intent-Driven Expert Behavior
Sangwon Seo
Vaibhav Unhelkar
96
6
0
25 Apr 2024
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences
Chen Wang
S. Erfani
T. Alpcan
Christopher Leckie
OffRL
146
4
0
07 Feb 2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
Hee-Jun Ahn
Seong-Woong Shim
Byung-Jun Lee
144
0
0
18 Jan 2024
DiffAIL: Diffusion Adversarial Imitation Learning
Bingzheng Wang
Guoqiang Wu
Teng Pang
Yan Zhang
Yilong Yin
174
18
0
11 Dec 2023
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories
Neural Information Processing Systems (NeurIPS), 2023
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRL
252
5
0
02 Nov 2023
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
Fan Luo
Tian Xu
Xingchen Cao
Yang Yu
OffRL
211
13
0
09 Oct 2023
Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Conference on Uncertainty in Artificial Intelligence (UAI), 2023
Tian Xu
Ziniu Li
Yang Yu
Zhi-Quan Luo
129
11
0
11 Jun 2023
Coherent Soft Imitation Learning
Neural Information Processing Systems (NeurIPS), 2023
Joe Watson
Sandy H. Huang
Nicholas Heess
208
16
0
25 May 2023
Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization
Zhoujian Sun
Chenyang Zhao
Zheng-Wei Huang
Nai Ding
OffRL
136
2
0
06 May 2023
A Coupled Flow Approach to Imitation Learning
International Conference on Machine Learning (ICML), 2023
G. Freund
Elad Sarafian
Sarit Kraus
OOD
174
14
0
29 Apr 2023
Theoretical Analysis of Offline Imitation With Supplementary Dataset
Ziniu Li
Tian Xu
Yang Yu
Zhi-Quan Luo
OffRL
138
2
0
27 Jan 2023
Planning for Sample Efficient Imitation Learning
Neural Information Processing Systems (NeurIPS), 2022
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
193
28
0
18 Oct 2022
Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis
Tian Xu
Ziniu Li
Yang Yu
Zhi-Quan Luo
119
10
0
03 Aug 2022
LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation
Neural Information Processing Systems (NeurIPS), 2022
Geon-hyeong Kim
Jongmin Lee
Youngsoo Jang
Hongseok Yang
Kyungmin Kim
OffRL
279
23
0
28 Feb 2022