ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.07484
  4. Cited By
Mutual Information Regularized Offline Reinforcement Learning
v1v2v3 (latest)

Mutual Information Regularized Offline Reinforcement Learning

Neural Information Processing Systems (NeurIPS), 2022
14 October 2022
Xiao Ma
Bingyi Kang
Zhongwen Xu
Min Lin
Shuicheng Yan
    OffRL
ArXiv (abs)PDFHTMLGithub (7★)

Papers citing "Mutual Information Regularized Offline Reinforcement Learning"

7 / 7 papers shown
Maximum Total Correlation Reinforcement Learning
Maximum Total Correlation Reinforcement Learning
Bang You
Puze Liu
Huaping Liu
Jan Peters
Oleg Arenz
203
2
0
22 May 2025
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function
  in Offline Reinforcement Learning
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Yu Zhang
Rui Yu
Zhipeng Yao
Wenyuan Zhang
Jun Wang
Liming Zhang
OffRL
282
0
0
05 Jun 2024
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement
  Learning
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
Tianle Zhang
Jiayi Guan
Lin Zhao
Yihang Li
Dongjiang Li
...
Lei Sun
Yue Chen
Xuelong Wei
Lusong Li
Xiaodong He
253
2
0
29 May 2024
Exclusively Penalized Q-learning for Offline Reinforcement Learning
Exclusively Penalized Q-learning for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Junghyuk Yeom
Yonghyeon Jo
Jungmo Kim
Sanghyeon Lee
Seungyul Han
OffRL
304
3
0
23 May 2024
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning
  via Causal Normalizing Flows
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows
Minjae Cho
Jonathan P. How
Chuangchuang Sun
OODDOffRL
199
1
0
06 May 2024
Understanding, Predicting and Better Resolving Q-Value Divergence in
  Offline-RL
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RLNeural Information Processing Systems (NeurIPS), 2023
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
OffRL
359
21
0
06 Oct 2023
Efficient Diffusion Policies for Offline Reinforcement Learning
Efficient Diffusion Policies for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Bingyi Kang
Xiao Ma
Chao Du
Tianyu Pang
Shuicheng Yan
OffRL
353
117
0
31 May 2023
1