Offline Reinforcement Learning with Fisher Divergence Critic Regularization

International Conference on Machine Learning (ICML), 2021
14 March 2021
Ilya Kostrikov
Jonathan Tompson
Rob Fergus
Ofir Nachum
    OffRL

Papers citing "Offline Reinforcement Learning with Fisher Divergence Critic Regularization"

50 / 209 papers shown
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
Yunchang Ma
Tenglong Liu
Yixing Lan
Xin Yin
Changxin Zhang
Xinglong Zhang
Xin Xu
OffRL
294
0
0
12 Nov 2025
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Yixiu Mao
Yun Qu
Qi Wang
Xiangyang Ji
OffRL
185
1
0
04 Nov 2025
Towards Robust Zero-Shot Reinforcement Learning
Kexin Zheng
Lauriane Teyssier
Yinan Zheng
Yu Luo
Xiayuan Zhan
OffRL
403
3
0
17 Oct 2025
Adversarial Fine-tuning in Offline-to-Online Reinforcement Learning for Robust Robot Control
Shingo Ayabe
Hiroshi Kera
K. Kawamoto
AAML, OffRL, OnRL
363
0
0
15 Oct 2025
AOAD-MAT: Transformer-based multi-agent deep reinforcement learning model considering agents' order of action decisions
Shota Takayama
Katsuhide Fujita
150
0
0
15 Oct 2025
Expert or not? assessing data quality in offline reinforcement learning
Arip Asadulaev
Fakhri Karray
Martin Takáč
OffRL
156
0
0
14 Oct 2025
Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption
Longxiang He
Deheng Ye
Junbo Tan
Xueqian Wang
Li Shen
OnRL
379
0
0
29 Sep 2025
Wavelet Fourier Diffuser: Frequency-Aware Diffusion Model for Reinforcement Learning
Yifu Luo
Yongzhe Chang
Xueqian Wang
188
2
0
04 Sep 2025
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
Xiao Huang
Xu Liu
Enze Zhang
T. Yu
Shuai Li
OffRL, OnRL
265
2
0
09 Aug 2025
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
Samin Yeasar Arnob
Scott Fujimoto
Doina Precup
OffRL
286
0
0
20 Jun 2025
Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood
International Conference on Learning Representations (ICLR), 2025
Qingmao Yao
Zhichao Lei
Tianyuan Chen
Ziyue Yuan
Xuefan Chen
Jianxiang Liu
Faguo Wu
Xiao Zhang
OffRL
249
2
0
10 Jun 2025
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL
Qin-Wen Luo
Ming-Kun Xie
Ye-Wen Wang
Sheng-Jun Huang
OffRL
233
5
0
26 May 2025
Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL
Joey Hong
Anca Dragan
Sergey Levine
OffRL, LLMAG, LRM
555
6
0
23 May 2025
Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer
International Joint Conference on Artificial Intelligence (IJCAI), 2025
Minh Hoang Nguyen
Linh Le Pham Van
Thommen George Karimpanal
Sunil Gupta
Hung Le
OffRL, LRM
312
2
0
14 May 2025
Generative Auto-Bidding with Value-Guided Explorations
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Jingtong Gao
Yewen Li
Shuai Mao
Peng Jiang
Nan Jiang
...
Fei Pan
Peng Jiang
Kun Gai
Rui Hu
Xiangyu Zhao
OffRL
531
15
0
20 Apr 2025
Improving Sequential Recommenders through Counterfactual Augmentation of System Exposure
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Ziqi Zhao
Zhaochun Ren
Jiyuan Yang
Zuming Yan
Zihan Wang
Liu Yang
Sudipta Singha Roy
Zhumin Chen
Maarten de Rijke
Xin Xin
CML
350
2
0
18 Apr 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
455
3
0
17 Apr 2025
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Computer Vision and Pattern Recognition (CVPR), 2025
Wei Huang
Qinying Gu
Nanyang Ye
OffRL
267
2
0
04 Apr 2025
Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
Yunkai Gao
Jiaming Guo
Fan Wu
Rui Zhang
OffRL
251
1
0
07 Mar 2025
Eau De $Q$-Network: Adaptive Distillation of Neural Networks in Deep Reinforcement Learning
Théo Vincent
Tim Lukas Faust
Yogesh Tripathi
Jan Peters
Carlo D'Eramo
333
0
0
03 Mar 2025
Behavior Preference Regression for Offline Reinforcement Learning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
241
0
0
02 Mar 2025
B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning
Woojun Kim
Katia Sycara
OffRL
450
1
0
30 Jan 2025
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
International Conference on Machine Learning (ICML), 2024
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
391
6
0
28 Jan 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2024
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
755
0
0
17 Jan 2025
LEASE: Offline Preference-based Reinforcement Learning with High Sample Efficiency
Xiao-Yin Liu
Guotao Li
Xiao-Hu Zhou
Z. Hou
OffRL
397
1
0
30 Dec 2024
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feifei Feng
OffRL
496
10
0
22 Dec 2024
An Investigation of Offline Reinforcement Learning in Factorisable Action Spaces
Alex Beeson
David Ireland
Giovanni Montana
OffRL
414
4
0
17 Nov 2024
A Non-Monolithic Policy Approach of Offline-to-Online Reinforcement Learning
JaeYoon Kim
Junyu Xuan
Christy Jie Liang
F. Hussain
OffRL, OnRL
254
0
0
31 Oct 2024
NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation
Neural Information Processing Systems (NeurIPS), 2024
Momin Haider
Ming Yin
Menglei Zhang
Arpit Gupta
Jing Zhu
Yu-Xiang Wang
OffRL
218
2
0
30 Oct 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Neural Information Processing Systems (NeurIPS), 2024
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
646
19
0
25 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
399
12
0
03 Oct 2024
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
International Conference on Learning Representations (ICLR), 2024
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
432
5
0
02 Oct 2024
Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning
Jingshuai Liu
Alain Andres
Yonghang Jiang
Xichun Luo
Wenmiao Shu
Sotirios A. Tsaftaris
475
0
0
04 Sep 2024
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
295
11
0
27 Aug 2024
Enhancing Reinforcement Learning Through Guided Search
European Conference on Artificial Intelligence (ECAI), 2024
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
403
0
0
19 Aug 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
European Conference on Artificial Intelligence (ECAI), 2024
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
316
3
0
04 Aug 2024
Diffusion Models as Optimizers for Efficient Planning in Offline RL
Renming Huang
Yunqiang Pei
Guoqing Wang
Yangming Zhang
Yang Yang
Peng Wang
H. Shen
OffRL
339
10
0
23 Jul 2024
OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning
Yi-Fan Yao
Zhepeng Cen
Wenhao Ding
Hao-ming Lin
Shiqi Liu
Tingnan Zhang
Wenhao Yu
Ding Zhao
OffRL, OnRL
297
12
0
19 Jul 2024
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Huayu Chen
Kaiwen Zheng
Hang Su
Jun Zhu
390
8
0
12 Jul 2024
ROER: Regularized Optimal Experience Replay
Changling Li
Zhang-Wei Hong
Pulkit Agrawal
Divyansh Garg
Joni Pajarinen
OffRL
237
1
0
04 Jul 2024
ECLIPSE: Expunging Clean-label Indiscriminate Poisons via Sparse Diffusion Purification
Xianlong Wang
Shengshan Hu
Yechao Zhang
Ziqi Zhou
Leo Yu Zhang
Peng Xu
Wei Wan
Hai Jin
AAML
409
5
0
21 Jun 2024
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing
Xinbo Zhao
Yingxue Zhang
Xin Zhang
Yu Yang
Yiqun Xie
Yanhua Li
Jun Luo
OffRL
221
5
0
20 Jun 2024
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning
Xuemin Hu
Shen Li
Yingfen Xu
Bo Tang
Long Chen
247
1
0
13 Jun 2024
Integrating Domain Knowledge for handling Limited Data in Offline RL
Briti Gangopadhyay
Zhao Wang
Jia-Fong Yeh
Shingo Takamatsu
OffRL
253
1
0
11 Jun 2024
Strategically Conservative Q-Learning
Yutaka Shimizu
Joey Hong
Sergey Levine
Masayoshi Tomizuka
OffRL, OnRL
291
1
0
06 Jun 2024
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Yu Zhang
Rui Yu
Zhipeng Yao
Wenyuan Zhang
Jun Wang
Liming Zhang
OffRL
306
0
0
05 Jun 2024
Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning
Tenglong Liu
Yang Li
Yixing Lan
Hao Gao
Wei Pan
Xin Xu
OffRL
403
14
0
30 May 2024
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization
Longxiang He
Li Shen
Xueqian Wang
355
13
0
28 May 2024
Federated Offline Policy Optimization with Dual Regularization
Sheng Yue
Zerui Qin
Xingyuan Hua
Yongheng Deng
Ju Ren
OffRL
355
2
0
24 May 2024
State-Constrained Offline Reinforcement Learning
Charles A. Hepburn
Yue Jin
Giovanni Montana
OffRL
407
1
0
23 May 2024