ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.06431
  4. Cited By
Offline Reinforcement Learning as Anti-Exploration

Offline Reinforcement Learning as Anti-Exploration

11 June 2021
Shideh Rezaeifar
Robert Dadashi
Nino Vieillard
Léonard Hussenot
Olivier Bachem
Olivier Pietquin
M. Geist
    OffRL
ArXivPDFHTML

Papers citing "Offline Reinforcement Learning as Anti-Exploration"

36 / 36 papers shown
Title
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu
Sili Huang
Z. Yang
Shengchao Hu
Li Shen
H. Chen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
OffRL
115
0
0
03 May 2025
Hypercube Policy Regularization Framework for Offline Reinforcement
  Learning
Hypercube Policy Regularization Framework for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
OffRL
18
0
0
07 Nov 2024
Offline Model-Based Reinforcement Learning with Anti-Exploration
Offline Model-Based Reinforcement Learning with Anti-Exploration
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
49
0
0
20 Aug 2024
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement
  Learning
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning
Minjae Cho
Chuangchuang Sun
OffRL
32
0
0
17 Jul 2024
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning
  via Causal Normalizing Flows
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows
Minjae Cho
Jonathan P. How
Chuangchuang Sun
OODD
OffRL
30
1
0
06 May 2024
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline
  Reinforcement Learning
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
29
9
0
30 Apr 2024
Offline Reinforcement Learning with Behavioral Supervisor Tuning
Offline Reinforcement Learning with Behavioral Supervisor Tuning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
27
1
0
25 Apr 2024
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
Shan Xie
35
0
0
03 Apr 2024
Exploration and Anti-Exploration with Distributional Random Network
  Distillation
Exploration and Anti-Exploration with Distributional Random Network Distillation
Kai Yang
Jian Tao
Jiafei Lyu
Xiu Li
32
14
0
18 Jan 2024
Action-Quantized Offline Reinforcement Learning for Robotic Skill
  Learning
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Jianlan Luo
Perry Dong
Jeffrey Wu
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
20
18
0
18 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q Conditioned
  State Entropy Exploration
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
Ziqi Zhang
Xiao Xiong
Zifeng Zhuang
Jinxin Liu
Donglin Wang
OffRL
OnRL
32
0
0
07 Oct 2023
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Laixi Shi
Robert Dadashi
Yuejie Chi
P. S. Castro
M. Geist
OffRL
27
5
0
25 Jul 2023
Beyond Conservatism: Diffusion Policies in Offline Multi-agent
  Reinforcement Learning
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning
Zhuoran Li
Ling Pan
Longbo Huang
DiffM
OffRL
20
7
0
04 Jul 2023
Design from Policies: Conservative Test-Time Adaptation for Offline
  Policy Optimization
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
34
8
0
26 Jun 2023
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement
  Learning
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning
Jinxin Liu
Ziqi Zhang
Zhenyu Wei
Zifeng Zhuang
Yachen Kang
Sibo Gai
Donglin Wang
OffRL
20
16
0
22 Jun 2023
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online
  Reinforcement Learning
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
Siyuan Guo
Yanchao Sun
Jifeng Hu
Sili Huang
Hechang Chen
Haiyin Piao
Lichao Sun
Yi-Ju Chang
OffRL
OnRL
31
7
0
13 Jun 2023
Instructed Diffuser with Temporal Condition Guidance for Offline
  Reinforcement Learning
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning
Jifeng Hu
Yan Sun
Sili Huang
Siyuan Guo
Hechang Chen
Li Shen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
DiffM
OffRL
26
13
0
08 Jun 2023
What is Essential for Unseen Goal Generalization of Offline
  Goal-conditioned RL?
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
21
22
0
30 May 2023
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Revisiting the Minimalist Approach to Offline Reinforcement Learning
Denis Tarasov
Vladislav Kurenkov
Alexander Nikulin
Sergey Kolesnikov
OffRL
25
36
0
16 May 2023
Efficient Online Reinforcement Learning with Offline Data
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
16
161
0
06 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
34
62
0
02 Feb 2023
Anti-Exploration by Random Network Distillation
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
30
24
0
31 Jan 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline
  Reinforcement Learning
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
30
15
0
30 Jan 2023
Confidence-Conditioned Value Functions for Offline Reinforcement
  Learning
Confidence-Conditioned Value Functions for Offline Reinforcement Learning
Joey Hong
Aviral Kumar
Sergey Levine
OffRL
15
20
0
08 Dec 2022
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch
  Size
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
17
14
0
20 Nov 2022
Offline RL With Realistic Datasets: Heteroskedasticity and Support
  Constraints
Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints
Anika Singh
Aviral Kumar
Q. Vuong
Yevgen Chebotar
Sergey Levine
OffRL
19
14
0
02 Nov 2022
Optimal Conservative Offline RL with General Function Approximation via
  Augmented Lagrangian
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
Paria Rashidinejad
Hanlin Zhu
Kunhe Yang
Stuart J. Russell
Jiantao Jiao
OffRL
33
26
0
01 Nov 2022
Optimizing Pessimism in Dynamic Treatment Regimes: A Bayesian Learning
  Approach
Optimizing Pessimism in Dynamic Treatment Regimes: A Bayesian Learning Approach
Yunzhe Zhou
Zhengling Qi
C. Shi
Lexin Li
OffRL
10
8
0
26 Oct 2022
Age of Semantics in Cooperative Communications: To Expedite Simulation
  Towards Real via Offline Reinforcement Learning
Age of Semantics in Cooperative Communications: To Expedite Simulation Towards Real via Offline Reinforcement Learning
Xianfu Chen
Zhifeng Zhao
S. Mao
Celimuge Wu
Honggang Zhang
M. Bennis
OffRL
18
3
0
19 Sep 2022
Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in
  Offline RL
Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL
Wonjoon Goo
S. Niekum
OffRL
14
20
0
01 Jun 2022
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards
  Optimal Sample Complexity
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
Laixi Shi
Gen Li
Yuting Wei
Yuxin Chen
Yuejie Chi
OffRL
26
90
0
28 Feb 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement
  Learning
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
24
132
0
23 Feb 2022
Pessimistic Model-based Offline Reinforcement Learning under Partial
  Coverage
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage
Masatoshi Uehara
Wen Sun
OffRL
93
144
0
13 Jul 2021
COMBO: Conservative Offline Model-Based Policy Optimization
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
214
413
0
16 Feb 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline
  and Online RL
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,951
0
04 May 2020
1