ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.04607
  4. Cited By
Confidence-Conditioned Value Functions for Offline Reinforcement
  Learning
v1v2 (latest)

Confidence-Conditioned Value Functions for Offline Reinforcement Learning

International Conference on Learning Representations (ICLR), 2022
8 December 2022
Joey Hong
Aviral Kumar
Sergey Levine
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Confidence-Conditioned Value Functions for Offline Reinforcement Learning"

17 / 17 papers shown
Reinforcement Learning Gradients as Vitamin for Online Finetuning
  Decision Transformers
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision TransformersNeural Information Processing Systems (NeurIPS), 2024
Kai Yan
Alex Schwing
Yu-Xiong Wang
OffRLOnRL
250
5
0
31 Oct 2024
An Offline Adaptation Framework for Constrained Multi-Objective
  Reinforcement Learning
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Qian Lin
Zongkai Liu
Danying Mo
Chao Yu
OffRL
340
2
0
16 Sep 2024
Preference-Optimized Pareto Set Learning for Blackbox Optimization
Preference-Optimized Pareto Set Learning for Blackbox Optimization
Zhang Haishan
Chen Liang
Koji Tsuda
247
1
0
19 Aug 2024
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning
  via Causal Normalizing Flows
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows
Minjae Cho
Jonathan P. How
Chuangchuang Sun
OODDOffRL
210
1
0
06 May 2024
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
Yi Shen
Hanyan Huang
Shan Xie
225
1
0
03 Apr 2024
A2PO: Towards Effective Offline Reinforcement Learning from an
  Advantage-aware Perspective
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware PerspectiveNeural Information Processing Systems (NeurIPS), 2024
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
469
10
0
12 Mar 2024
Exploration and Anti-Exploration with Distributional Random Network
  Distillation
Exploration and Anti-Exploration with Distributional Random Network Distillation
Kai Yang
Jian Tao
Jiafei Lyu
Xiu Li
474
35
0
18 Jan 2024
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with
  Multi-Step On-Policy Optimization
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy OptimizationInternational Conference on Learning Representations (ICLR), 2023
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRLOnRL
395
28
0
06 Nov 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online
  Reinforcement Learning
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
361
27
0
27 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRLOnRL
319
4
0
12 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q Conditioned
  State Entropy Exploration
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
Ziqi Zhang
Xiao Xiong
Zifeng Zhuang
Jinxin Liu
Xuetao Zhang
OffRLOnRL
382
0
0
07 Oct 2023
Learning Control Policies for Variable Objectives from Offline Data
Learning Control Policies for Variable Objectives from Offline DataIEEE Symposium Series on Computational Intelligence (IEEE-SSCI), 2023
Marc Weber
Phillip Swazinna
D. Hein
Steffen Udluft
V. Sterzing
OffRL
228
9
0
11 Aug 2023
Model-based Offline Reinforcement Learning with Count-based Conservatism
Model-based Offline Reinforcement Learning with Count-based ConservatismInternational Conference on Machine Learning (ICML), 2023
Byeongchang Kim
Min Hwan Oh
OffRL
220
16
0
21 Jul 2023
Budgeting Counterfactual for Offline RL
Budgeting Counterfactual for Offline RLNeural Information Processing Systems (NeurIPS), 2023
Yao Liu
Pratik Chaudhari
Rasool Fakoor
OffRL
309
4
0
12 Jul 2023
Automatic Trade-off Adaptation in Offline RL
Automatic Trade-off Adaptation in Offline RLThe European Symposium on Artificial Neural Networks (ESANN), 2023
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
153
0
0
16 Jun 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement
  Learning
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Ya Zhang
OffRLOnRL
304
29
0
25 May 2023
Anti-Exploration by Random Network Distillation
Anti-Exploration by Random Network DistillationInternational Conference on Machine Learning (ICML), 2023
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
268
48
0
31 Jan 2023
1
Page 1 of 1