ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.15134
  4. Cited By
Critic Regularized Regression
v1v2v3 (latest)

Critic Regularized Regression

26 June 2020
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
Bobak Shahriari
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Critic Regularized Regression"

50 / 242 papers shown
Real-World Reinforcement Learning of Active Perception Behaviors
Real-World Reinforcement Learning of Active Perception Behaviors
E. Hu
Jie Wang
Xingfang Yuan
Fiona Luo
Muyao Li
Gaspard Lambrechts
Oleh Rybkin
Dinesh Jayaraman
OffRL
284
2
0
01 Dec 2025
$π^{*}_{0.6}$: a VLA That Learns From Experience
π0.6∗π^{*}_{0.6}π0.6∗​: a VLA That Learns From Experience
Physical Intelligence
Ali Amin
Raichelle Aniceto
Ashwin Balakrishna
Kevin Black
...
Blake Williams
Sukwon Yoo
Lili Yu
Ury Zhilinsky
Zhiyuan Zhou
OffRLVLM
1.2K
89
0
18 Nov 2025
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning
Yixiu Mao
Yun Qu
Qi Wang
Xiangyang Ji
OffRL
189
1
0
04 Nov 2025
Beyond Static LLM Policies: Imitation-Enhanced Reinforcement Learning for Recommendation
Beyond Static LLM Policies: Imitation-Enhanced Reinforcement Learning for Recommendation
Yi Zhang
Lili Xie
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
157
1
0
15 Oct 2025
Expert or not? assessing data quality in offline reinforcement learning
Expert or not? assessing data quality in offline reinforcement learning
Arip Asadulaev
Fakhri Karray
Martin Takáč
OffRL
156
0
0
14 Oct 2025
DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
DEAS: DEtached value learning with Action Sequence for Scalable Offline RL
Changyeon Kim
Haeone Lee
Younggyo Seo
Kimin Lee
Yuke Zhu
OffRL
166
2
0
09 Oct 2025
A KL-regularization framework for learning to plan with adaptive priors
A KL-regularization framework for learning to plan with adaptive priors
Álvaro Serra-Gómez
Daniel Jarne Ornia
Dhruva Tirumala
Thomas Moerland
OffRL
139
1
0
05 Oct 2025
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Physics-informed Value Learner for Offline Goal-Conditioned Reinforcement Learning
Vittorio Giammarino
Ruiqi Ni
A. H. Qureshi
OffRLAI4CE
260
2
0
08 Sep 2025
floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL
floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL
Bhavya Agrawalla
Michal Nauman
Khush Agarwal
Aviral Kumar
OffRL
278
10
0
08 Sep 2025
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
Lu Guo
Yixiang Shan
Zhengbang Zhu
Qifan Liang
Lichang Song
Ting Long
Weinan Zhang
Yi-Ju Chang
OffRL
253
0
0
21 Jul 2025
2048: Reinforcement Learning in a Delayed Reward Environment
2048: Reinforcement Learning in a Delayed Reward Environment
Prady Saligram
Tanvir Bhathal
Robby Manihani
OffRL
246
1
0
07 Jul 2025
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization
Ranting Hu
OffRL
325
0
0
18 Jun 2025
Horizon Reduction Makes RL Scalable
Horizon Reduction Makes RL Scalable
Seohong Park
Kevin Frans
Deepinder Mann
Benjamin Eysenbach
Aviral Kumar
Sergey Levine
OffRL
719
24
0
04 Jun 2025
Reachability Weighted Offline Goal-conditioned Resampling
Reachability Weighted Offline Goal-conditioned Resampling
Wenyan Yang
Joni Pajarinen
OffRL
231
0
0
03 Jun 2025
Diffusion Guidance Is a Controllable Policy Improvement Operator
Diffusion Guidance Is a Controllable Policy Improvement Operator
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
371
28
0
29 May 2025
FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
FlowQ: Energy-Guided Flow Policies for Offline Reinforcement Learning
Marvin Alles
Nutan Chen
Patrick van der Smagt
Botond Cseke
463
3
0
20 May 2025
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Dongsu Lee
Minhae Kwon
OffRL
401
3
0
19 May 2025
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward
DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic RewardAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Yi Zhang
Ruihong Qiu
Xuwei Xu
Jiajun Liu
Sen Wang
OffRL
310
5
0
12 May 2025
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making
Jake Grigsby
Yuke Zhu
Michael S Ryoo
Juan Carlos Niebles
OffRLVLM
415
3
0
06 May 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
455
3
0
17 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
348
1
0
06 Apr 2025
Behavior Preference Regression for Offline Reinforcement Learning
Behavior Preference Regression for Offline Reinforcement Learning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
241
0
0
02 Mar 2025
LEGATO: Cross-Embodiment Imitation Using a Grasping Tool
LEGATO: Cross-Embodiment Imitation Using a Grasping ToolIEEE Robotics and Automation Letters (RA-L), 2024
Mingyo Seo
H. Andy Park
Shenli Yuan
Yuke Zhu
Luis Sentis
595
25
0
20 Feb 2025
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement LearningInternational Conference on Learning Representations (ICLR), 2025
Wesley A. Suttle
A. Suresh
Carlos Nieto-Granda
OffRL
353
4
0
06 Feb 2025
The Best Instruction-Tuning Data are Those That Fit
The Best Instruction-Tuning Data are Those That Fit
Dylan Zhang
Qirun Dai
Hao Peng
ALM
640
27
0
06 Feb 2025
Geometric-Averaged Preference Optimization for Soft Preference Labels
Geometric-Averaged Preference Optimization for Soft Preference LabelsNeural Information Processing Systems (NeurIPS), 2024
Hiroki Furuta
Kuang-Huei Lee
Shixiang Shane Gu
Y. Matsuo
Aleksandra Faust
Heiga Zen
Izzeddin Gur
480
17
0
31 Dec 2024
Hierarchical Multi-agent Meta-Reinforcement Learning for Cross-channel
  Bidding
Hierarchical Multi-agent Meta-Reinforcement Learning for Cross-channel BiddingIEEE Transactions on Knowledge and Data Engineering (TKDE), 2024
Shenghong He
Chao Yu
300
3
0
26 Dec 2024
Contrastive Representation for Interactive Recommendation
Contrastive Representation for Interactive RecommendationAAAI Conference on Artificial Intelligence (AAAI), 2024
Jingyu Li
Zhiyong Feng
Dongxiao He
Hongqi Chen
Qinghang Gao
Guoli Wu
405
2
0
24 Dec 2024
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement LearningIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
496
10
0
22 Dec 2024
Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
Zhihong Liu
Long Qian
Zeyang Liu
Lipeng Wan
Xingyu Chen
Xuguang Lan
OffRL
430
3
0
18 Nov 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with TransformersNeural Information Processing Systems (NeurIPS), 2024
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
357
13
0
17 Nov 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action
  Suppression
Offline Reinforcement Learning with OOD State Correction and OOD Action SuppressionNeural Information Processing Systems (NeurIPS), 2024
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
646
19
0
25 Oct 2024
Solving Continual Offline RL through Selective Weights Activation on
  Aligned Spaces
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
Hechang Chen
Yi Chang
Dacheng Tao
Lichao Sun
OffRL
250
2
0
21 Oct 2024
Robust Offline Imitation Learning from Diverse Auxiliary Data
Robust Offline Imitation Learning from Diverse Auxiliary Data
Udita Ghosh
Dripta S. Raychaudhuri
Jiachen Li
Konstantinos Karydis
Amit K. Roy-Chowdhury
OffRL
538
3
0
04 Oct 2024
Unsupervised-to-Online Reinforcement Learning
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
295
11
0
27 Aug 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
SelfBC: Self Behavior Cloning for Offline Reinforcement LearningEuropean Conference on Artificial Intelligence (ECAI), 2024
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
319
3
0
04 Aug 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement
  Learning
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
420
37
0
29 Jul 2024
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems
Yi Zhang
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
398
6
0
18 Jul 2024
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous
  Control
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Huayu Chen
Kaiwen Zheng
Hang Su
Jun Zhu
390
8
0
12 Jul 2024
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous
  Reinforcement Learning
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2024
Hao Bai
Yifei Zhou
Mert Cemri
Jiayi Pan
Alane Suhr
Sergey Levine
Aviral Kumar
OffRL
495
151
0
14 Jun 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
323
57
0
13 Jun 2024
Integrating Domain Knowledge for handling Limited Data in Offline RL
Integrating Domain Knowledge for handling Limited Data in Offline RL
Briti Gangopadhyay
Zhao Wang
Jia-Fong Yeh
Shingo Takamatsu
OffRL
253
1
0
11 Jun 2024
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RLNeural Information Processing Systems (NeurIPS), 2024
Qi Lv
Xiang Deng
Gongwei Chen
Michael Yu Wang
Liqiang Nie
603
18
0
08 Jun 2024
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function
  in Offline Reinforcement Learning
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
Yu Zhang
Rui Yu
Zhipeng Yao
Wenyuan Zhang
Jun Wang
Liming Zhang
OffRL
306
0
0
05 Jun 2024
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement
  Learning
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
Tianle Zhang
Jiayi Guan
Lin Zhao
Yihang Li
Dongjiang Li
...
Lei Sun
Yue Chen
Xuelong Wei
Lusong Li
Xiaodong He
330
2
0
29 May 2024
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization
Longxiang He
Li Shen
Xueqian Wang
355
13
0
28 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
482
8
0
25 May 2024
Towards Robust Policy: Enhancing Offline Reinforcement Learning with
  Adversarial Attacks and Defenses
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and DefensesInternational Conferences on Pattern Recognition and Artificial Intelligence (ICCPRAI), 2024
Thanh Nguyen
Tung M. Luu
Tri Ton
Chang D. Yoo
OffRLAAML
309
5
0
18 May 2024
Reinformer: Max-Return Sequence Modeling for Offline RL
Reinformer: Max-Return Sequence Modeling for Offline RLInternational Conference on Machine Learning (ICML), 2024
Zifeng Zhuang
Dengyun Peng
Jinxin Liu
Ziqi Zhang
Xuetao Zhang
OffRLAI4TS
373
29
0
14 May 2024
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement
  Learning
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement LearningConference on Robot Learning (CoRL), 2024
Dhruva Tirumala
Markus Wulfmeier
Ben Moran
Sandy Huang
Jan Humplik
...
Kushal Patel
Marlon Gwira
Francesco Nori
Martin Riedmiller
N. Heess
285
26
0
03 May 2024
12345
Next
Page 1 of 5