ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.04779
  4. Cited By
Conservative Q-Learning for Offline Reinforcement Learning

Conservative Q-Learning for Offline Reinforcement Learning

8 June 2020
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
    OffRL
    OnRL
ArXivPDFHTML

Papers citing "Conservative Q-Learning for Offline Reinforcement Learning"

50 / 392 papers shown
Title
Reinforcement Learning in Practice: Opportunities and Challenges
Reinforcement Learning in Practice: Opportunities and Challenges
Yuxi Li
OffRL
36
9
0
23 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
35
64
0
13 Feb 2022
Rethinking Goal-conditioned Supervised Learning and Its Connection to
  Offline RL
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
OffRL
46
66
0
09 Feb 2022
Model-Based Offline Meta-Reinforcement Learning with Regularization
Model-Based Offline Meta-Reinforcement Learning with Regularization
Sen Lin
Jialin Wan
Tengyu Xu
Yingbin Liang
Junshan Zhang
OffRL
31
17
0
07 Feb 2022
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
11
125
0
05 Feb 2022
Versatile Offline Imitation from Observations and Examples via
  Regularized State-Occupancy Matching
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
Yecheng Jason Ma
Andrew Shen
Dinesh Jayaraman
Osbert Bastani
OffRL
23
32
0
04 Feb 2022
Offline Reinforcement Learning for Mobile Notifications
Offline Reinforcement Learning for Mobile Notifications
Yiping Yuan
A. Muralidharan
Preetam Nandy
Miao Cheng
Prakruthi Prabhakar
OffRL
25
9
0
04 Feb 2022
Can Wikipedia Help Offline Reinforcement Learning?
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Conservative Distributional Reinforcement Learning with Safety
  Constraints
Conservative Distributional Reinforcement Learning with Safety Constraints
Hengrui Zhang
Youfang Lin
Sheng Han
Shuo Wang
Kai Lv
OffRL
21
5
0
18 Jan 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
33
100
0
11 Jan 2022
Offline Reinforcement Learning for Road Traffic Control
Offline Reinforcement Learning for Road Traffic Control
Mayuresh Kunjir
S. Chawla
OffRL
32
4
0
07 Jan 2022
Importance of Empirical Sample Complexity Analysis for Offline
  Reinforcement Learning
Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning
Samin Yeasar Arnob
Riashat Islam
Doina Precup
OffRL
16
4
0
31 Dec 2021
RvS: What is Essential for Offline RL via Supervised Learning?
RvS: What is Essential for Offline RL via Supervised Learning?
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
31
170
0
20 Dec 2021
Data Augmentation through Expert-guided Symmetry Detection to Improve
  Performance in Offline Reinforcement Learning
Data Augmentation through Expert-guided Symmetry Detection to Improve Performance in Offline Reinforcement Learning
Giorgio Angelotti
Nicolas Drougard
Caroline Ponzoni Carvalho Chanel
OffRL
28
2
0
18 Dec 2021
Sample-Efficient Reinforcement Learning via Conservative Model-Based
  Actor-Critic
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang
Jie Wang
Qi Zhou
Bin Li
Houqiang Li
21
30
0
16 Dec 2021
Learning Contraction Policies from Offline Data
Learning Contraction Policies from Offline Data
Navid Rezazadeh
Maxwell Kolarich
Solmaz S. Kia
Negar Mehr
OffRL
21
7
0
11 Dec 2021
Learning Transferable Motor Skills with Hierarchical Latent Mixture
  Policies
Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
Dushyant Rao
Fereshteh Sadeghi
Leonard Hasenclever
Markus Wulfmeier
Martina Zambelli
...
Dhruva Tirumala
Y. Aytar
J. Merel
N. Heess
R. Hadsell
13
28
0
09 Dec 2021
DR3: Value-Based Deep Reinforcement Learning Requires Explicit
  Regularization
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Aviral Kumar
Rishabh Agarwal
Tengyu Ma
Aaron Courville
George Tucker
Sergey Levine
OffRL
31
65
0
09 Dec 2021
Godot Reinforcement Learning Agents
Godot Reinforcement Learning Agents
E. Beeching
Jilles Debangoye
Olivier Simonin
Christian Wolf
GP
OnRL
21
5
0
07 Dec 2021
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence
  Model Tackles All SMAC Tasks
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Linghui Meng
Muning Wen
Yaodong Yang
Chenyang Le
Xiyun Li
Weinan Zhang
Ying Wen
Haifeng Zhang
Jun Wang
Bo Xu
OffRL
26
38
0
06 Dec 2021
Quantile Filtered Imitation Learning
Quantile Filtered Imitation Learning
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
28
6
0
02 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
24
4
0
29 Nov 2021
Measuring Data Quality for Dataset Selection in Offline Reinforcement
  Learning
Measuring Data Quality for Dataset Selection in Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
25
6
0
26 Nov 2021
The Impact of Data Distribution on Q-learning with Function
  Approximation
The Impact of Data Distribution on Q-learning with Function Approximation
Pedro P. Santos
Diogo S. Carvalho
A. Sardinha
Francisco S. Melo
OffRL
16
2
0
23 Nov 2021
UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning
  Leveraging Planning
UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning
Christopher P. Diehl
Timo Sievernich
Martin Krüger
F. Hoffmann
Torsten Bertram
OffRL
26
26
0
22 Nov 2021
Compressive Features in Offline Reinforcement Learning for Recommender
  Systems
Compressive Features in Offline Reinforcement Learning for Recommender Systems
Hung Nguyen
Minh Nguyen
Long Pham
Jennifer Adorno Nieves
OffRL
16
2
0
16 Nov 2021
Exploiting Action Impact Regularity and Exogenous State Variables for
  Offline Reinforcement Learning
Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning
Vincent Liu
James Wright
Martha White
OffRL
31
1
0
15 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
13
21
0
09 Nov 2021
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at
  Scale
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale
Yao Lu
Karol Hausman
Yevgen Chebotar
Mengyuan Yan
Eric Jang
...
Ted Xiao
A. Irpan
Mohi Khansari
Dmitry Kalashnikov
Sergey Levine
OffRL
92
59
0
09 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
60
100
0
06 Nov 2021
Curriculum Offline Imitation Learning
Curriculum Offline Imitation Learning
Minghuan Liu
Hanye Zhao
Zhengyu Yang
Jian Shen
Weinan Zhang
Li Zhao
Tie-Yan Liu
OffRL
26
1
0
03 Nov 2021
Efficient Robotic Manipulation Through Offline-to-Online Reinforcement
  Learning and Goal-Aware State Information
Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information
Jin Li
Xianyuan Zhan
Zixu Xiao
Guyue Zhou
OffRL
OnRL
27
2
0
21 Oct 2021
Data-Driven Offline Optimization For Architecting Hardware Accelerators
Data-Driven Offline Optimization For Architecting Hardware Accelerators
Aviral Kumar
Amir Yazdanbakhsh
Milad Hashemi
Kevin Swersky
Sergey Levine
27
36
0
20 Oct 2021
Continuous Control with Action Quantization from Demonstrations
Continuous Control with Action Quantization from Demonstrations
Robert Dadashi
Léonard Hussenot
Damien Vincent
Sertan Girgin
Anton Raichuk
M. Geist
Olivier Pietquin
OffRL
33
23
0
19 Oct 2021
No RL, No Simulation: Learning to Navigate without Navigating
No RL, No Simulation: Learning to Navigate without Navigating
Meera Hahn
Devendra Singh Chaplot
Shubham Tulsiani
Mustafa Mukadam
James M. Rehg
Abhinav Gupta
75
71
0
18 Oct 2021
Value Penalized Q-Learning for Recommender Systems
Value Penalized Q-Learning for Recommender Systems
Chengqian Gao
Ke Xu
Kuangqi Zhou
Lanqing Li
Xueqian Wang
Bo Yuan
P. Zhao
OffRL
52
20
0
15 Oct 2021
Safe Autonomous Racing via Approximate Reachability on Ego-vision
Safe Autonomous Racing via Approximate Reachability on Ego-vision
Bingqing Chen
Jonathan M Francis
Jean Oh
Eric Nyberg
Sylvia L. Herbert
56
14
0
14 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
26
31
0
14 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
843
0
12 Oct 2021
Fast Block Linear System Solver Using Q-Learning Schduling for Unified
  Dynamic Power System Simulations
Fast Block Linear System Solver Using Q-Learning Schduling for Unified Dynamic Power System Simulations
Yingshi Chen
Xinli Song
Han-Sen Dai
Tao Liu
Wuzhi Zhong
Guoyang Wu
14
0
0
12 Oct 2021
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise
  Datasets
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets
J. E. Grigsby
Yanjun Qi
OffRL
24
5
0
10 Oct 2021
Representation Learning for Online and Offline RL in Low-rank MDPs
Representation Learning for Online and Offline RL in Low-rank MDPs
Masatoshi Uehara
Xuezhou Zhang
Wen Sun
OffRL
62
126
0
09 Oct 2021
Offline RL With Resource Constrained Online Deployment
Offline RL With Resource Constrained Online Deployment
Jayanth Reddy Regatti
A. Deshmukh
Frank Cheng
Young Hun Jung
Abhishek Gupta
Ürün Dogan
OffRL
13
2
0
07 Oct 2021
MetaDrive: Composing Diverse Driving Scenarios for Generalizable
  Reinforcement Learning
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning
Quanyi Li
Zhenghao Peng
Lan Feng
Qihang Zhang
Zhenghai Xue
Bolei Zhou
39
230
0
26 Sep 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
143
85
0
22 Sep 2021
Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning
Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
30
2
0
19 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
35
77
0
16 Sep 2021
Implicit Behavioral Cloning
Implicit Behavioral Cloning
Peter R. Florence
Corey Lynch
Andy Zeng
Oscar Ramirez
Ayzaan Wahid
Laura Downs
Adrian S. Wong
Johnny Lee
Igor Mordatch
Jonathan Tompson
OffRL
49
368
0
01 Sep 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement
  Learning
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
29
111
0
19 Aug 2021
Offline Decentralized Multi-Agent Reinforcement Learning
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang
Zongqing Lu
OffRL
28
37
0
04 Aug 2021
Previous
12345678
Next