ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.00177
  4. Cited By
Advantage-Weighted Regression: Simple and Scalable Off-Policy
  Reinforcement Learning

Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning

1 October 2019
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning"

50 / 404 papers shown
Title
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned
  Reinforcement Learning
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
44
22
0
24 Jun 2022
Robust Task Representations for Offline Meta-Reinforcement Learning via
  Contrastive Learning
Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
Haoqi Yuan
Zongqing Lu
SSL
OffRL
42
37
0
21 Jun 2022
A Survey on Model-based Reinforcement Learning
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
53
101
0
19 Jun 2022
Value Memory Graph: A Graph-Structured World Model for Offline
  Reinforcement Learning
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning
Deyao Zhu
Erran L. Li
Mohamed Elhoseiny
OffRL
40
8
0
09 Jun 2022
On the Role of Discount Factor in Offline Reinforcement Learning
On the Role of Discount Factor in Offline Reinforcement Learning
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
39
18
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via
  $f$-Advantage Regression
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via fff-Advantage Regression
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
25
53
0
07 Jun 2022
Offline RL for Natural Language Generation with Implicit Language Q
  Learning
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
144
103
0
05 Jun 2022
On Reinforcement Learning and Distribution Matching for Fine-Tuning
  Language Models with no Catastrophic Forgetting
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Tomasz Korbak
Hady ElSahar
Germán Kruszewski
Marc Dymetman
CLL
33
51
0
01 Jun 2022
Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in
  Offline RL
Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL
Wonjoon Goo
S. Niekum
OffRL
35
20
0
01 Jun 2022
Non-Markovian policies occupancy measures
Non-Markovian policies occupancy measures
Romain Laroche
Rémi Tachet des Combes
Jacob Buckman
OffRL
41
1
0
27 May 2022
Why So Pessimistic? Estimating Uncertainties for Offline RL through
  Ensembles, and Why Their Independence Matters
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters
Seyed Kamyar Seyed Ghasemipour
S. Gu
Ofir Nachum
OffRL
31
69
0
27 May 2022
When Data Geometry Meets Deep Function: Generalizing Offline
  Reinforcement Learning
When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning
Jianxiong Li
Xianyuan Zhan
Haoran Xu
Xiangyu Zhu
Jingjing Liu
Ya Zhang
OffRL
40
25
0
23 May 2022
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Offline Policy Comparison with Confidence: Benchmarks and Baselines
Anurag Koul
Mariano Phielipp
Alan Fern
OffRL
28
0
0
22 May 2022
User-Interactive Offline Reinforcement Learning
User-Interactive Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
33
11
0
21 May 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still
  Insufficient according to an Off-Policy Measure
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
43
8
0
20 May 2022
When Should We Prefer Offline Reinforcement Learning Over Behavioral
  Cloning?
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?
Aviral Kumar
Joey Hong
Anika Singh
Sergey Levine
OffRL
50
77
0
12 Apr 2022
Jump-Start Reinforcement Learning
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
44
109
0
05 Apr 2022
Demonstration-Bootstrapped Autonomous Practicing via Multi-Task
  Reinforcement Learning
Demonstration-Bootstrapped Autonomous Practicing via Multi-Task Reinforcement Learning
Abhishek Gupta
Corey Lynch
Brandon Kinman
Garrett Peake
Sergey Levine
Karol Hausman
OffRL
19
17
0
29 Mar 2022
Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
Xi Chen
Ali Ghadirzadeh
Tianhe Yu
Yuan Gao
Jianhao Wang
Wenzhe Li
Bin Liang
Chelsea Finn
Chongjie Zhang
OffRL
35
14
0
16 Mar 2022
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement
  Learning
DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning
Jinxin Liu
Hongyin Zhang
Donglin Wang
OffRL
38
33
0
13 Mar 2022
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open
  Problems
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems
Rafael Figueiredo Prudencio
Marcos R. O. A. Máximo
Esther Luna Colombini
OffRL
31
223
0
02 Mar 2022
Learning Relative Return Policies With Upside-Down Reinforcement
  Learning
Learning Relative Return Policies With Upside-Down Reinforcement Learning
Dylan R. Ashley
Kai Arulkumaran
Jürgen Schmidhuber
R. Srivastava
OffRL
24
1
0
23 Feb 2022
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning
Che Wang
Xufang Luo
Keith Ross
Dongsheng Li
OffRL
30
49
0
17 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
40
65
0
13 Feb 2022
Rethinking Goal-conditioned Supervised Learning and Its Connection to
  Offline RL
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
OffRL
51
67
0
09 Feb 2022
Can Wikipedia Help Offline Reinforcement Learning?
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
The Challenges of Exploration for Offline Reinforcement Learning
The Challenges of Exploration for Offline Reinforcement Learning
Nathan Lambert
Markus Wulfmeier
William F. Whitney
Arunkumar Byravan
Michael Bloesch
Vibhavari Dasagi
Tim Hertweck
Martin Riedmiller
OffRL
33
27
0
27 Jan 2022
Priors, Hierarchy, and Information Asymmetry for Skill Transfer in
  Reinforcement Learning
Priors, Hierarchy, and Information Asymmetry for Skill Transfer in Reinforcement Learning
Sasha Salter
Kristian Hartikainen
Walter Goodwin
Ingmar Posner
OffRL
38
5
0
20 Jan 2022
RvS: What is Essential for Offline RL via Supervised Learning?
RvS: What is Essential for Offline RL via Supervised Learning?
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
31
170
0
20 Dec 2021
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Athul Paul Jacob
David J. Wu
Gabriele Farina
Adam Lerer
Hengyuan Hu
A. Bakhtin
Jacob Andreas
Noam Brown
29
52
0
14 Dec 2021
Quantile Filtered Imitation Learning
Quantile Filtered Imitation Learning
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
33
6
0
02 Dec 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
24
21
0
09 Nov 2021
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at
  Scale
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale
Yao Lu
Karol Hausman
Yevgen Chebotar
Mengyuan Yan
Eric Jang
...
Ted Xiao
A. Irpan
Mohi Khansari
Dmitry Kalashnikov
Sergey Levine
OffRL
97
59
0
09 Nov 2021
Towards an Understanding of Default Policies in Multitask Policy
  Optimization
Towards an Understanding of Default Policies in Multitask Policy Optimization
Theodore H. Moskovitz
Michael Arbel
Jack Parker-Holder
Aldo Pacchiano
30
9
0
04 Nov 2021
Curriculum Offline Imitation Learning
Curriculum Offline Imitation Learning
Minghuan Liu
Hanye Zhao
Zhengyu Yang
Jian Shen
Weinan Zhang
Li Zhao
Tie-Yan Liu
OffRL
29
1
0
03 Nov 2021
Offline Reinforcement Learning with Value-based Episodic Memory
Offline Reinforcement Learning with Value-based Episodic Memory
Xiaoteng Ma
Yiqin Yang
Haotian Hu
Qihan Liu
Jun Yang
Chongjie Zhang
Qianchuan Zhao
Bin Liang
OffRL
40
42
0
19 Oct 2021
Offline Reinforcement Learning with Soft Behavior Regularization
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
31
31
0
14 Oct 2021
Safe Driving via Expert Guided Policy Optimization
Safe Driving via Expert Guided Policy Optimization
Zhenghao Peng
Quanyi Li
Chunxiao Liu
Bolei Zhou
OffRL
31
41
0
13 Oct 2021
On Covariate Shift of Latent Confounders in Imitation and Reinforcement
  Learning
On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
Guy Tennenholtz
Assaf Hallak
Gal Dalal
Shie Mannor
Gal Chechik
Uri Shalit
OOD
OffRL
55
15
0
13 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
214
852
0
12 Oct 2021
Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in
  the Real World
Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World
Laura M. Smith
J. Kew
Xue Bin Peng
Sehoon Ha
Jie Tan
Sergey Levine
38
101
0
11 Oct 2021
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise
  Datasets
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets
J. E. Grigsby
Yanjun Qi
OffRL
34
5
0
10 Oct 2021
TiKick: Towards Playing Multi-agent Football Full Games from
  Single-agent Demonstrations
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
Shiyu Huang
Wenze Chen
Longfei Zhang
Shizhen Xu
Ziyang Li
Fengming Zhu
Deheng Ye
Tingling Chen
Jun Zhu
OffRL
45
25
0
09 Oct 2021
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Wonjoon Goo
S. Niekum
OffRL
45
8
0
05 Oct 2021
Offline Reinforcement Learning with Reverse Model-based Imagination
Offline Reinforcement Learning with Reverse Model-based Imagination
Jianhao Wang
Wenzhe Li
Haozhe Jiang
Guangxiang Zhu
Siyuan Li
Chongjie Zhang
OffRL
117
61
0
01 Oct 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
143
85
0
22 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
35
77
0
16 Sep 2021
Collect & Infer -- a fresh look at data-efficient Reinforcement Learning
Collect & Infer -- a fresh look at data-efficient Reinforcement Learning
Martin Riedmiller
Jost Tobias Springenberg
Roland Hafner
N. Heess
OffRL
28
17
0
23 Aug 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from
  Human Feedback
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
34
42
0
11 Aug 2021
Offline Decentralized Multi-Agent Reinforcement Learning
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang
Zongqing Lu
OffRL
30
37
0
04 Aug 2021
Previous
123456789
Next