2011.07213
PLAS: Latent Action Space for Offline Reinforcement Learning
14 November 2020
Wenxuan Zhou
Sujay Bajracharya
David Held
OffRL
Papers citing
"PLAS: Latent Action Space for Offline Reinforcement Learning"
50 / 102 papers shown
Title
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
24
26
0
28 Feb 2023
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
38
24
0
31 Jan 2023
Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Francisco Roldan Sanchez
Kevin McGuinness
Noel E. O'Connor
S. Redmond
OffRL
25
3
0
30 Jan 2023
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
Jing Zhang
Chi Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
27
7
0
28 Jan 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRL
OnRL
24
8
0
27 Jan 2023
Generative Slate Recommendation with Reinforcement Learning
Romain Deffayet
Thibaut Thonet
Jean-Michel Renders
Maarten de Rijke
16
23
0
20 Jan 2023
Visuotactile Affordances for Cloth Manipulation with Local Control
N. Sunil
Shaoxiong Wang
Y. She
Edward H. Adelson
Alberto Rodriguez
26
35
0
09 Dec 2022
Learning Options via Compression
Yiding Jiang
E. Liu
Benjamin Eysenbach
Zico Kolter
Chelsea Finn
OffRL
25
13
0
08 Dec 2022
Model-based trajectory stitching for improved behavioural cloning and its applications
Charles A. Hepburn
Giovanni Montana
OffRL
16
5
0
08 Dec 2022
Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning
Guoxi Zhang
H. Kashima
OffRL
23
2
0
29 Nov 2022
CLAS: Coordinating Multi-Robot Manipulation with Central Latent Action Spaces
Elie Aljalbout
Maximilian Karl
Patrick van der Smagt
26
5
0
28 Nov 2022
Model-based Trajectory Stitching for Improved Offline Reinforcement Learning
Charles A. Hepburn
Giovanni Montana
OffRL
26
13
0
21 Nov 2022
Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
D. Akimov
Vladislav Kurenkov
Alexander Nikulin
Denis Tarasov
Sergey Kolesnikov
OffRL
16
9
0
20 Nov 2022
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Dmitry Akimov
Sergey Kolesnikov
OffRL
28
14
0
20 Nov 2022
Dual Generator Offline Reinforcement Learning
Q. Vuong
Aviral Kumar
Sergey Levine
Yevgen Chebotar
OffRL
26
1
0
02 Nov 2022
Dichotomy of Control: Separating What You Can Control from What You Cannot
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
17
42
0
24 Oct 2022
Model-Based Imitation Learning for Urban Driving
Anthony Hu
Gianluca Corrado
Nicolas Griffiths
Zak Murez
Corina Gurau
Hudson Yeo
Alex Kendall
R. Cipolla
Jamie Shotton
112
135
0
14 Oct 2022
Real World Offline Reinforcement Learning with Realistic Data Source
G. Zhou
Liyiming Ke
S. Srinivasa
Abhi Gupta
Aravind Rajeswaran
Vikash Kumar
OffRL
40
21
0
12 Oct 2022
State Advantage Weighting for Offline RL
Jiafei Lyu
Aicheng Gong
Le Wan
Zongqing Lu
Xiu Li
OffRL
33
9
0
09 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
18
7
0
07 Oct 2022
LOPR: Latent Occupancy PRediction using Generative Models
Bernard Lange
Masha Itkina
Mykel J. Kochenderfer
AI4CE
46
5
0
03 Oct 2022
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning
Daesol Cho
D. Shim
H. J. Kim
OffRL
42
11
0
30 Sep 2022
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
DiffM
OffRL
103
105
0
29 Sep 2022
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation
Xiaoteng Ma
Zhipeng Liang
Jose H. Blanchet
MingWen Liu
Li Xia
Jiheng Zhang
Qianchuan Zhao
Zhengyuan Zhou
OOD
OffRL
41
22
0
14 Sep 2022
Dialogue Evaluation with Offline Reinforcement Learning
Nurul Lubis
Christian Geishauser
Hsien-Chin Lin
Carel van Niekerk
Michael Heck
Shutong Feng
Milica Gašić
OffRL
19
4
0
02 Sep 2022
Efficient Planning in a Compact Latent Action Space
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktäschel
Edward Grefenstette
Yuandong Tian
OffRL
18
36
0
22 Aug 2022
Offline Reinforcement Learning at Multiple Frequencies
Kaylee Burns
Tianhe Yu
Chelsea Finn
Karol Hausman
OffRL
22
6
0
26 Jul 2022
Back to the Manifold: Recovering from Out-of-Distribution States
Alfredo Reichlin
G. Marchetti
Hang Yin
Ali Ghadirzadeh
Danica Kragic
OffRL
38
11
0
18 Jul 2022
Latent Policies for Adversarial Imitation Learning
Tianyu Wang
Nikhil Karnwal
Nikolay Atanasov
21
5
0
22 Jun 2022
SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments
Mohan Zhang
Xiaozhou Wang
Benjamin Decardi-Nelson
Bo Song
A. Zhang
...
Jiayi Cheng
Xiaohong Liu
DengDeng Yu
Matthew Poon
Animesh Garg
13
4
0
17 Jun 2022
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination
Jiafei Lyu
Xiu Li
Zongqing Lu
OffRL
29
25
0
16 Jun 2022
Mildly Conservative Q-Learning for Offline Reinforcement Learning
Jiafei Lyu
Xiaoteng Ma
Xiu Li
Zongqing Lu
OffRL
31
101
0
09 Jun 2022
Meta-Learning Parameterized Skills
Haotian Fu
Shangqun Yu
Saket Tiwari
Michael Littman
George Konidaris
35
6
0
07 Jun 2022
Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models
Ali Ghadirzadeh
Petra Poklukar
Karol Arndt
Chelsea Finn
Ville Kyrki
Danica Kragic
Mårten Björkman
OffRL
22
1
0
18 Apr 2022
Forgetting and Imbalance in Robot Lifelong Learning with Off-policy Data
Wenxuan Zhou
Steven Bohez
Jan Humplik
A. Abdolmaleki
Dushyant Rao
Markus Wulfmeier
Tuomas Haarnoja
N. Heess
OffRL
32
6
0
12 Apr 2022
Latent-Variable Advantage-Weighted Policy Optimization for Offline RL
Xi Chen
Ali Ghadirzadeh
Tianhe Yu
Yuan Gao
Jianhao Wang
Wenzhe Li
Bin Liang
Chelsea Finn
Chongjie Zhang
OffRL
32
14
0
16 Mar 2022
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Zhuoran Yang
Zhihong Deng
Animesh Garg
Peng Liu
Zhaoran Wang
OffRL
28
132
0
23 Feb 2022
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu
Haixu Wu
Zihan Qiu
Jianmin Wang
Mingsheng Long
OffRL
35
64
0
13 Feb 2022
How to Leverage Unlabeled Data in Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
29
61
0
03 Feb 2022
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
M. Tomizuka
Wei Zhan
OffRL
11
21
0
09 Nov 2021
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives
Murtaza Dalal
Deepak Pathak
Ruslan Salakhutdinov
29
90
0
28 Oct 2021
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
Mengjiao Yang
Sergey Levine
Ofir Nachum
OffRL
41
42
0
27 Oct 2021
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
21
24
0
08 Oct 2021
Offline Reinforcement Learning with Reverse Model-based Imagination
Jianhao Wang
Wenzhe Li
Haozhe Jiang
Guangxiang Zhu
Siyuan Li
Chongjie Zhang
OffRL
101
59
0
01 Oct 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
32
77
0
16 Sep 2021
DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning
Daniel Seita
Abhinav Gopal
Zhao Mandi
John F. Canny
OffRL
OnRL
21
0
0
15 Sep 2021
DROMO: Distributionally Robust Offline Model-based Policy Optimization
Ruizhen Liu
Dazhi Zhong
Zhi-Cong Chen
OffRL
18
3
0
15 Sep 2021
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation
Boyan Li
Hongyao Tang
Yan Zheng
Jianye Hao
Pengyi Li
Zhen Wang
Zhaopeng Meng
Li Wang
29
42
0
12 Sep 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
413
0
16 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
132
78
0
01 Feb 2021