Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.11089
Cited By
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
25 February 2020
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement"
50 / 57 papers shown
Title
Learning to chain-of-thought with Jensen's evidence lower bound
Yunhao Tang
Sid Wang
Rémi Munos
BDL
OffRL
LRM
52
0
0
25 Mar 2025
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
71
1
0
22 Aug 2024
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing
Xinbo Zhao
Yingxue Zhang
Xin Zhang
Yu Yang
Yiqun Xie
Yanhua Li
Jun-Jie Luo
OffRL
32
2
0
20 Jun 2024
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
31
9
0
30 Apr 2024
The Virtues of Pessimism in Inverse Reinforcement Learning
David Wu
Gokul Swamy
J. Andrew Bagnell
Zhiwei Steven Wu
Sanjiban Choudhury
33
0
0
04 Feb 2024
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
33
10
0
15 Oct 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
24
8
0
04 Sep 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets
Anirudhan Badrinath
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
22
16
0
24 Jun 2023
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
21
22
0
30 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Biwei Huang
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
16
17
0
28 May 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya-Qin Zhang
Dacheng Tao
OffRL
28
15
0
07 Mar 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
23
24
0
29 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi-An Ma
Sergey Levine
34
14
0
24 Dec 2022
Offline Reinforcement Learning for Visual Navigation
Dhruv Shah
Arjun Bhorkar
Hrish Leen
Ilya Kostrikov
Nicholas Rhinehart
Sergey Levine
OffRL
14
29
0
16 Dec 2022
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation
Hiroki Furuta
Yusuke Iwasawa
Yutaka Matsuo
S. Gu
18
14
0
25 Nov 2022
Generalization with Lossy Affordances: Leveraging Broad Offline Data for Learning Visuomotor Tasks
Kuan Fang
Patrick Yin
Ashvin Nair
Homer Walke
Ge Yan
Sergey Levine
OffRL
28
22
0
12 Oct 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective
Lunjun Zhang
Bradly C. Stadie
18
1
0
26 Sep 2022
Learning Multi-Task Transferable Rewards via Variational Inverse Reinforcement Learning
Se-Wook Yoo
Seung-Woo Seo
DRL
14
5
0
19 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
25
138
0
15 Jun 2022
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning
Nicolas Castanet
Sylvain Lamprier
Olivier Sigaud
17
2
0
14 Jun 2022
Imitating Past Successes can be Very Suboptimal
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
29
16
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via
f
f
f
-Advantage Regression
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
15
51
0
07 Jun 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
50
29
0
17 May 2022
Modeling Human Behavior Part I -- Learning and Belief Approaches
Andrew Fuchs
A. Passarella
M. Conti
29
7
0
13 May 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Philippe Hansen-Estruch
Amy Zhang
Ashvin Nair
Patrick Yin
Sergey Levine
AI4CE
23
27
0
27 Apr 2022
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Charles Burton Snell
Mengjiao Yang
Justin Fu
Yi Su
Sergey Levine
19
22
0
18 Apr 2022
Automating Reinforcement Learning with Example-based Resets
Jigang Kim
Jaehyeon Park
Daesol Cho
H. J. Kim
CLL
OnRL
19
14
0
05 Apr 2022
One After Another: Learning Incremental Skills for a Changing World
Nur Muhammad (Mahi) Shafiullah
Lerrel Pinto
CLL
11
13
0
21 Mar 2022
Switch Trajectory Transformer with Distributional Value Approximation for Multi-Task Reinforcement Learning
Qinjie Lin
Han Liu
B. Sengupta
OffRL
24
11
0
14 Mar 2022
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
OffRL
38
65
0
09 Feb 2022
How to Leverage Unlabeled Data in Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
27
61
0
03 Feb 2022
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Denis Yarats
David Brandfonbrener
Hao Liu
Michael Laskin
Pieter Abbeel
A. Lazaric
Lerrel Pinto
OffRL
OnRL
19
84
0
31 Jan 2022
The Challenges of Exploration for Offline Reinforcement Learning
Nathan Lambert
Markus Wulfmeier
William F. Whitney
Arunkumar Byravan
Michael Bloesch
Vibhavari Dasagi
Tim Hertweck
Martin Riedmiller
OffRL
26
27
0
27 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
24
131
0
20 Jan 2022
STIR
2
^2
2
: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks
Jesús Bujalance Martín
Fabien Moutarde
OffRL
25
2
0
11 Jan 2022
RvS: What is Essential for Offline RL via Supervised Learning?
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
23
170
0
20 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
18
18
0
02 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
16
18
0
01 Dec 2021
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
11
99
0
19 Nov 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
T. Luu
Chang-Dong Yoo
10
8
0
28 Oct 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
17
5
0
27 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
Jinghuan Shang
Kumara Kahatapitiya
Xiang Li
Michael S. Ryoo
OffRL
35
33
0
12 Oct 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
21
7
0
18 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
24
78
0
16 Sep 2021
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
Yuexiang Zhai
Christina Baek
Zhengyuan Zhou
Jiantao Jiao
Yi-An Ma
19
22
0
08 Jul 2021
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
14
17
0
01 Jul 2021
DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies
Soroush Nasiriany
Vitchyr H. Pong
Ashvin Nair
Alexander Khazatsky
Glen Berseth
Sergey Levine
OffRL
58
14
0
23 Apr 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
42
270
0
16 Apr 2021
1
2
Next