ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.11089
  4. Cited By
Rewriting History with Inverse RL: Hindsight Inference for Policy
  Improvement

Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement

25 February 2020
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
    OffRL
ArXivPDFHTML

Papers citing "Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement"

50 / 57 papers shown
Title
Learning to chain-of-thought with Jensen's evidence lower bound
Learning to chain-of-thought with Jensen's evidence lower bound
Yunhao Tang
Sid Wang
Rémi Munos
BDL
OffRL
LRM
52
0
0
25 Mar 2025
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
71
1
0
22 Aug 2024
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive
  Data Sharing
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing
Xinbo Zhao
Yingxue Zhang
Xin Zhang
Yu Yang
Yiqun Xie
Yanhua Li
Jun-Jie Luo
OffRL
32
2
0
20 Jun 2024
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline
  Reinforcement Learning
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
31
9
0
30 Apr 2024
The Virtues of Pessimism in Inverse Reinforcement Learning
David Wu
Gokul Swamy
J. Andrew Bagnell
Zhiwei Steven Wu
Sanjiban Choudhury
33
0
0
04 Feb 2024
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
33
10
0
15 Oct 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with
  Expert Guidance
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
24
8
0
04 Sep 2023
Design from Policies: Conservative Test-Time Adaptation for Offline
  Policy Optimization
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Waypoint Transformer: Reinforcement Learning via Supervised Learning
  with Intermediate Targets
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets
Anirudhan Badrinath
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
22
16
0
24 Jun 2023
What is Essential for Unseen Goal Generalization of Offline
  Goal-conditioned RL?
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
21
22
0
30 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal
  Approach
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach
Yudi Zhang
Yali Du
Biwei Huang
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
16
17
0
28 May 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
Graph Decision Transformer
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya-Qin Zhang
Dacheng Tao
OffRL
28
15
0
07 Mar 2023
On Transforming Reinforcement Learning by Transformer: The Development
  Trajectory
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
23
24
0
29 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Qiyang Li
Yuexiang Zhai
Yi-An Ma
Sergey Levine
34
14
0
24 Dec 2022
Offline Reinforcement Learning for Visual Navigation
Offline Reinforcement Learning for Visual Navigation
Dhruv Shah
Arjun Bhorkar
Hrish Leen
Ilya Kostrikov
Nicholas Rhinehart
Sergey Levine
OffRL
14
29
0
16 Dec 2022
A System for Morphology-Task Generalization via Unified Representation
  and Behavior Distillation
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation
Hiroki Furuta
Yusuke Iwasawa
Yutaka Matsuo
S. Gu
18
14
0
25 Nov 2022
Generalization with Lossy Affordances: Leveraging Broad Offline Data for
  Learning Visuomotor Tasks
Generalization with Lossy Affordances: Leveraging Broad Offline Data for Learning Visuomotor Tasks
Kuan Fang
Patrick Yin
Ashvin Nair
Homer Walke
Ge Yan
Sergey Levine
OffRL
28
22
0
12 Oct 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization
  Perspective
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective
Lunjun Zhang
Bradly C. Stadie
18
1
0
26 Sep 2022
Learning Multi-Task Transferable Rewards via Variational Inverse
  Reinforcement Learning
Learning Multi-Task Transferable Rewards via Variational Inverse Reinforcement Learning
Se-Wook Yoo
Seung-Woo Seo
DRL
14
5
0
19 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
25
138
0
15 Jun 2022
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal
  Reinforcement Learning
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning
Nicolas Castanet
Sylvain Lamprier
Olivier Sigaud
17
2
0
14 Jun 2022
Imitating Past Successes can be Very Suboptimal
Imitating Past Successes can be Very Suboptimal
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
29
16
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via
  $f$-Advantage Regression
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via fff-Advantage Regression
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
15
51
0
07 Jun 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in
  Latent Space
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent Space
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
50
29
0
17 May 2022
Modeling Human Behavior Part I -- Learning and Belief Approaches
Modeling Human Behavior Part I -- Learning and Belief Approaches
Andrew Fuchs
A. Passarella
M. Conti
29
7
0
13 May 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Philippe Hansen-Estruch
Amy Zhang
Ashvin Nair
Patrick Yin
Sergey Levine
AI4CE
23
27
0
27 Apr 2022
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Charles Burton Snell
Mengjiao Yang
Justin Fu
Yi Su
Sergey Levine
19
22
0
18 Apr 2022
Automating Reinforcement Learning with Example-based Resets
Automating Reinforcement Learning with Example-based Resets
Jigang Kim
Jaehyeon Park
Daesol Cho
H. J. Kim
CLL
OnRL
19
14
0
05 Apr 2022
One After Another: Learning Incremental Skills for a Changing World
One After Another: Learning Incremental Skills for a Changing World
Nur Muhammad (Mahi) Shafiullah
Lerrel Pinto
CLL
11
13
0
21 Mar 2022
Switch Trajectory Transformer with Distributional Value Approximation
  for Multi-Task Reinforcement Learning
Switch Trajectory Transformer with Distributional Value Approximation for Multi-Task Reinforcement Learning
Qinjie Lin
Han Liu
B. Sengupta
OffRL
24
11
0
14 Mar 2022
Rethinking Goal-conditioned Supervised Learning and Its Connection to
  Offline RL
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
OffRL
38
65
0
09 Feb 2022
How to Leverage Unlabeled Data in Offline Reinforcement Learning
How to Leverage Unlabeled Data in Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
27
61
0
03 Feb 2022
Don't Change the Algorithm, Change the Data: Exploratory Data for
  Offline Reinforcement Learning
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Denis Yarats
David Brandfonbrener
Hao Liu
Michael Laskin
Pieter Abbeel
A. Lazaric
Lerrel Pinto
OffRL
OnRL
19
84
0
31 Jan 2022
The Challenges of Exploration for Offline Reinforcement Learning
The Challenges of Exploration for Offline Reinforcement Learning
Nathan Lambert
Markus Wulfmeier
William F. Whitney
Arunkumar Byravan
Michael Bloesch
Vibhavari Dasagi
Tim Hertweck
Martin Riedmiller
OffRL
26
27
0
27 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Minghuan Liu
Menghui Zhu
Weinan Zhang
24
131
0
20 Jan 2022
STIR$^2$: Reward Relabelling for combined Reinforcement and Imitation
  Learning on sparse-reward tasks
STIR2^22: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasks
Jesús Bujalance Martín
Fabien Moutarde
OffRL
25
2
0
11 Jan 2022
RvS: What is Essential for Offline RL via Supervised Learning?
RvS: What is Essential for Offline RL via Supervised Learning?
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
23
170
0
20 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
18
18
0
02 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous
  manipulation
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
16
18
0
01 Dec 2021
Generalized Decision Transformer for Offline Hindsight Information
  Matching
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
11
99
0
19 Nov 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
T. Luu
Chang-Dong Yoo
10
8
0
28 Oct 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward
  Relabeling
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
17
5
0
27 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for
  Visual Reinforcement Learning
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning
Jinghuan Shang
Kumara Kahatapitiya
Xiang Li
Michael S. Ryoo
OffRL
35
33
0
12 Oct 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
21
7
0
18 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
24
78
0
16 Sep 2021
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy
  Learning
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
Yuexiang Zhai
Christina Baek
Zhengyuan Zhou
Jiantao Jiao
Yi-An Ma
19
22
0
08 Jul 2021
MHER: Model-based Hindsight Experience Replay
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
14
17
0
01 Jul 2021
DisCo RL: Distribution-Conditioned Reinforcement Learning for
  General-Purpose Policies
DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies
Soroush Nasiriany
Vitchyr H. Pong
Ashvin Nair
Alexander Khazatsky
Glen Berseth
Sergey Levine
OffRL
58
14
0
23 Apr 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
42
270
0
16 Apr 2021
12
Next