ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.11089
  4. Cited By
Rewriting History with Inverse RL: Hindsight Inference for Policy
  Improvement

Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement

Neural Information Processing Systems (NeurIPS), 2020
25 February 2020
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement"

50 / 57 papers shown
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Consistent Zero-Shot Imitation with Contrastive Goal Inference
Kathryn Wantlin
Chongyi Zheng
Benjamin Eysenbach
217
1
0
20 Oct 2025
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
584
1
0
22 Aug 2024
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive
  Data Sharing
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing
Xinbo Zhao
Yingxue Zhang
Xin Zhang
Yu Yang
Yiqun Xie
Yanhua Li
Jun Luo
OffRL
221
5
0
20 Jun 2024
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline
  Reinforcement Learning
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
316
12
0
30 Apr 2024
The Virtues of Pessimism in Inverse Reinforcement Learning
David Wu
Gokul Swamy
J. Andrew Bagnell
Zhiwei Steven Wu
Sanjiban Choudhury
355
0
0
04 Feb 2024
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRLLM&Ro
411
44
0
15 Oct 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with
  Expert Guidance
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert GuidanceIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRLOnRL
268
18
0
04 Sep 2023
Design from Policies: Conservative Test-Time Adaptation for Offline
  Policy Optimization
Design from Policies: Conservative Test-Time Adaptation for Offline Policy OptimizationNeural Information Processing Systems (NeurIPS), 2023
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Xuetao Zhang
Bin Wang
OffRL
481
13
0
26 Jun 2023
Waypoint Transformer: Reinforcement Learning via Supervised Learning
  with Intermediate Targets
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate TargetsNeural Information Processing Systems (NeurIPS), 2023
Anirudhan Badrinath
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
371
25
0
24 Jun 2023
What is Essential for Unseen Goal Generalization of Offline
  Goal-conditioned RL?
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?International Conference on Machine Learning (ICML), 2023
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
271
33
0
30 May 2023
Interpretable Reward Redistribution in Reinforcement Learning: A Causal
  Approach
Interpretable Reward Redistribution in Reinforcement Learning: A Causal ApproachNeural Information Processing Systems (NeurIPS), 2023
Yudi Zhang
Yali Du
Erdun Gao
Ziyan Wang
Jun Wang
Meng Fang
Mykola Pechenizkiy
CML
308
29
0
28 May 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Distance Weighted Supervised Learning for Offline Interaction DataInternational Conference on Machine Learning (ICML), 2023
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
351
19
0
26 Apr 2023
Graph Decision Transformer
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya Zhang
Dacheng Tao
OffRL
308
22
0
07 Mar 2023
On Transforming Reinforcement Learning by Transformer: The Development
  Trajectory
On Transforming Reinforcement Learning by Transformer: The Development TrajectoryIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
383
74
0
29 Dec 2022
Understanding the Complexity Gains of Single-Task RL with a Curriculum
Understanding the Complexity Gains of Single-Task RL with a CurriculumInternational Conference on Machine Learning (ICML), 2022
Qiyang Li
Yuexiang Zhai
Yi-An Ma
Sergey Levine
444
20
0
24 Dec 2022
Offline Reinforcement Learning for Visual Navigation
Offline Reinforcement Learning for Visual NavigationConference on Robot Learning (CoRL), 2022
Dhruv Shah
Arjun Bhorkar
Hrish Leen
Ilya Kostrikov
Nicholas Rhinehart
Sergey Levine
OffRL
261
41
0
16 Dec 2022
A System for Morphology-Task Generalization via Unified Representation
  and Behavior Distillation
A System for Morphology-Task Generalization via Unified Representation and Behavior DistillationInternational Conference on Learning Representations (ICLR), 2022
Hiroki Furuta
Yusuke Iwasawa
Yutaka Matsuo
S. Gu
328
22
0
25 Nov 2022
Generalization with Lossy Affordances: Leveraging Broad Offline Data for
  Learning Visuomotor Tasks
Generalization with Lossy Affordances: Leveraging Broad Offline Data for Learning Visuomotor TasksConference on Robot Learning (CoRL), 2022
Kuan Fang
Patrick Yin
Ashvin Nair
Homer Walke
Ge Yan
Sergey Levine
OffRL
345
32
0
12 Oct 2022
Understanding Hindsight Goal Relabeling from a Divergence Minimization
  Perspective
Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective
Lunjun Zhang
Bradly C. Stadie
263
1
0
26 Sep 2022
Learning Multi-Task Transferable Rewards via Variational Inverse
  Reinforcement Learning
Learning Multi-Task Transferable Rewards via Variational Inverse Reinforcement LearningIEEE International Conference on Robotics and Automation (ICRA), 2022
Se-Wook Yoo
Seung-Woo Seo
DRL
157
7
0
19 Jun 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Contrastive Learning as Goal-Conditioned Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSLOffRL
513
239
0
15 Jun 2022
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal
  Reinforcement Learning
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Nicolas Castanet
Sylvain Lamprier
Olivier Sigaud
328
5
0
14 Jun 2022
Imitating Past Successes can be Very Suboptimal
Imitating Past Successes can be Very SuboptimalNeural Information Processing Systems (NeurIPS), 2022
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
320
25
0
07 Jun 2022
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via
  $f$-Advantage Regression
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via fff-Advantage RegressionNeural Information Processing Systems (NeurIPS), 2022
Yecheng Jason Ma
Jason Yan
Dinesh Jayaraman
Osbert Bastani
OffRL
330
78
0
07 Jun 2022
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in
  Latent Space
Planning to Practice: Efficient Online Fine-Tuning by Composing Goals in Latent SpaceIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2022
Kuan Fang
Patrick Yin
Ashvin Nair
Sergey Levine
OffRL
278
42
0
17 May 2022
Modeling Human Behavior Part I -- Learning and Belief Approaches
Modeling Human Behavior Part I -- Learning and Belief Approaches
Andrew Fuchs
A. Passarella
M. Conti
276
8
0
13 May 2022
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning
Bisimulation Makes Analogies in Goal-Conditioned Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Philippe Hansen-Estruch
Amy Zhang
Ashvin Nair
Patrick Yin
Sergey Levine
AI4CE
358
38
0
27 Apr 2022
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Charles Burton Snell
Mengjiao Yang
Justin Fu
Yi Su
Sergey Levine
289
30
0
18 Apr 2022
Automating Reinforcement Learning with Example-based Resets
Automating Reinforcement Learning with Example-based ResetsIEEE Robotics and Automation Letters (RA-L), 2022
Jigang Kim
Jaehyeon Park
Daesol Cho
H. J. Kim
CLLOnRL
331
17
0
05 Apr 2022
One After Another: Learning Incremental Skills for a Changing World
One After Another: Learning Incremental Skills for a Changing WorldInternational Conference on Learning Representations (ICLR), 2022
Nur Muhammad (Mahi) Shafiullah
Lerrel Pinto
CLL
359
15
0
21 Mar 2022
Switch Trajectory Transformer with Distributional Value Approximation
  for Multi-Task Reinforcement Learning
Switch Trajectory Transformer with Distributional Value Approximation for Multi-Task Reinforcement Learning
Qinjie Lin
Han Liu
B. Sengupta
OffRL
168
12
0
14 Mar 2022
Rethinking Goal-conditioned Supervised Learning and Its Connection to
  Offline RL
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RLInternational Conference on Learning Representations (ICLR), 2022
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
OffRL
388
96
0
09 Feb 2022
How to Leverage Unlabeled Data in Offline Reinforcement Learning
How to Leverage Unlabeled Data in Offline Reinforcement LearningInternational Conference on Machine Learning (ICML), 2022
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
575
77
0
03 Feb 2022
Don't Change the Algorithm, Change the Data: Exploratory Data for
  Offline Reinforcement Learning
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Denis Yarats
David Brandfonbrener
Hao Liu
Michael Laskin
Pieter Abbeel
A. Lazaric
Lerrel Pinto
OffRLOnRL
262
113
0
31 Jan 2022
The Challenges of Exploration for Offline Reinforcement Learning
The Challenges of Exploration for Offline Reinforcement Learning
Nathan Lambert
Markus Wulfmeier
William F. Whitney
Arunkumar Byravan
Michael Bloesch
Vibhavari Dasagi
Tim Hertweck
Martin Riedmiller
OffRL
233
33
0
27 Jan 2022
Goal-Conditioned Reinforcement Learning: Problems and Solutions
Goal-Conditioned Reinforcement Learning: Problems and SolutionsInternational Joint Conference on Artificial Intelligence (IJCAI), 2022
Minghuan Liu
Menghui Zhu
Weinan Zhang
409
199
0
20 Jan 2022
STIR$^2$: Reward Relabelling for combined Reinforcement and Imitation
  Learning on sparse-reward tasks
STIR2^22: Reward Relabelling for combined Reinforcement and Imitation Learning on sparse-reward tasksAdaptive Agents and Multi-Agent Systems (AAMAS), 2022
Jesús Bujalance Martín
Fabien Moutarde
OffRL
220
2
0
11 Jan 2022
RvS: What is Essential for Offline RL via Supervised Learning?
RvS: What is Essential for Offline RL via Supervised Learning?International Conference on Learning Representations (ICLR), 2021
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
404
222
0
20 Dec 2021
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL
Charles Packer
Pieter Abbeel
Joseph E. Gonzalez
OffRL
300
21
0
02 Dec 2021
Wish you were here: Hindsight Goal Selection for long-horizon dexterous
  manipulation
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
Todor Davchev
Oleg O. Sushkov
Jean-Baptiste Regli
S. Schaal
Y. Aytar
Markus Wulfmeier
Jonathan Scholz
243
19
0
01 Dec 2021
Generalized Decision Transformer for Offline Hindsight Information
  Matching
Generalized Decision Transformer for Offline Hindsight Information MatchingInternational Conference on Learning Representations (ICLR), 2021
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
362
124
0
19 Nov 2021
Hindsight Goal Ranking on Replay Buffer for Sparse Reward Environment
Hindsight Goal Ranking on Replay Buffer for Sparse Reward EnvironmentIEEE Access (IEEE Access), 2021
Tung M. Luu
Chang D. Yoo
182
13
0
28 Oct 2021
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward
  Relabeling
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
Jesús Bujalance Martín
Raphael Chekroun
Fabien Moutarde
OffRL
217
7
0
27 Oct 2021
StARformer: Transformer with State-Action-Reward Representations for
  Visual Reinforcement Learning
StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement LearningEuropean Conference on Computer Vision (ECCV), 2021
Jinghuan Shang
Kumara Kahatapitiya
Xiang Li
Michael S. Ryoo
OffRL
476
42
0
12 Oct 2021
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
Michael Wan
Jian-wei Peng
Tanmay Gangwani
242
7
0
18 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
244
87
0
16 Sep 2021
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy
  Learning
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy LearningJournal of Artificial Intelligence Research (JAIR), 2021
Yuexiang Zhai
Christina Baek
Zhengyuan Zhou
Jiantao Jiao
Yi-An Ma
499
28
0
08 Jul 2021
MHER: Model-based Hindsight Experience Replay
MHER: Model-based Hindsight Experience Replay
Rui Yang
Meng Fang
Lei Han
Yali Du
Feng Luo
Xiu Li
OffRL
292
23
0
01 Jul 2021
DisCo RL: Distribution-Conditioned Reinforcement Learning for
  General-Purpose Policies
DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose PoliciesIEEE International Conference on Robotics and Automation (ICRA), 2021
Soroush Nasiriany
Vitchyr H. Pong
Ashvin Nair
Alexander Khazatsky
Glen Berseth
Sergey Levine
OffRL
343
15
0
23 Apr 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
520
321
0
16 Apr 2021
12
Next
Page 1 of 2