Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.00379
Cited By
Recall Traces: Backtracking Models for Efficient Reinforcement Learning
2 April 2018
Anirudh Goyal
Philemon Brakel
W. Fedus
Soumye Singhal
Timothy Lillicrap
Sergey Levine
Hugo Larochelle
Yoshua Bengio
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Recall Traces: Backtracking Models for Efficient Reinforcement Learning"
14 / 14 papers shown
Title
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
89
6
0
03 Jun 2024
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
31
1
0
08 Dec 2023
Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts
Yuxin Pan
Fangzhen Lin
OffRL
17
3
0
04 Aug 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
44
101
0
19 Jun 2022
Retrieval-Augmented Reinforcement Learning
Anirudh Goyal
A. Friesen
Andrea Banino
T. Weber
Nan Rosemary Ke
...
Michal Valko
Simon Osindero
Timothy Lillicrap
N. Heess
Charles Blundell
OffRL
32
53
0
17 Feb 2022
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning
Tao Yu
Cuiling Lan
Wenjun Zeng
Mingxiao Feng
Zhizheng Zhang
Zhibo Chen
OffRL
20
46
0
08 Jun 2021
Solving Sokoban with forward-backward reinforcement learning
Yaron Shoham
G. Elidan
OffRL
32
6
0
05 May 2021
Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation
Chaochao Lu
Erdun Gao
Ke Wang
José Miguel Hernández-Lobato
Anton van den Hengel
Bernhard Schölkopf
CML
OOD
OffRL
13
56
0
16 Dec 2020
Empirical Policy Evaluation with Supergraphs
Daniel Vial
V. Subramanian
OffRL
13
0
0
18 Feb 2020
Learning the Arrow of Time
Nasim Rahaman
Steffen Wolf
Anirudh Goyal
Roman Remme
Yoshua Bengio
8
5
0
02 Jul 2019
Exploration via Hindsight Goal Generation
Zhizhou Ren
Kefan Dong
Yuanshuo Zhou
Qiang Liu
Jian-wei Peng
27
85
0
10 Jun 2019
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
22
53
0
03 Nov 2018
RUDDER: Return Decomposition for Delayed Rewards
Jose A. Arjona-Medina
Michael Gillhofer
Michael Widrich
Thomas Unterthiner
Johannes Brandstetter
Sepp Hochreiter
24
212
0
20 Jun 2018
Imitating Latent Policies from Observation
Ashley D. Edwards
Himanshu Sahni
Yannick Schroecker
Charles Isbell
29
137
0
21 May 2018
1