Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.05929
Cited By
Data-Efficient Reinforcement Learning with Self-Predictive Representations
12 July 2020
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Data-Efficient Reinforcement Learning with Self-Predictive Representations"
50 / 76 papers shown
Title
seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models
Hafez Ghaemi
Eilif Muller
Shahab Bakhtiari
49
0
0
06 May 2025
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
Samuel Garcin
Trevor A. McInroe
P. S. Castro
Prakash Panangaden
Christopher G. Lucas
David Abel
Stefano V. Albrecht
51
0
0
08 Mar 2025
Multi-Task Reinforcement Learning Enables Parameter Scaling
Reginald McLean
Evangelos Chataroulas
Jordan Terry
Isaac Woungang
Nariman Farsad
P. S. Castro
LRM
44
0
0
07 Mar 2025
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
94
0
0
14 Dec 2024
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
Jianda Chen
Wen Zheng Terence Ng
Zichen Chen
Sinno Jialin Pan
Tianwei Zhang
OffRL
35
0
0
09 Nov 2024
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
34
0
0
22 Oct 2024
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient
Wenlong Wang
Ivana Dusparic
Yucheng Shi
Ke Zhang
V. Cahill
Mamba
131
0
0
11 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
39
1
0
11 Oct 2024
Next state prediction gives rise to entangled, yet compositional representations of objects
Tankred Saanum
Luca M. Schulze Buschoff
Peter Dayan
Eric Schulz
OCL
CoGe
OOD
30
1
0
07 Oct 2024
DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control
Zichen Jeff Cui
Hengkai Pan
Aadhithya Iyer
Siddhant Haldar
Lerrel Pinto
VGen
26
10
0
18 Sep 2024
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations
Yupei Yang
Biwei Huang
Fan Feng
Xinyue Wang
Shikui Tu
Lei Xu
CML
OOD
TTA
38
1
0
30 Jul 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
38
7
0
24 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
41
1
0
15 Jun 2024
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
65
75
0
27 May 2024
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
M. Ostaszewski
Krzysztof Jankowski
Piotr Milo's
Marek Cygan
OffRL
37
16
0
25 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
28
8
0
24 May 2024
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
37
1
0
20 May 2024
Feasibility Consistent Representation Learning for Safe Reinforcement Learning
Zhepeng Cen
Yi-Fan Yao
Zuxin Liu
Ding Zhao
OffRL
34
3
0
20 May 2024
The Curse of Diversity in Ensemble-Based Exploration
Zhixuan Lin
P. DÓro
Evgenii Nikishin
Aaron C. Courville
40
1
0
07 May 2024
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Yingru Li
Jiawei Xu
Lei Han
Zhi-Quan Luo
BDL
OffRL
18
6
0
05 Feb 2024
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
17
20
0
17 Jan 2024
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Juho Kannala
J. Pajarinen
OffRL
30
12
0
15 Jun 2023
VIBR: Learning View-Invariant Value Functions for Robust Visual Control
Tom Dupuis
Jaonary Rabarisoa
Q. C. Pham
David Filliat
34
0
0
14 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Aaron C. Courville
Marc G. Bellemare
Rishabh Agarwal
P. S. Castro
OffRL
43
82
0
30 May 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
28
24
0
29 May 2023
Towards a Better Understanding of Representation Dynamics under TD-learning
Yunhao Tang
Rémi Munos
OffRL
21
1
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
26
9
0
29 May 2023
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
Guozheng Ma
Linrui Zhang
Haoyu Wang
Lu Li
Zilin Wang
Zhen Wang
Li Shen
Xueqian Wang
Dacheng Tao
42
10
0
25 May 2023
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Nicolai Dorka
Tim Welschehold
Wolfram Burgard
16
3
0
17 Mar 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Xin Liu
Yaran Chen
Haoran Li
Boyu Li
Dong Zhao
SSL
60
10
0
11 Feb 2023
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
36
6
0
08 Feb 2023
On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline
Nicklas Hansen
Zhecheng Yuan
Yanjie Ze
Tongzhou Mu
Aravind Rajeswaran
H. Su
Huazhe Xu
Xiaolong Wang
32
65
0
12 Dec 2022
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu
Hao Liu
Aditya Grover
Pieter Abbeel
OffRL
42
45
0
23 Nov 2022
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
30
4
0
12 Nov 2022
Disentangled (Un)Controllable Features
Jacob E. Kooi
Mark Hoogendoorn
Vincent François-Lavet
DRL
19
0
0
31 Oct 2022
Learning on the Job: Self-Rewarding Offline-to-Online Finetuning for Industrial Insertion of Novel Connectors from Vision
Ashvin Nair
Brian Zhu
Gokul Narayanan
Eugen Solowjow
Sergey Levine
OffRL
OnRL
23
14
0
27 Oct 2022
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
14
0
0
24 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
27
8
0
21 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Z. Tu
OffRL
23
15
0
19 Oct 2022
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
23
21
0
18 Oct 2022
A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning
Guozheng Ma
Zhen Wang
Zhecheng Yuan
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
30
26
0
10 Oct 2022
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning
Daesol Cho
D. Shim
H. J. Kim
OffRL
42
11
0
30 Sep 2022
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning
Manuel Goulão
Arlindo L. Oliveira
ViT
33
6
0
22 Sep 2022
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous Driving
Haochen Liu
Zhiyu Huang
Xiaoyu Mo
Chen Lv
ViT
OffRL
25
33
0
24 Aug 2022
Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Zijian Gao
Yiying Li
Kele Xu
Yuanzhao Zhai
Dawei Feng
Bo Ding
Xinjun Mao
Huaimin Wang
25
0
0
24 Aug 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Shuang Qiu
Lingxiao Wang
Chenjia Bai
Zhuoran Yang
Zhaoran Wang
SSL
OffRL
26
32
0
29 Jul 2022
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models
Alex Lamb
Riashat Islam
Yonathan Efroni
Aniket Didolkar
Dipendra Kumar Misra
Dylan J. Foster
Lekan Molu
Rajan Chari
A. Krishnamurthy
John Langford
41
24
0
17 Jul 2022
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels
Edoardo Cetin
Philip J. Ball
Steve Roberts
Oya Celiktutan
30
36
0
03 Jul 2022
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
79
145
0
28 Jun 2022
BYOL-Explore: Exploration by Bootstrapped Prediction
Z. Guo
S. Thakoor
Miruna Pislar
Bernardo Avila-Pires
Florent Altché
...
Yunhao Tang
Michal Valko
Rémi Munos
M. G. Azar
Bilal Piot
22
67
0
16 Jun 2022
1
2
Next