ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.00210
  4. Cited By
Mastering Atari Games with Limited Data

Mastering Atari Games with Limited Data

30 October 2021
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
    VLM
ArXivPDFHTML

Papers citing "Mastering Atari Games with Limited Data"

50 / 159 papers shown
Title
MA2CL:Masked Attentive Contrastive Learning for Multi-Agent
  Reinforcement Learning
MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning
Haolin Song
Ming Feng
Wen-gang Zhou
Houqiang Li
OffRL
17
5
0
03 Jun 2023
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive
  Control
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Rohan Chitnis
Yingchen Xu
B. Hashemi
Lucas Lehnert
Ürün Dogan
Zheqing Zhu
Olivier Delalleau
OffRL
13
9
0
01 Jun 2023
What model does MuZero learn?
What model does MuZero learn?
Jinke He
Thomas M. Moerland
F. Oliehoek
20
4
0
01 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Aaron C. Courville
Marc G. Bellemare
Rishabh Agarwal
P. S. Castro
OffRL
43
82
0
30 May 2023
Pre-training Contextualized World Models with In-the-wild Videos for
  Reinforcement Learning
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
19
24
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
26
9
0
29 May 2023
Reinforcement Learning with Partial Parametric Model Knowledge
Reinforcement Learning with Partial Parametric Model Knowledge
Shuyuan Wang
Philip D. Loewen
Nathan P. Lawrence
M. Forbes
R. Bhushan Gopaluni
KELM
11
0
0
26 Apr 2023
A Cookbook of Self-Supervised Learning
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa
FedML
SSL
31
272
0
24 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
6
0
0
14 Apr 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A
  Survey
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
30
1
0
23 Mar 2023
Transformer Models for Type Inference in the Simply Typed Lambda
  Calculus: A Case Study in Deep Learning for Code
Transformer Models for Type Inference in the Simply Typed Lambda Calculus: A Case Study in Deep Learning for Code
Brando Miranda
Avraham Shinnar
V. Pestun
B. Trager
14
3
0
15 Mar 2023
Transformer-based World Models Are Happy With 100k Interactions
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
8
68
0
13 Mar 2023
Real-time scheduling of renewable power systems through planning-based
  reinforcement learning
Real-time scheduling of renewable power systems through planning-based reinforcement learning
Shao-Wei Liu
Jinbo Liu
Weirui Ye
Nan Yang
Guanglu Zhang
...
C. Kang
Qirong Jiang
Xuri Song
Fangchun Di
Yang Gao
33
4
0
09 Mar 2023
Policy-Induced Self-Supervision Improves Representation Finetuning in
  Visual RL
Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL
Sébastien M. R. Arnold
Fei Sha
SSL
13
0
0
12 Feb 2023
Investigating the role of model-based learning in exploration and
  transfer
Investigating the role of model-based learning in exploration and transfer
Jacob Walker
Eszter Vértes
Yazhe Li
Gabriel Dulac-Arnold
Ankesh Anand
T. Weber
Jessica B. Hamrick
OffRL
26
6
0
08 Feb 2023
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree
  Search
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Gal Dalal
Assaf Hallak
Gugan Thoppe
Shie Mannor
Gal Chechik
17
3
0
30 Jan 2023
Hierarchical Imitation Learning with Vector Quantized Models
Hierarchical Imitation Learning with Vector Quantized Models
Kalle Kujanpää
J. Pajarinen
Alexander Ilin
9
12
0
30 Jan 2023
Mastering Diverse Domains through World Models
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
19
536
0
10 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development
  Trajectory
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
23
24
0
29 Dec 2022
Applying Deep Reinforcement Learning to the HP Model for Protein
  Structure Prediction
Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction
Kaiyuan Yang
Houjing Huang
Olafs Vandans
A. Murali
Fujia Tian
R. Yap
Liang Dai
9
10
0
27 Nov 2022
Multi-Environment Pretraining Enables Transfer to Action Limited
  Datasets
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
David Venuto
Sherry Yang
Pieter Abbeel
Doina Precup
Igor Mordatch
Ofir Nachum
OffRL
20
5
0
23 Nov 2022
Actively Learning Costly Reward Functions for Reinforcement Learning
Actively Learning Costly Reward Functions for Reinforcement Learning
André Eberhard
Houssam Metni
G. Fahland
A. Stroh
Pascal Friederich
OffRL
19
0
0
23 Nov 2022
Rewards Encoding Environment Dynamics Improves Preference-based
  Reinforcement Learning
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
25
4
0
12 Nov 2022
The Benefits of Model-Based Generalization in Reinforcement Learning
The Benefits of Model-Based Generalization in Reinforcement Learning
K. Young
Aditya A. Ramesh
Louis Kirsch
Jürgen Schmidhuber
OffRL
20
12
0
04 Nov 2022
Will we run out of data? Limits of LLM scaling based on human-generated
  data
Will we run out of data? Limits of LLM scaling based on human-generated data
Pablo Villalobos
A. Ho
J. Sevilla
T. Besiroglu
Lennart Heim
Marius Hobbhahn
ALM
28
106
0
26 Oct 2022
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions
Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions
Weirui Ye
Pieter Abbeel
Yang Gao
38
5
0
23 Oct 2022
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Palm up: Playing in the Latent Manifold for Unsupervised Pretraining
Hao Liu
Tom Zahavy
Volodymyr Mnih
Satinder Singh
SSL
25
7
0
19 Oct 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement
  Learning
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Yifan Xu
Nicklas Hansen
Zirui Wang
Yung-Chieh Chan
H. Su
Z. Tu
OffRL
18
15
0
19 Oct 2022
Transformers Learn Shortcuts to Automata
Transformers Learn Shortcuts to Automata
Bingbin Liu
Jordan T. Ash
Surbhi Goel
A. Krishnamurthy
Cyril Zhang
OffRL
LRM
19
155
0
19 Oct 2022
Planning for Sample Efficient Imitation Learning
Planning for Sample Efficient Imitation Learning
Zhao-Heng Yin
Weirui Ye
Qifeng Chen
Yang Gao
OffRL
18
21
0
18 Oct 2022
Bridging the Gap between Artificial Intelligence and Artificial General
  Intelligence: A Ten Commandment Framework for Human-Like Intelligence
Bridging the Gap between Artificial Intelligence and Artificial General Intelligence: A Ten Commandment Framework for Human-Like Intelligence
Ananta Nair
F. Kashani
26
2
0
17 Oct 2022
Visual Reinforcement Learning with Self-Supervised 3D Representations
Visual Reinforcement Learning with Self-Supervised 3D Representations
Yanjie Ze
Nicklas Hansen
Yinbo Chen
Mohit Jain
Xiaolong Wang
SSL
22
49
0
13 Oct 2022
Continuous Monte Carlo Graph Search
Continuous Monte Carlo Graph Search
Kalle Kujanpää
Amin Babadi
Yi Zhao
Juho Kannala
Alexander Ilin
J. Pajarinen
LRM
57
2
0
04 Oct 2022
Mastering Spatial Graph Prediction of Road Networks
Mastering Spatial Graph Prediction of Road Networks
Sotiris Anagnostidis
Aurélien Lucchi
Thomas Hofmann
GNN
19
1
0
03 Oct 2022
Human-level Atari 200x faster
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
44
28
0
15 Sep 2022
Concept-modulated model-based offline reinforcement learning for rapid
  generalization
Concept-modulated model-based offline reinforcement learning for rapid generalization
Nicholas A. Ketz
Praveen K. Pilly
OffRL
6
1
0
07 Sep 2022
Transformers are Sample-Efficient World Models
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
Franccois Fleuret
VLM
OffRL
11
157
0
01 Sep 2022
Light-weight probing of unsupervised representations for Reinforcement
  Learning
Light-weight probing of unsupervised representations for Reinforcement Learning
Wancong Zhang
Anthony GX-Chen
Vlad Sobal
Yann LeCun
Nicolas Carion
SSL
OffRL
33
13
0
25 Aug 2022
A model-based approach to meta-Reinforcement Learning: Transformers and
  tree search
A model-based approach to meta-Reinforcement Learning: Transformers and tree search
Brieuc Pinon
Jean-Charles Delvenne
Raphaël Jungers
OffRL
11
3
0
24 Aug 2022
Efficient Planning in a Compact Latent Action Space
Efficient Planning in a Compact Latent Action Space
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktaschel
Edward Grefenstette
Yuandong Tian
OffRL
11
36
0
22 Aug 2022
Towards Situation Awareness and Attention Guidance in a Multiplayer
  Environment using Augmented Reality and Carcassonne
Towards Situation Awareness and Attention Guidance in a Multiplayer Environment using Augmented Reality and Carcassonne
D. Kadish
Arezoo Sarkheyli-Hägele
J. Font
D. Niehorster
Thomas Pederson
11
2
0
18 Aug 2022
The Curse of Low Task Diversity: On the Failure of Transfer Learning to
  Outperform MAML and Their Empirical Equivalence
The Curse of Low Task Diversity: On the Failure of Transfer Learning to Outperform MAML and Their Empirical Equivalence
Brando Miranda
P. Yu
Yu-xiong Wang
Oluwasanmi Koyejo
21
10
0
02 Aug 2022
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step
  Inverse Models
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models
Alex Lamb
Riashat Islam
Yonathan Efroni
Aniket Didolkar
Dipendra Kumar Misra
Dylan J. Foster
Lekan Molu
Rajan Chari
A. Krishnamurthy
John Langford
31
24
0
17 Jul 2022
Masked World Models for Visual Control
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
77
145
0
28 Jun 2022
Value-Consistent Representation Learning for Data-Efficient
  Reinforcement Learning
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Yang Yue
Bingyi Kang
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
27
13
0
25 Jun 2022
Value Memory Graph: A Graph-Structured World Model for Offline
  Reinforcement Learning
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning
Deyao Zhu
Erran L. Li
Mohamed Elhoseiny
OffRL
27
8
0
09 Jun 2022
Towards biologically plausible Dreaming and Planning in recurrent
  spiking networks
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
C. Capone
P. Paolucci
CLL
15
7
0
20 May 2022
Adversarial Training for High-Stakes Reliability
Adversarial Training for High-Stakes Reliability
Daniel M. Ziegler
Seraphina Nix
Lawrence Chan
Tim Bauman
Peter Schmidt-Nielsen
...
Noa Nabeshima
Benjamin Weinstein-Raun
D. Haas
Buck Shlegeris
Nate Thomas
AAML
19
59
0
03 May 2022
Reward Reports for Reinforcement Learning
Reward Reports for Reinforcement Learning
T. Gilbert
Nathan Lambert
Sarah Dean
Tom Zick
Aaron J. Snoswell
27
33
0
22 Apr 2022
Temporal Alignment for History Representation in Reinforcement Learning
Temporal Alignment for History Representation in Reinforcement Learning
Aleksandr Ermolov
E. Sangineto
N. Sebe
AI4TS
11
2
0
07 Apr 2022
Previous
1234
Next