Mastering Atari Games with Limited Data

30 October 2021
Weirui Ye
Shao-Wei Liu
Thanard Kurutach
Pieter Abbeel
Yang Gao
    VLM

Papers citing "Mastering Atari Games with Limited Data"

50 / 159 papers shown
Title
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
57
0
0
04 May 2025
Rulebook: bringing co-routines to reinforcement learning environments
Massimo Fioravanti
Samuele Pasini
Giovanni Agosta
33
0
0
28 Apr 2025
Trust-Region Twisted Policy Improvement
Joery A. de Vries
Jinke He
Yaniv Oren
M. Spaan
OffRL
LRM
30
0
0
08 Apr 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
Xuguang Lan
32
0
0
05 Apr 2025
Bootstrapped Model Predictive Control
Yuhang Wang
Hanwei Guo
Sizhe Wang
Long Qian
Xuguang Lan
54
0
0
24 Mar 2025
Learning Transformer-based World Models with Contrastive Predictive Coding
Maxime Burchi
Radu Timofte
67
0
0
06 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
32
0
0
04 Mar 2025
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Baiting Luo
Ava Pettet
Aron Laszka
A. Dubey
Ayan Mukhopadhyay
OffRL
43
1
0
28 Feb 2025
Implicit Search via Discrete Diffusion: A Study on Chess
Jiacheng Ye
Zhenyu Wu
Jiahui Gao
Zhiyong Wu
Xin Jiang
Z. Li
Lingpeng Kong
DiffM
48
2
0
27 Feb 2025
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Jaehyeon Son
Soochan Lee
Gunhee Kim
OffRL
72
1
0
26 Feb 2025
OptionZero: Planning with Learned Options
Po-Wei Huang
Pei-Chiun Peng
Hung Guei
Ti-Rong Wu
51
0
0
23 Feb 2025
Spatial-aware decision-making with ring attractors in reinforcement learning systems
Marcos Negre Saura
Richard Allmendinger
Theodore Papamarkou
Wei Pan
104
0
0
17 Feb 2025
Video2Policy: Scaling up Manipulation Tasks in Simulation through Internet Videos
Weirui Ye
Fangchen Liu
Z. Ding
Yang Gao
Oleh Rybkin
Pieter Abbeel
VGen
OffRL
84
3
0
14 Feb 2025
DMWM: Dual-Mind World Model with Long-Term Imagination
Lingyi Wang
Rashed Shelim
Walid Saad
Naren Ramakrishnan
LRM
118
1
0
11 Feb 2025
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Ruihan Jin
Feihu Che
Zengqi Wen
J. Tao
LRM
68
8
0
04 Feb 2025
Towards General-Purpose Model-Free Reinforcement Learning
Scott Fujimoto
P. D'Oro
Amy Zhang
Yuandong Tian
Michael Rabbat
OffRL
34
3
0
28 Jan 2025
Evolution and The Knightian Blindspot of Machine Learning
Joel Lehman
Elliot Meyerson
Tarek El-Gaaly
Kenneth O. Stanley
Tarin Ziyaee
84
1
0
22 Jan 2025
GLAM: Global-Local Variation Awareness in Mamba-based World Model
Qian He
Wenqi Liang
Chunhui Hao
Gan Sun
Jiandong Tian
43
0
0
21 Jan 2025
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning
Siddharth Aravindan
Dixant Mittal
Wee Sun Lee
BDL
74
0
0
17 Jan 2025
ReZero: Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
Chunyu Xuan
Yazhe Niu
Yuan Pu
Shuai Hu
Yu Liu
Jing Yang
53
0
0
03 Jan 2025
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling
Junyi Li
Hwee Tou Ng
LRM
84
1
0
19 Dec 2024
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
Miles Hutson
Isaac Kauvar
Nick Haber
59
0
0
08 Dec 2024
Decision Transformer vs. Decision Mamba: Analysing the Complexity of Sequential Decision Making in Atari Games
Ke Yan
65
0
0
01 Dec 2024
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS
Jinyang Wu
Mingkuan Feng
Shuai Zhang
Feihu Che
Zengqi Wen
J. Tao
ReLM
LRM
102
9
0
27 Nov 2024
Interpreting the Learned Model in MuZero Planning
Hung Guei
Yan-Ru Ju
Wei-Yu Chen
Ti-Rong Wu
23
1
0
07 Nov 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
103
2
0
23 Oct 2024
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
34
0
0
22 Oct 2024
Reward-free World Models for Online Imitation Learning
Shangzhe Li
Zhiao Huang
H. Su
OffRL
63
1
0
17 Oct 2024
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
Jiayu Chen
Wentse Chen
Jeff Schneider
OffRL
29
1
0
15 Oct 2024
Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
Chenglin Li
Qianglong Chen
Zhi Li
Feng Tao
Yicheng Li
Hao Chen
Fei Yu
Yin Zhang
SyDa
31
0
0
14 Oct 2024
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm
Yooseok Lim
Inbeom Park
Sujee Lee
OffRL
18
0
0
24 Sep 2024
No Saved Kaleidosope: an 100% Jitted Neural Network Coding Language with Pythonic Syntax
Augusto Seben da Rosa
Marlon Daniel Angeli
Jorge Aikes Junior
Alef Iury Ferreira
L. Gris
Anderson da Silva Soares
Arnaldo Candido Junior
Frederico Santos de Oliveira
Gabriel Trevisan Damke
Rafael Teixeira Sousa
23
0
0
17 Sep 2024
Enhancing Reinforcement Learning Through Guided Search
Jérôme Arjonilla
Abdallah Saffidine
Tristan Cazenave
OffRL
56
0
0
19 Aug 2024
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations
Yupei Yang
Biwei Huang
Fan Feng
Xinyue Wang
Shikui Tu
Lei Xu
CML
OOD
TTA
36
1
0
30 Jul 2024
Generalizing soft actor-critic algorithms to discrete action spaces
Le Zhang
Yong Gu
Xin Zhao
Yanshuo Zhang
Shu Zhao
Yifei Jin
Xinxin Wu
26
0
0
08 Jul 2024
Combining AI Control Systems and Human Decision Support via Robustness and Criticality
Walt Woods
Alexander Grushin
Simon Khan
Alvaro Velasquez
22
1
0
03 Jul 2024
Efficient World Models with Context-Aware Tokenization
Vincent Micheli
Eloi Alonso
François Fleuret
OffRL
VLM
32
5
0
27 Jun 2024
SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning
Matthias Weissenbacher
Rishabh Agarwal
Yoshinobu Kawahara
OffRL
27
1
0
21 Jun 2024
CoDreamer: Communication-Based Decentralised World Models
Edan Toledo
Amanda Prorok
30
0
0
19 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
41
1
0
15 Jun 2024
iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
Aidan Scannell
Kalle Kujanpää
Yi Zhao
Mohammadreza Nakhaei
Arno Solin
J. Pajarinen
SSL
43
5
0
04 Jun 2024
Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning
Jiahang Cao
Qiang Zhang
Ziqing Wang
Jiaxu Wang
Hao Cheng
Yecheng Shao
Wen Zhao
Gang Han
Yijie Guo
Renjing Xu
Mamba
43
2
0
04 Jun 2024
Learning to Play Atari in a World of Tokens
Pranav Agarwal
Sheldon Andrews
Samira Ebrahimi Kahou
OffRL
23
0
0
03 Jun 2024
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Böhmer
OffRL
31
0
0
03 Jun 2024
Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction
Yunhyeok Kwak
Inwoo Hwang
Dooyoung Kim
Sanghack Lee
Byoung-Tak Zhang
28
0
0
02 Jun 2024
Hierarchical World Models as Visual Whole-Body Humanoid Controllers
Nicklas Hansen
Jyothir S V
Vlad Sobal
Yann LeCun
Xiaolong Wang
Hao Su
VGen
48
10
0
28 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
40
3
0
25 May 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Jiafei Lyu
Chenjia Bai
Jingwen Yang
Zongqing Lu
Xiu Li
26
8
0
24 May 2024
MuDreamer: Learning Predictive World Models without Reconstruction
Maxime Burchi
Radu Timofte
27
3
0
23 May 2024
Diffusion for World Modeling: Visual Details Matter in Atari
Eloi Alonso
Adam Jelley
Vincent Micheli
Anssi Kanervisto
Amos Storkey
Tim Pearce
François Fleuret
39
40
0
20 May 2024