ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.02193
  4. Cited By
Mastering Atari with Discrete World Models

Mastering Atari with Discrete World Models

5 October 2020
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
    DRL
ArXivPDFHTML

Papers citing "Mastering Atari with Discrete World Models"

50 / 165 papers shown
Title
Bridging State and History Representations: Understanding
  Self-Predictive RL
Bridging State and History Representations: Understanding Self-Predictive RL
Tianwei Ni
Benjamin Eysenbach
Erfan Seyedsalehi
Michel Ma
Clement Gehring
Aditya Mahajan
Pierre-Luc Bacon
AI4TS
AI4CE
17
20
0
17 Jan 2024
Memory, Space, and Planning: Multiscale Predictive Representations
Memory, Space, and Planning: Multiscale Predictive Representations
Ida Momennejad
23
2
0
16 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot
  Learning
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
20
9
0
06 Jan 2024
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
73
5
0
13 Dec 2023
Backward Learning for Goal-Conditioned Policies
Backward Learning for Goal-Conditioned Policies
Marc Höftmann
Jan Robine
Stefan Harmeling
29
1
0
08 Dec 2023
Digital Twin-Enhanced Deep Reinforcement Learning for Resource
  Management in Networks Slicing
Digital Twin-Enhanced Deep Reinforcement Learning for Resource Management in Networks Slicing
Zhengming Zhang
Yongming Huang
Cheng Zhang
Qingbi Zheng
Luxi Yang
Xiaohu You
21
12
0
28 Nov 2023
Provable Representation with Efficient Planning for Partial Observable
  Reinforcement Learning
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
Hongming Zhang
Tongzheng Ren
Chenjun Xiao
Dale Schuurmans
Bo Dai
45
3
0
20 Nov 2023
Interpretable Reinforcement Learning for Robotics and Continuous Control
Interpretable Reinforcement Learning for Robotics and Continuous Control
Rohan R. Paleja
Letian Chen
Yaru Niu
Andrew Silva
Zhaoxin Li
...
K. Chang
H. E. Tseng
Yan Wang
S. Nageshrao
Matthew C. Gombolay
26
7
0
16 Nov 2023
State-Wise Safe Reinforcement Learning With Pixel Observations
State-Wise Safe Reinforcement Learning With Pixel Observations
S. Zhan
Yixuan Wang
Qingyuan Wu
Ruochen Jiao
Chao Huang
Qi Zhu
38
10
0
03 Nov 2023
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via
  Discrete Diffusion
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion
Lunjun Zhang
Yuwen Xiong
Ze Yang
Sergio Casas
Rui Hu
R. Urtasun
39
50
0
02 Nov 2023
Novelty Detection in Reinforcement Learning with World Models
Novelty Detection in Reinforcement Learning with World Models
Geigh Zollicoffer
Kenneth Eaton
Jonathan C. Balloch
Julia Kim
Mark O. Riedl
Robert Wright
Mark O. Riedl
21
1
0
12 Oct 2023
Hieros: Hierarchical Imagination on Structured State Space Sequence
  World Models
Hieros: Hierarchical Imagination on Structured State Space Sequence World Models
Paul Mattes
Rainer Schlosser
R. Herbrich
21
4
0
08 Oct 2023
Task-Oriented Koopman-Based Control with Contrastive Encoder
Task-Oriented Koopman-Based Control with Contrastive Encoder
Xubo Lyu
Hanyang Hu
Seth Siriya
Ye Pu
Mo Chen
26
5
0
28 Sep 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous
  Driving
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Xiaofeng Wang
Zheng Hua Zhu
Guan Huang
Xinze Chen
Jiagang Zhu
Jiwen Lu
VGen
22
148
0
18 Sep 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing
  Posterior Predictability
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu
Max Simchowitz
Siri Gadipudi
Abhishek Gupta
38
13
0
31 Aug 2023
Structured World Models from Human Videos
Structured World Models from Human Videos
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
30
85
0
21 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
25
28
0
14 Aug 2023
On-Robot Bayesian Reinforcement Learning for POMDPs
On-Robot Bayesian Reinforcement Learning for POMDPs
Hai V. Nguyen
Sammie Katt
Yuchen Xiao
Chris Amato
OffRL
18
1
0
22 Jul 2023
Simplified Temporal Consistency Reinforcement Learning
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Juho Kannala
J. Pajarinen
OffRL
30
12
0
15 Jun 2023
On the Efficacy of 3D Point Cloud Reinforcement Learning
On the Efficacy of 3D Point Cloud Reinforcement Learning
Z. Ling
Yuan Yao
Xuanlin Li
H. Su
3DPC
26
13
0
11 Jun 2023
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Max Schwarzer
J. Obando-Ceron
Aaron C. Courville
Marc G. Bellemare
Rishabh Agarwal
P. S. Castro
OffRL
43
82
0
30 May 2023
Pre-training Contextualized World Models with In-the-wild Videos for
  Reinforcement Learning
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
26
24
0
29 May 2023
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control
  via Sample Multiple Reuse
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse
Jiafei Lyu
Le Wan
Zongqing Lu
Xiu Li
OffRL
26
9
0
29 May 2023
A Reminder of its Brittleness: Language Reward Shaping May Hinder
  Learning for Instruction Following Agents
A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents
Sukai Huang
N. Lipovetzky
Trevor Cohn
30
2
0
26 May 2023
Learning Better with Less: Effective Augmentation for Sample-Efficient
  Visual Reinforcement Learning
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
Guozheng Ma
Linrui Zhang
Haoyu Wang
Lu Li
Zilin Wang
Zhen Wang
Li Shen
Xueqian Wang
Dacheng Tao
40
10
0
25 May 2023
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning
  via Transition Occupancy Matching
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
24
5
0
22 May 2023
Understanding the World to Solve Social Dilemmas Using Multi-Agent
  Reinforcement Learning
Understanding the World to Solve Social Dilemmas Using Multi-Agent Reinforcement Learning
Manuel Rios
Nicanor Quijano
Luis Felipe Giraldo
26
1
0
19 May 2023
Prompt-Tuning Decision Transformer with Preference Ranking
Prompt-Tuning Decision Transformer with Preference Ranking
Shengchao Hu
Li Shen
Ya-Qin Zhang
Dacheng Tao
OffRL
26
14
0
16 May 2023
Learning Achievement Structure for Structured Exploration in Domains
  with Sparse Reward
Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward
Zihan Zhou
Animesh Garg
OffRL
14
3
0
30 Apr 2023
Approximate Shielding of Atari Agents for Safe Exploration
Approximate Shielding of Atari Agents for Safe Exploration
Alexander W. Goodall
Francesco Belardinelli
19
2
0
21 Apr 2023
Model Predictive Control with Self-supervised Representation Learning
Model Predictive Control with Self-supervised Representation Learning
Jonas A. Matthies
Muhammad Burhan Hafez
Mostafa Kotb
S. Wermter
SSL
6
0
0
14 Apr 2023
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient
  Multi-Agent Reinforcement Learning
MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning
Aravind Venugopal
Stephanie Milani
Fei Fang
Balaraman Ravindran
OffRL
16
0
0
12 Apr 2023
Inductive biases in deep learning models for weather prediction
Inductive biases in deep learning models for weather prediction
Jannik Thümmel
Matthias Karlbauer
S. Otte
C. Zarfl
Georg Martius
...
Thomas Scholten
Ulrich Friedrich
V. Wulfmeyer
B. Goswami
Martin Volker Butz
AI4CE
33
5
0
06 Apr 2023
ENTL: Embodied Navigation Trajectory Learner
ENTL: Embodied Navigation Trajectory Learner
Klemen Kotar
Aaron Walsman
Roozbeh Mottaghi
15
6
0
05 Apr 2023
Tracker: Model-based Reinforcement Learning for Tracking Control of
  Human Finger Attached with Thin McKibben Muscles
Tracker: Model-based Reinforcement Learning for Tracking Control of Human Finger Attached with Thin McKibben Muscles
Daichi Saito
Eri Nagatomo
Jefferson Pardomuan
Hideki Koike
11
0
0
01 Apr 2023
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting
Nicolai Dorka
Tim Welschehold
Wolfram Burgard
16
3
0
17 Mar 2023
Beware of Instantaneous Dependence in Reinforcement Learning
Beware of Instantaneous Dependence in Reinforcement Learning
Zhengmao Zhu
Yu-Ren Liu
Hong Tian
Yang Yu
Kun Zhang
OffRL
31
1
0
09 Mar 2023
DITTO: Offline Imitation Learning with World Models
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
Ingmar Posner
OffRL
21
18
0
06 Feb 2023
Variational Latent Branching Model for Off-Policy Evaluation
Variational Latent Branching Model for Off-Policy Evaluation
Qitong Gao
Ge Gao
Min Chi
Miroslav Pajic
OffRL
26
6
0
28 Jan 2023
Neural Episodic Control with State Abstraction
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
L. Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
18
14
0
27 Jan 2023
Neuro-Symbolic World Models for Adapting to Open World Novelty
Neuro-Symbolic World Models for Adapting to Open World Novelty
Jonathan C. Balloch
Zhiyu Lin
Robert Wright
Xiangyu Peng
Mustafa Hussain
Aarun Srinivas
Julia Kim
Mark O. Riedl
11
10
0
16 Jan 2023
Predictive World Models from Real-World Partial Observations
Predictive World Models from Real-World Partial Observations
Robin Karlsson
Alexander Carballo
Keisuke Fujii
Kento Ohtani
K. Takeda
17
5
0
12 Jan 2023
MoDem: Accelerating Visual Model-Based Reinforcement Learning with
  Demonstrations
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
24
49
0
12 Dec 2022
A Rubric for Human-like Agents and NeuroAI
A Rubric for Human-like Agents and NeuroAI
Ida Momennejad
52
14
0
08 Dec 2022
Choreographer: Learning and Adapting Skills in Imagination
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
29
21
0
23 Nov 2022
Representation Learning for Continuous Action Spaces is Beneficial for
  Efficient Policy Learning
Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning
Tingting Zhao
Ying Wang
Weidong Sun
Yarui Chen
Gang Niu
Masashi Sugiyama
16
1
0
23 Nov 2022
Disentangled (Un)Controllable Features
Disentangled (Un)Controllable Features
Jacob E. Kooi
Mark Hoogendoorn
Vincent François-Lavet
DRL
19
0
0
31 Oct 2022
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via
  Differentiable Physics-Based Simulation and Rendering
SAM-RL: Sensing-Aware Model-Based Reinforcement Learning via Differentiable Physics-Based Simulation and Rendering
Jun Lv
Yunhai Feng
Cheng Zhang
Shu Zhao
Lin Shao
Cewu Lu
16
24
0
27 Oct 2022
On Many-Actions Policy Gradient
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
14
0
0
24 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
53
8
0
23 Oct 2022
Previous
1234
Next