2305.19452
Cited By
Bigger, Better, Faster: Human-level Atari with human-level efficiency
30 May 2023
Max Schwarzer
J. Obando-Ceron
Aaron C. Courville
Marc G. Bellemare
Rishabh Agarwal
P. S. Castro
OffRL
Papers citing
"Bigger, Better, Faster: Human-level Atari with human-level efficiency"
16 / 66 papers shown
Title
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
24
1
0
30 Oct 2023
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
Jake Grigsby
Linxi Fan
Yuke Zhu
OffRL
LM&Ro
22
10
0
15 Oct 2023
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning
Weipu Zhang
Gang Wang
Jian-jun Sun
Yetian Yuan
Gao Huang
61
31
0
14 Oct 2023
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Guozheng Ma
Lu Li
Sen Zhang
Zixuan Liu
Zhen Wang
Yixin Chen
Li Shen
Xueqian Wang
Dacheng Tao
OffRL
45
14
0
11 Oct 2023
Small batch deep reinforcement learning
J. Obando-Ceron
Marc G. Bellemare
Pablo Samuel Castro
VLM
29
14
0
05 Oct 2023
BarlowRL: Barlow Twins for Data-Efficient Reinforcement Learning
Omer Veysel Cagatan
Barış Akgün
BDL
OffRL
10
3
0
08 Aug 2023
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
22
5
0
20 Jul 2023
Maintaining Plasticity in Deep Continual Learning
Shibhansh Dohare
J. F. Hernandez-Garcia
Parash Rahman
A. R. Mahmood
Richard S. Sutton
KELM
CLL
25
27
0
23 Jun 2023
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm
Wensong Bai
Chao Zhang
Yichao Fu
Lingwei Peng
Hui Qian
Bin Dai
11
1
0
11 Jun 2023
Human-level Atari 200x faster
Steven Kapturowski
Victor Campos
Ray Jiang
Nemanja Rakićević
Hado van Hasselt
Charles Blundell
Adria Puigdomenech Badia
OffRL
39
28
0
15 Sep 2022
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
Augustine N. Mavor-Parker
Matthew J. Sargent
Christian Pehle
Andrea Banino
Lewis D. Griffin
Caswell Barry
11
2
0
14 Sep 2022
Light-weight probing of unsupervised representations for Reinforcement Learning
Wancong Zhang
Anthony GX-Chen
Vlad Sobal
Yann LeCun
Nicolas Carion
SSL
OffRL
33
13
0
25 Aug 2022
The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin
Max Schwarzer
P. D'Oro
Pierre-Luc Bacon
Aaron C. Courville
OnRL
85
178
0
16 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
D2RL: Deep Dense Architectures in Reinforcement Learning
Samarth Sinha
Homanga Bharadhwaj
A. Srinivas
Animesh Garg
OffRL
AI4CE
43
56
0
19 Oct 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,944
0
04 May 2020