1910.06764
Stabilizing Transformers for Reinforcement Learning
13 October 2019
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
Siddhant M. Jayakumar
Max Jaderberg
Raphael Lopez Kaufman
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
Papers citing
"Stabilizing Transformers for Reinforcement Learning"
48 / 48 papers shown
Title
Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation
Francisco Giral
Ignacio Gómez
Ricardo Vinuesa
S. L. Clainche
32
2
0
05 Nov 2024
BlendRL: A Framework for Merging Symbolic and Neural Policy Learning
Hikaru Shindo
Quentin Delfosse
D. Dhami
Kristian Kersting
33
3
0
15 Oct 2024
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
Ge Li
Dong Tian
Hongyi Zhou
Xinkai Jiang
Rudolf Lioutikov
Gerhard Neumann
OffRL
134
2
0
12 Oct 2024
Transformers Handle Endogeneity in In-Context Linear Regression
Haodong Liang
Krishnakumar Balasubramanian
Lifeng Lai
36
1
0
02 Oct 2024
Attacking Slicing Network via Side-channel Reinforcement Learning Attack
Wei Shao
Chandra Thapa
Rayne Holland
Sarah Ali Siddiqui
S. Çamtepe
AAML
18
0
0
17 Sep 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
38
1
0
26 Jul 2024
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
Petr Anokhin
Nikita Semenov
Artyom Sorokin
Dmitry Evseev
Mikhail Burtsev
Evgeny Burnaev
LLMAG
RALM
KELM
47
7
0
05 Jul 2024
Advantage Alignment Algorithms
Juan Agustin Duque
Milad Aghajohari
Tim Cooijmans
Tianyu Zhang
Aaron C. Courville
Gauthier Gidel
23
0
0
20 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
41
1
0
15 Jun 2024
Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology
Gaith Rjoub
Saidul Islam
Jamal Bentahar
M. Almaiah
Rana Alrawashdeh
OffRL
21
1
0
05 Apr 2024
Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization
Chanyeon Kim
Jongwoon Park
Hyun-sool Bae
Woo Chang Kim
42
3
0
03 Apr 2024
Transformer-VQ: Linear-Time Transformers via Vector Quantization
Lucas D. Lingle
24
15
0
28 Sep 2023
Preference Transformer: Modeling Human Preferences using Transformers for RL
Changyeon Kim
Jongjin Park
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee
OffRL
30
61
0
02 Mar 2023
A Survey of Meta-Reinforcement Learning
Jacob Beck
Risto Vuorio
E. Liu
Zheng Xiong
L. Zintgraf
Chelsea Finn
Shimon Whiteson
OOD
OffRL
30
121
0
19 Jan 2023
UniMASK: Unified Inference in Sequential Decision Problems
Micah Carroll
Orr Paradise
Jessy Lin
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
19
21
0
20 Nov 2022
On learning history based policies for controlling Markov decision processes
Gandharv Patil
Aditya Mahajan
Doina Precup
OffRL
21
5
0
06 Nov 2022
Bridging the Gap Between Target Networks and Functional Regularization
Alexandre Piché
Valentin Thomas
Joseph Marino
Rafael Pardiñas
Gian Maria Marconi
C. Pal
Mohammad Emtiyaz Khan
9
1
0
21 Oct 2022
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization
Stone Tao
Xiaochen Li
Tongzhou Mu
Zhiao Huang
Yuzhe Qin
Hao Su
17
3
0
14 Oct 2022
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Kevin Esslinger
Robert W. Platt
Chris Amato
OffRL
22
32
0
02 Jun 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
36
149
0
27 Apr 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
24
6
0
11 Apr 2022
Unsupervised Learning of Temporal Abstractions with Slot-based Transformers
Anand Gopalakrishnan
Kazuki Irie
Jürgen Schmidhuber
Sjoerd van Steenkiste
OffRL
19
16
0
25 Mar 2022
Consistent Dropout for Policy Gradient Reinforcement Learning
Matthew J. Hausknecht
Nolan Wagener
OffRL
16
10
0
23 Feb 2022
TransDreamer: Reinforcement Learning with Transformer World Models
Changgu Chen
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
OffRL
27
90
0
19 Feb 2022
Improving Sample Efficiency of Value Based Models Using Attention and Vision Transformers
Amir Ardalan Kalantari
Mohammad Amini
Sarath Chandar
Doina Precup
44
4
0
01 Feb 2022
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Jack Parker-Holder
Raghunandan Rajan
Xingyou Song
André Biedenkapp
Yingjie Miao
...
Vu-Linh Nguyen
Roberto Calandra
Aleksandra Faust
Frank Hutter
Marius Lindauer
AI4CE
28
100
0
11 Jan 2022
How to Learn and Represent Abstractions: An Investigation using Symbolic Alchemy
Badr AlKhamissi
Akshay Srinivasan
Zeb Kurth-Nelson
Samuel Ritter
20
1
0
14 Dec 2021
Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
Christian Gumbsch
Martin Volker Butz
Georg Martius
AI4CE
13
21
0
29 Oct 2021
Reinforcement Learning with Information-Theoretic Actuation
Elliot Catt
Marcus Hutter
J. Veness
29
0
0
30 Sep 2021
Representation learning for neural population activity with Neural Data Transformers
Joel Ye
C. Pandarinath
AI4TS
AI4CE
11
51
0
02 Aug 2021
Pre-trained Language Models as Prior Knowledge for Playing Text-based Games
Ishika Singh
Gargi Singh
Ashutosh Modi
OffRL
AI4CE
11
28
0
18 Jul 2021
CoBERL: Contrastive BERT for Reinforcement Learning
Andrea Banino
Adrià Puigdomènech Badia
Jacob Walker
Tim Scholtes
Jovana Mitrović
Charles Blundell
OffRL
25
36
0
12 Jul 2021
Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
23
58
0
11 Jun 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
31
640
0
03 Jun 2021
Towards mental time travel: a hierarchical memory for reinforcement learning agents
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Andrea Banino
Felix Hill
11
47
0
28 May 2021
Episodic Transformer for Vision-and-Language Navigation
Alexander Pashevich
Cordelia Schmid
Chen Sun
LM&Ro
13
193
0
13 May 2021
Learning Robust Latent Representations for Controllable Speech Synthesis
Shakti Kumar
Jithin Pradeep
Hussain Zaidi
DRL
20
6
0
10 May 2021
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
Emilio Parisotto
Ruslan Salakhutdinov
29
43
0
04 Apr 2021
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning
Samarth Sinha
Ajay Mandlekar
Animesh Garg
OffRL
19
104
0
10 Mar 2021
Pretrained Transformers as Universal Computation Engines
Kevin Lu
Aditya Grover
Pieter Abbeel
Igor Mordatch
21
217
0
09 Mar 2021
Generating Images with Sparse Representations
C. Nash
Jacob Menick
Sander Dieleman
Peter W. Battaglia
13
199
0
05 Mar 2021
Single-Shot Motion Completion with Transformer
Yinglin Duan
Tianyang Shi
Zhengxia Zou
Yenan Lin
Zhehui Qian
Bohan Zhang
U. Michigan
ViT
16
75
0
01 Mar 2021
Portfolio Optimization with 2D Relative-Attentional Gated Transformer
Tae Wan Kim
Matloob Khushi
AI4TS
11
12
0
27 Dec 2020
Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning
Jonáš Kulhánek
Erik Derner
Robert Babuška
15
38
0
21 Oct 2020
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu
Yingce Xia
Lijun Wu
Jiajun Deng
Wen-gang Zhou
Tao Qin
Houqiang Li
SSL
OffRL
21
54
0
15 Oct 2020
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
14
198
0
10 Jun 2020
Text Summarization with Pretrained Encoders
Yang Liu
Mirella Lapata
MILM
254
1,430
0
22 Aug 2019
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
264
5,326
0
05 Nov 2016