On Transforming Reinforcement Learning by Transformer: The Development Trajectory

29 December 2022

Shengchao Hu

Li Shen

Papers citing "On Transforming Reinforcement Learning by Transformer: The Development Trajectory"

20 / 20 papers shown

Title
Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge Amogh Joshi Sourav Sanyal Kaushik Roy 61 2 0 31 Jan 2025
Transformer in Transformer as Backbone for Deep Reinforcement Learning Hangyu Mao Rui Zhao Hao Chen Jianye Hao Yiqun Chen Dong Li Junge Zhang Zhen Xiao OffRL 18 8 0 30 Dec 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation Mohit Shridhar Lucas Manuelli D. Fox LM&Ro 141 449 0 12 Sep 2022
Instruction-driven history-aware policies for robotic manipulations Pierre-Louis Guhur Shizhe Chen Ricardo Garcia Pinel Makarand Tapaswi Ivan Laptev Cordelia Schmid LM&Ro 89 101 0 11 Sep 2022
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL Taku Yamagata Ahmed Khalil Raúl Santos-Rodríguez OffRL 142 70 0 08 Sep 2022
You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments Keiran Paster Sheila A. McIlraith Jimmy Ba OffRL 121 27 0 31 May 2022
Training language models to follow instructions with human feedback Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 301 11,730 0 04 Mar 2022
Can Wikipedia Help Offline Reinforcement Learning? Machel Reid Yutaro Yamada S. Gu 3DV RALM OffRL 124 95 0 28 Jan 2022
Multitask Prompted Training Enables Zero-Shot Task Generalization Victor Sanh Albert Webson Colin Raffel Stephen H. Bach Lintang Sutawika ... T. Bers Stella Biderman Leo Gao Thomas Wolf Alexander M. Rush LRM 203 1,651 0 15 Oct 2021
Offline Reinforcement Learning with Implicit Q-Learning Ilya Kostrikov Ashvin Nair Sergey Levine OffRL 203 627 0 12 Oct 2021
Augmenting Sequential Recommendation with Pseudo-Prior Items via Reversely Pre-training Transformer Zhiwei Liu Ziwei Fan Yu Wang Philip S. Yu 86 143 0 02 May 2021
Zero-Shot Text-to-Image Generation Aditya A. Ramesh Mikhail Pavlov Gabriel Goh Scott Gray Chelsea Voss Alec Radford Mark Chen Ilya Sutskever VLM 253 4,735 0 24 Feb 2021
COMBO: Conservative Offline Model-Based Policy Optimization Tianhe Yu Aviral Kumar Rafael Rafailov Aravind Rajeswaran Sergey Levine Chelsea Finn OffRL 194 412 0 16 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning Rongjun Qin Songyi Gao Xingyuan Zhang Zhen Xu Shengkai Huang Zewen Li Weinan Zhang Yang Yu OffRL 132 76 0 01 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems Sergey Levine Aviral Kumar George Tucker Justin Fu OffRL GP 321 1,662 0 04 May 2020
Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding Seonguk Park Gyubok Lee Manoj Bhat Jimin Seo Minseok Kang Jonathan M Francis Ashwin R. Jadhav Paul Pu Liang Louis-Philippe Morency 113 117 0 06 Mar 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey B. R. Kiran Ibrahim Sobh V. Talpaert Patrick Mannion A. A. Sallab S. Yogamani P. Pérez 137 1,599 0 02 Feb 2020
Help, Anna! Visual Navigation with Natural Multimodal Assistance via Retrospective Curiosity-Encouraging Imitation Learning Khanh Nguyen Hal Daumé LM&Ro EgoV 167 148 0 04 Sep 2019
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks Chelsea Finn Pieter Abbeel Sergey Levine OOD 234 11,568 0 09 Mar 2017
A Decomposable Attention Model for Natural Language Inference Ankur P. Parikh Oscar Täckström Dipanjan Das Jakob Uszkoreit 187 1,358 0 06 Jun 2016