arXiv:2106.01345
Decision Transformer: Reinforcement Learning via Sequence Modeling
2 June 2021
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
Papers citing
"Decision Transformer: Reinforcement Learning via Sequence Modeling"
50 / 292 papers shown
Title
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
46
1
0
31 May 2024
Learning diverse attacks on large language models for robust red-teaming and safety tuning
Seanie Lee
Minsu Kim
Lynn Cherif
David Dobre
Juho Lee
...
Kenji Kawaguchi
Gauthier Gidel
Yoshua Bengio
Nikolay Malkin
Moksh Jain
AAML
55
12
0
28 May 2024
SoK: Leveraging Transformers for Malware Analysis
Pradip Kunwar
Kshitiz Aryal
Maanak Gupta
Mahmoud Abdelsalam
Elisa Bertino
90
0
0
27 May 2024
Reinforcing Language Agents via Policy Optimization with Action Decomposition
Muning Wen
Ziyu Wan
Weinan Zhang
Jun Wang
Ying Wen
38
7
0
23 May 2024
State-Constrained Offline Reinforcement Learning
Charles A. Hepburn
Yue Jin
Giovanni Montana
OffRL
29
0
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
69
41
0
23 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
32
0
0
06 May 2024
Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly
Hao-ming Lin
Radu Corcodel
Ding Zhao
40
7
0
26 Apr 2024
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer
Da Chang
Yu Li
64
2
0
19 Apr 2024
Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization
Chanyeon Kim
Jongwoon Park
Hyun-sool Bae
Woo Chang Kim
42
3
0
03 Apr 2024
CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning
Luke Rowe
Roger Girgis
Anthony Gosselin
Bruno Carrez
Florian Golemo
Felix Heide
Liam Paull
Christopher Pal
38
4
0
29 Mar 2024
Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion
Kuang-Da Wang
Wei-Yao Wang
Ping-Chun Hsieh
Wenjie Peng
OffRL
29
0
0
19 Mar 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver
Chen Tang
Ce Hao
Kenta Kawamoto
Masayoshi Tomizuka
Wei Zhan
OffRL
32
0
0
22 Feb 2024
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Xinyu Zhang
Wenjie Qiu
Yi-Chen Li
Lei Yuan
Chengxing Jia
Zongzhang Zhang
Yang Yu
OffRL
28
1
0
17 Feb 2024
One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill
Sangwoo Shin
Daehee Lee
Minjong Yoo
Woo Kyung Kim
Honguk Woo
18
9
0
13 Feb 2024
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim
Yunseon Choi
Daiki E. Matsunaga
Kee-Eung Kim
OffRL
33
6
0
11 Feb 2024
Retrosynthesis Prediction via Search in (Hyper) Graph
Zixun Lan
Binjie Hong
Jiajun Zhu
Zuo Zeng
Zhenfu Liu
Limin Yu
Fei Ma
37
0
0
09 Feb 2024
Hierarchical Transformers are Efficient Meta-Reinforcement Learners
Gresa Shala
André Biedenkapp
Josif Grabocka
OffRL
29
4
0
09 Feb 2024
Return-Aligned Decision Transformer
Tsunehiko Tanaka
Kenshi Abe
Kaito Ariu
Tetsuro Morimura
Edgar Simo-Serra
OffRL
59
1
0
06 Feb 2024
Understanding the planning of LLM agents: A survey
Xu Huang
Weiwen Liu
Xiaolong Chen
Xingmei Wang
Hao Wang
Defu Lian
Yasheng Wang
Ruiming Tang
Enhong Chen
LLMAG
LM&Ro
24
130
0
05 Feb 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Pheng-Ann Heng
OffRL
31
7
0
04 Feb 2024
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
35
2
0
30 Jan 2024
Multi-Object Navigation in real environments using hybrid policies
Assem Sadek
G. Bono
Boris Chidlovskii
A. Baskurt
Christian Wolf
45
5
0
24 Jan 2024
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View
Raj Ghugare
Matthieu Geist
Glen Berseth
Benjamin Eysenbach
OffRL
25
14
0
20 Jan 2024
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning
Sabariswaran Mani
Sreyas Venkataraman
Abhranil Chandra
Adyan Rizvi
Yash Sirvi
Soumojit Bhattacharya
Aritra Hazra
OffRL
26
1
0
17 Jan 2024
DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement
Jiaqi Liu
Peng Hang
Xiaocong Zhao
Jianqiang Wang
Jian Sun
31
10
0
08 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
17
9
0
06 Jan 2024
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
73
5
0
13 Dec 2023
Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach
Xingshuai Huang
Di Wu
Benoit Boulet
OffRL
19
2
0
12 Dec 2023
Saturn Platform: Foundation Model Operations and Generative AI for Financial Services
Antonio Busson
Rennan Gaio
Rafael H. Rocha
Francisco Evangelista
Bruno Rizzi
Luan Carvalho
Rafael Miceli
Marcos Rabaioli
David Favaro
23
1
0
12 Dec 2023
Unified machine learning tasks and datasets for enhancing renewable energy
Arsam Aryandoust
Thomas Rigoni
Francesco di Stefano
Anthony Patt
35
0
0
12 Nov 2023
Uncovering Intermediate Variables in Transformers using Circuit Probing
Michael A. Lepori
Thomas Serre
Ellie Pavlick
70
7
0
07 Nov 2023
A Tractable Inference Perspective of Offline RL
Xuejie Liu
Anji Liu
Guy Van den Broeck
Yitao Liang
OffRL
34
1
0
31 Oct 2023
Hybrid Search for Efficient Planning with Completeness Guarantees
Kalle Kujanpää
J. Pajarinen
Alexander Ilin
17
3
0
19 Oct 2023
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Bodhisattwa Prasad Majumder
Bhavana Dalvi
Peter Alexander Jansen
Oyvind Tafjord
Niket Tandon
Li Zhang
Chris Callison-Burch
Peter Clark
LRM
LLMAG
CLL
13
37
0
16 Oct 2023
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Zichen Zhang
Yunshuang Li
Osbert Bastani
Abhishek Gupta
Dinesh Jayaraman
Yecheng Jason Ma
Luca Weihs
30
17
0
12 Oct 2023
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining
Licong Lin
Yu Bai
Song Mei
OffRL
30
42
0
12 Oct 2023
Predicting Player Engagement in Tom Clancy's The Division 2: A Multimodal Approach via Pixels and Gamepad Actions
Kosmas Pinitas
David Renaudie
Mike Thomsen
M. Barthet
Konstantinos Makantasis
Antonios Liapis
Georgios N. Yannakakis
21
13
0
09 Oct 2023
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models
Hanjing Wang
Man-Kit Sit
Cong He
Ying Wen
Weinan Zhang
J. Wang
Yaodong Yang
Luo Mai
OffRL
VLM
27
1
0
08 Oct 2023
Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) Challenge
J. Tan
Mehul Motani
LLMAG
31
8
0
08 Oct 2023
PrototypeFormer: Learning to Explore Prototype Relationships for Few-shot Image Classification
Feihong He
Gang Li
Lingyu Si
VLM
ViT
52
1
0
05 Oct 2023
Language models in molecular discovery
Chaoqi Wang
Yibo Jiang
Chenghao Yang
Han Liu
Yuxin Chen
18
7
0
28 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
32
0
0
26 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
24
17
0
22 Sep 2023
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Guan-Bo Wang
Sijie Cheng
Xianyuan Zhan
Xiangang Li
Sen Song
Yang Liu
ALM
13
227
0
20 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
30
0
0
08 Sep 2023
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
18
8
0
04 Sep 2023
Rule-Based Error Detection and Correction to Operationalize Movement Trajectory Classification
B. Xi
Kevin Scaria
Paulo Shakarian
32
2
0
28 Aug 2023
MTD-GPT: A Multi-Task Decision-Making GPT Model for Autonomous Driving at Unsignalized Intersections
Jiaqi Liu
Peng Hang
Xiao Qi
Jianqiang Wang
Jian-jun Sun
18
42
0
30 Jul 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
14
3
0
29 Jul 2023