ResearchTrend.AI
Offline Reinforcement Learning as One Big Sequence Modeling Problem

3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
    OffRL

Papers citing "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

50 / 465 papers shown
Transformer-Based Fault-Tolerant Control for Fixed-Wing UAVs Using Knowledge Distillation and In-Context Adaptation
Francisco Giral
Ignacio Gómez
Ricardo Vinuesa
S. L. Clainche
32
2
0
05 Nov 2024
Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
Ziqing Fan
Shengchao Hu
Yuhang Zhou
Li Shen
Ya-Qin Zhang
Yanfeng Wang
Dacheng Tao
OffRL
34
0
0
02 Nov 2024
Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use
Jiajun Xi
Yinong He
Jianing Yang
Yinpei Dai
Joyce Chai
LM&Ro
24
2
0
31 Oct 2024
Transformer-based Model Predictive Control: Trajectory Optimization via Sequence Modeling
Davide Celestini
Daniele Gammelli
T. Guffanti
Simone D'Amico
Elisa Capello
Marco Pavone
49
8
0
31 Oct 2024
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation
Samuele Peri
Alessio Russo
Gabor Fodor
Pablo Soldati
OffRL
20
0
0
30 Oct 2024
Random Policy Enables In-Context Reinforcement Learning within Trust Horizons
Weiqin Chen
Santiago Paternain
OffRL
37
0
0
25 Oct 2024
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts
R. Teo
Tan M. Nguyen
MoE
31
3
0
18 Oct 2024
State Estimation Transformers for Agile Legged Locomotion
Chen Yu
Yichu Yang
Tianlin Liu
Yangwei You
Mingliang Zhou
Diyun Xiang
26
1
0
17 Oct 2024
Off-dynamics Conditional Diffusion Planners
Wen Zheng Terence Ng
Jianda Chen
Tianwei Zhang
DiffM
OffRL
35
0
0
16 Oct 2024
Generalizable Spacecraft Trajectory Generation via Multimodal Learning with Transformers
Davide Celestini
Amirhossein Afsharrad
Daniele Gammelli
T. Guffanti
G. Zardini
Sanjay Lall
Elisa Capello
Simone D'Amico
Marco Pavone
44
1
0
15 Oct 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Zhi Wang
Li Lyna Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
33
6
0
15 Oct 2024
DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Eric H. Jiang
Zhi Zhang
Dinghuai Zhang
Andrew Lizarraga
Chenheng Xu
...
Zhengjie Xu
Peiyu Yu
Yuer Tang
Deqian Kong
Ying Nian Wu
OffRL
27
0
0
15 Oct 2024
Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare
Nan Fang
Guiliang Liu
Wei Gong
OffRL
37
0
0
10 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
90
0
0
10 Oct 2024
Retrieval-Augmented Decision Transformer: External Memory for In-context RL
Thomas Schmied
Fabian Paischer
Vihang Patil
M. Hofmarcher
Razvan Pascanu
Sepp Hochreiter
OffRL
34
6
0
09 Oct 2024
Diffusion Model Predictive Control
Guangyao Zhou
Sivaramakrishnan Swaminathan
Rajkumar Vasudeva Raju
J. S. Guntupalli
Wolfgang Lehrach
Joseph Ortiz
Antoine Dedieu
Miguel Lázaro-Gredilla
Kevin P. Murphy
29
6
0
07 Oct 2024
Autoregressive Action Sequence Learning for Robotic Manipulation
Xinyu Zhang
Yuhan Liu
Haonan Chang
Liam Schramm
Abdeslam Boularias
33
8
0
04 Oct 2024
Predictive Coding for Decision Transformer
T. Luu
Donghoon Lee
Chang D. Yoo
OffRL
56
1
0
04 Oct 2024
RAIL: Reachability-Aided Imitation Learning for Safe Policy Execution
Wonsuhk Jung
Dennis Anthony
Utkarsh Aashu Mishra
Nadun Ranawaka Arachchige
Matthew Bronars
Danfei Xu
Shreyas Kousik
31
0
0
28 Sep 2024
Using Deep Autoregressive Models as Causal Inference Engines
Daniel Jiwoong Im
Kevin Zhang
Nakul Verma
Kyunghyun Cho
CML
19
1
0
27 Sep 2024
AnyCar to Anywhere: Learning Universal Dynamics Model for Agile and Adaptive Mobility
Wenli Xiao
Haoru Xue
Tony Tao
Dvij Kalaria
John M. Dolan
Guanya Shi
29
5
0
24 Sep 2024
Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling
Jesse van Remmerden
Z. Bukhsh
Yingqian Zhang
OffRL
OnRL
39
1
0
16 Sep 2024
Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens
Joseph Clinton
Robert Lieck
OffRL
38
4
0
14 Sep 2024
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
Teng Yan
Zhendong Ruan
Yaobang Cai
Yu Han
Wenxian Li
Yang Zhang
OffRL
25
0
0
12 Sep 2024
Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence
Luo Ji
Runji Lin
OffRL
AI4CE
LM&Ro
26
0
0
11 Sep 2024
DeMoBot: Deformable Mobile Manipulation with Vision-based Sub-goal Retrieval
Yuying Zhang
Wenyan Yang
J. Pajarinen
32
1
0
28 Aug 2024
SAMBO-RL: Shifts-aware Model-based Offline Reinforcement Learning
Wang Luo
Haoran Li
Zicheng Zhang
Congying Han
Jiayu Lv
Tiande Guo
OffRL
40
1
0
23 Aug 2024
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning
Yilun Kong
Hangyu Mao
Qi Zhao
Bin Zhang
Jingqing Ruan
Li Shen
Yongzhe Chang
Xueqian Wang
Rui Zhao
Dacheng Tao
OffRL
34
1
0
20 Aug 2024
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey
Ruiqi Zhang
Jing Hou
Florian Walter
Shangding Gu
Jiayi Guan
Florian Röhrbein
Yali Du
Panpan Cai
G. Chen
Alois Knoll
39
12
0
19 Aug 2024
Learning Based Toolpath Planner on Diverse Graphs for 3D Printing
Yuming Huang
Yuhu Guo
Renbo Su
Xingjian Han
Junhao Ding
...
Weiming Wang
Guoxin Fang
Xu Song
Emily Whiting
Charlie C. L. Wang
17
4
0
17 Aug 2024
Building Decision Making Models Through Language Model Regime
Yu Zhang
Haoxiang Liu
Feijun Jiang
Weihua Luo
Kaifu Zhang
41
0
0
12 Aug 2024
Navigating the Human Maze: Real-Time Robot Pathfinding with Generative Imitation Learning
Martin Moder
Stephen Adhisaputra
Josef Pauli
18
0
0
07 Aug 2024
Adaptive Planning with Generative Models under Uncertainty
Pascal Jutras-Dubé
Ruqi Zhang
Aniket Bera
26
2
0
02 Aug 2024
Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer
Yu Yang
Pan Xu
VLM
OffRL
29
1
0
02 Aug 2024
Actra: Optimized Transformer Architecture for Vision-Language-Action Models in Robot Learning
Yueen Ma
Dafeng Chi
Shiguang Wu
Yuecheng Liu
Yuzheng Zhuang
Jianye Hao
Irwin King
36
5
0
02 Aug 2024
Empowering Clinicians with Medical Decision Transformers: A Framework for Sepsis Treatment
A. Rahman
Pranav Agarwal
R. Noumeir
P. Jouvet
Vincent Michalski
Samira Ebrahimi Kahou
OffRL
24
0
0
28 Jul 2024
QT-TDM: Planning with Transformer Dynamics Model and Autoregressive Q-Learning
Mostafa Kotb
C. Weber
Muhammad Burhan Hafez
Stefan Wermter
29
0
0
26 Jul 2024
Reinforcement Learning for Sustainable Energy: A Survey
Koen Ponse
Felix Kleuker
Márton Fejér
Álvaro Serra-Gómez
Aske Plaat
Thomas M. Moerland
OffRL
AI4CE
40
1
0
26 Jul 2024
Diffusion Models as Optimizers for Efficient Planning in Offline RL
Renming Huang
Yunqiang Pei
Guoqing Wang
Yangming Zhang
Yang Yang
Peng Wang
H. Shen
OffRL
31
0
0
23 Jul 2024
CarFormer: Self-Driving with Learned Object-Centric Representations
Shadi S. Hamdan
Fatma Guney
3DPC
OCL
35
2
0
22 Jul 2024
MuTT: A Multimodal Trajectory Transformer for Robot Skills
Claudius Kienle
Benjamin Alt
Onur Celik
P. Becker
Darko Katic
Rainer Jäkel
Gerhard Neumann
38
2
0
22 Jul 2024
Offline Imitation Learning Through Graph Search and Retrieval
Zhao-Heng Yin
Pieter Abbeel
OffRL
34
3
0
22 Jul 2024
Deep Attention Driven Reinforcement Learning (DAD-RL) for Autonomous Vehicle Decision-Making in Dynamic Environment
Jayabrata Chowdhury
Venkataramanan Shivaraman
Sumit Dangi
Suresh Sundaram
P. B. Sujit
34
3
0
12 Jul 2024
Geospatial Trajectory Generation via Efficient Abduction: Deployment for Independent Testing
Divyagna Bavikadi
Dyuman Aditya
Devendra Parkar
Paulo Shakarian
Graham Mueller
Chad Parvis
Gerardo I. Simari
43
2
0
08 Jul 2024
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models
Yang Zhang
Chenjia Bai
Bin Zhao
Junchi Yan
Xiu Li
Xuelong Li
OffRL
19
0
0
22 Jun 2024
A Primal-Dual Framework for Transformers and Neural Networks
Tan M. Nguyen
Tam Nguyen
Nhat Ho
Andrea L. Bertozzi
Richard G. Baraniuk
Stanley J. Osher
ViT
21
13
0
19 Jun 2024
Elliptical Attention
Stefan K. Nielsen
Laziz U. Abdullaev
R. Teo
Tan M. Nguyen
23
3
0
19 Jun 2024
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
R. Teo
Tan M. Nguyen
43
4
0
19 Jun 2024
Efficient Offline Reinforcement Learning: The Critic is Critical
Adam Jelley
Trevor A. McInroe
Sam Devlin
Amos Storkey
OffRL
37
1
0
19 Jun 2024
ARDuP: Active Region Video Diffusion for Universal Policies
Shuaiyi Huang
Mara Levy
Zhenyu Jiang
Anima Anandkumar
Yuke Zhu
Linxi Fan
De-An Huang
Abhinav Shrivastava
VGen
42
2
0
19 Jun 2024