Offline Reinforcement Learning as One Big Sequence Modeling Problem


3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
    OffRL

Papers citing "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

50 / 465 papers shown
Learning Computational Efficient Bots with Costly Features
Anthony Kobanda
Valliappan C. A.
Joshua Romoff
Ludovic Denoyer
OffRL
11
1
0
18 Aug 2023
Learning to Identify Critical States for Reinforcement Learning from Videos
Haozhe Liu
Mingchen Zhuge
Bing Li
Yu-Han Wang
Francesco Faccio
Bernard Ghanem
Jürgen Schmidhuber
OffRL
15
6
0
15 Aug 2023
MTD-GPT: A Multi-Task Decision-Making GPT Model for Autonomous Driving at Unsignalized Intersections
Jiaqi Liu
Peng Hang
Xiao Qi
Jianqiang Wang
Jian-jun Sun
23
42
0
30 Jul 2023
Dynamic deep-reinforcement-learning algorithm in Partially Observed Markov Decision Processes
Saki Omi
Hyo-Sang Shin
Namhoon Cho
Antonios Tsourdos
19
3
0
29 Jul 2023
Benchmarking Offline Reinforcement Learning on Real-Robot Hardware
Nico Gürtler
Sebastian Blaes
Pavel Kolev
Felix Widmaier
Manuel Wüthrich
Stefan Bauer
Bernhard Schölkopf
Georg Martius
OffRL
30
28
0
28 Jul 2023
Offline Reinforcement Learning with On-Policy Q-Function Regularization
Laixi Shi
Robert Dadashi
Yuejie Chi
P. S. Castro
M. Geist
OffRL
27
5
0
25 Jul 2023
PASTA: Pretrained Action-State Transformer Agents
Raphael Boige
Yannis Flet-Berliac
Arthur Flajolet
Guillaume Richard
Thomas Pierrot
LM&Ro
OffRL
26
5
0
20 Jul 2023
Transformers in Reinforcement Learning: A Survey
Pranav Agarwal
A. Rahman
P. St-Charles
Simon J. D. Prince
Samira Ebrahimi Kahou
OffRL
24
18
0
12 Jul 2023
Autonomy 2.0: The Quest for Economies of Scale
Shuang Wu
Bo Yu
Shaoshan Liu
Yuhao Zhu
16
2
0
08 Jul 2023
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Tianwei Ni
Michel Ma
Benjamin Eysenbach
Pierre-Luc Bacon
OffRL
18
34
0
07 Jul 2023
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
Zilai Zeng
Ce Zhang
Shijie Wang
Chen Sun
OffRL
27
5
0
07 Jul 2023
Elastic Decision Transformer
Yueh-hua Wu
Xiaolong Wang
Masashi Hamaya
OffRL
21
39
0
05 Jul 2023
Crossway Diffusion: Improving Diffusion-based Visuomotor Policy via Self-supervised Learning
Xiang Li
Varun Belagali
Jinghuan Shang
Michael S. Ryoo
32
28
0
04 Jul 2023
Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis
Alexander Meulemans
Simon Schug
Seijin Kobayashi
Nathaniel D. Daw
Gregory Wayne
13
3
0
29 Jun 2023
Length Generalization in Arithmetic Transformers
Samy Jelassi
Stéphane d'Ascoli
Carles Domingo-Enrich
Yuhuai Wu
Yuan-Fang Li
François Charton
30
38
0
27 Jun 2023
Learning non-Markovian Decision-Making from State-only Sequences
Aoyang Qin
Feng Gao
Qing Li
Song-Chun Zhu
Sirui Xie
28
9
0
27 Jun 2023
Supervised Pretraining Can Learn In-Context Reinforcement Learning
Jonathan Lee
Annie Xie
Aldo Pacchiano
Yash Chandak
Chelsea Finn
Ofir Nachum
Emma Brunskill
OffRL
27
73
0
26 Jun 2023
Learning to Modulate pre-trained Models in RL
Thomas Schmied
M. Hofmarcher
Fabian Paischer
Razvan Pascanu
Sepp Hochreiter
CLL
OffRL
24
14
0
26 Jun 2023
CEIL: Generalized Contextual Imitation Learning
Jinxin Liu
Li He
Yachen Kang
Zifeng Zhuang
Donglin Wang
Huazhe Xu
31
18
0
26 Jun 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
42
8
0
26 Jun 2023
Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets
Anirudhan Badrinath
Yannis Flet-Berliac
Allen Nie
Emma Brunskill
OffRL
12
16
0
24 Jun 2023
Large Sequence Models for Sequential Decision-Making: A Survey
Muning Wen
Runji Lin
Hanjing Wang
Yaodong Yang
Ying Wen
Luo Mai
J. Wang
Haifeng Zhang
Weinan Zhang
LM&Ro
LRM
37
35
0
24 Jun 2023
DiMSam: Diffusion Models as Samplers for Task and Motion Planning under Partial Observability
Xiaolin Fang
Caelan Reed Garrett
Clemens Eppner
Tomás Lozano-Pérez
L. Kaelbling
D. Fox
DiffM
33
17
0
22 Jun 2023
Recurrent Action Transformer with Memory
A. Staroverov
A. Bessonov
Dmitry A. Yudin
A. Kovalev
Aleksandr I. Panov
OffRL
33
4
0
15 Jun 2023
ChessGPT: Bridging Policy Learning and Language Modeling
Xidong Feng
Yicheng Luo
Ziyan Wang
Hongrui Tang
Mengyue Yang
Kun Shao
D. Mguni
Yali Du
Jun Wang
14
38
0
15 Jun 2023
Deep Generative Models for Decision-Making and Control
Michael Janner
26
1
0
15 Jun 2023
A Comprehensive Survey on Applications of Transformers for Deep Learning Tasks
Saidul Islam
Hanae Elmekki
Ahmed Elsebai
Jamal Bentahar
Najat Drawel
Gaith Rjoub
Witold Pedrycz
ViT
MedIm
19
171
0
11 Jun 2023
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach
Shixi Lian
Yi-An Ma
Jinyi Liu
Yan Zheng
Zhaopeng Meng
OffRL
14
1
0
10 Jun 2023
Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models
Siyan Zhao
Aditya Grover
OffRL
11
7
0
09 Jun 2023
Decision S4: Efficient Sequence-Based RL via State Spaces Layers
Shmuel Bar-David
Itamar Zimerman
Eliya Nachmani
Lior Wolf
OffRL
21
27
0
08 Jun 2023
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning
Jifeng Hu
Yan Sun
Sili Huang
Siyuan Guo
Hechang Chen
Li Shen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
DiffM
OffRL
40
13
0
08 Jun 2023
Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction
G. Bono
L. Antsfeld
Assem Sadek
G. Monaci
Christian Wolf
SSL
30
5
0
06 Jun 2023
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data
Chongyi Zheng
Benjamin Eysenbach
Homer Walke
Patrick Yin
Kuan Fang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
29
4
0
06 Jun 2023
Active Vision Reinforcement Learning under Limited Visual Observability
Jinghuan Shang
Michael S. Ryoo
32
0
0
01 Jun 2023
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Rohan Chitnis
Yingchen Xu
B. Hashemi
Lucas Lehnert
Ürün Dogan
Zheqing Zhu
Olivier Delalleau
OffRL
23
9
0
01 Jun 2023
Learning Sampling Dictionaries for Efficient and Generalizable Robot Motion Planning with Transformers
Jacob J. Johnson
A. H. Qureshi
Michael C. Yip
25
12
0
01 Jun 2023
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
Qian Lin
Bo Tang
Zifan Wu
Chao Yu
Shangqin Mao
Qianlong Xie
Xingxing Wang
Dong Wang
OffRL
34
11
0
01 Jun 2023
SafeDiffuser: Safe Planning with Diffusion Probabilistic Models
Wei Xiao
Tsun-Hsuan Wang
Chuang Gan
Daniela Rus
DiffM
24
27
0
31 May 2023
Efficient Diffusion Policies for Offline Reinforcement Learning
Bingyi Kang
Xiao Ma
Chao Du
Tianyu Pang
Shuicheng Yan
OffRL
26
62
0
31 May 2023
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
Fei Ni
Jianye Hao
Yao Mu
Yifu Yuan
Yan Zheng
Bin Wang
Zhixuan Liang
DiffM
OffRL
59
42
0
31 May 2023
Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation
Yingyi Chen
Qinghua Tao
F. Tonin
Johan A. K. Suykens
31
19
0
31 May 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
19
7
0
30 May 2023
Future-conditioned Unsupervised Pretraining for Decision Transformer
Zhihui Xie
Zichuan Lin
Deheng Ye
Qiang Fu
Wei Yang
Shuai Li
OffRL
OnRL
40
22
0
26 May 2023
Emergent Agentic Transformer from Chain of Hindsight Experience
Hao Liu
Pieter Abbeel
OffRL
27
25
0
26 May 2023
Beyond Reward: Offline Preference-guided Policy Optimization
Yachen Kang
Dingxu Shi
Jinxin Liu
Li He
Donglin Wang
OffRL
24
31
0
25 May 2023
Koopman Kernel Regression
Petar Bevanda
Maximilian Beier
Armin Lederer
Stefan Sosnowski
Eyke Hüllermeier
Sandra Hirche
AI4TS
19
16
0
25 May 2023
Think Before You Act: Decision Transformers with Working Memory
Jikun Kang
Romain Laroche
Xingdi Yuan
Adam Trischler
Xuefei Liu
Jie Fu
OffRL
19
0
0
24 May 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
36
90
0
19 May 2023
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
Wenhao Ding
Tong Che
Ding Zhao
Marco Pavone
BDL
OffRL
14
2
0
18 May 2023
A Generalist Dynamics Model for Control
Ingmar Schubert
Jingwei Zhang
Jake Bruce
Sarah Bechtle
Emilio Parisotto
Martin Riedmiller
Jost Tobias Springenberg
Arunkumar Byravan
Leonard Hasenclever
N. Heess
AI4CE
33
28
0
18 May 2023