Offline Reinforcement Learning as One Big Sequence Modeling Problem

3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
    OffRL

Papers citing "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

50 / 465 papers shown
Goal-Conditioned Supervised Learning with Sub-Goal Prediction
Tom Jurgenson
Aviv Tamar
21
1
0
17 May 2023
Neural Oscillators are Universal
S. Lanthaler
T. Konstantin Rusch
Siddhartha Mishra
27
9
0
15 May 2023
Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems
Bin Zhang
Hangyu Mao
Lijuan Li
Zhiwei Xu
Dapeng Li
Rui Zhao
Guoliang Fan
OffRL
31
5
0
13 May 2023
Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales
Brihi Joshi
Ziyi Liu
Sahana Ramnath
Aaron Chan
Zhewei Tong
Shaoliang Nie
Qifan Wang
Yejin Choi
Xiang Ren
HAI
LRM
21
28
0
11 May 2023
Learning Video-Conditioned Policies for Unseen Manipulation Tasks
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
27
18
0
10 May 2023
Explaining RL Decisions with Trajectories
Shripad Deshmukh
Arpan Dasgupta
Balaji Krishnamurthy
Nan Jiang
Chirag Agarwal
Georgios Theocharous
J. Subramanian
OffRL
23
3
0
06 May 2023
Federated Ensemble-Directed Offline Reinforcement Learning
Desik Rengarajan
N. Ragothaman
D. Kalathil
S. Shakkottai
OffRL
24
1
0
04 May 2023
Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL
Baiting Zhu
Meihua Dang
Aditya Grover
OffRL
66
23
0
30 Apr 2023
Representation Matters: The Game of Chess Poses a Challenge to Vision Transformers
Johannes Czech
Jannis Blüml
Kristian Kersting
ViT
50
0
0
28 Apr 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Philippe Hansen-Estruch
Ilya Kostrikov
Michael Janner
J. Kuba
Sergey Levine
OffRL
20
128
0
20 Apr 2023
Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions
Lina Mezghani
Piotr Bojanowski
Alahari Karteek
Sainbayar Sukhbaatar
LM&Ro
OffRL
LRM
13
8
0
18 Apr 2023
Pretrained Language Models as Visual Planners for Human Assistance
Dhruvesh Patel
H. Eghbalzadeh
Nitin Kamra
Michael L. Iuzzolino
Unnat Jain
Ruta Desai
LM&Ro
19
24
0
17 Apr 2023
Hyper-Decision Transformer for Efficient Online Policy Adaptation
Mengdi Xu
Yuchen Lu
Yikang Shen
Shun Zhang
Ding Zhao
Chuang Gan
OffRL
23
39
0
17 Apr 2023
Causal Decision Transformer for Recommender Systems via Offline Reinforcement Learning
Siyu Wang
Xiaocong Chen
Dietmar Jannach
Lina Yao
CML
OffRL
13
27
0
17 Apr 2023
ENTL: Embodied Navigation Trajectory Learner
Klemen Kotar
Aaron Walsman
Roozbeh Mottaghi
15
6
0
05 Apr 2023
Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning
Tongzhou Wang
Antonio Torralba
Phillip Isola
Amy Zhang
OffRL
24
31
0
03 Apr 2023
Chain-of-Thought Predictive Control
Zhiwei Jia
Vineet Thumuluri
Fangchen Liu
Ling-Hao Chen
Zhiao Huang
H. Su
LM&Ro
28
20
0
03 Apr 2023
Planning with Sequence Models through Iterative Energy Minimization
Hongyi Chen
Yilun Du
Yiye Chen
J. Tenenbaum
Patricio A. Vela
20
6
0
28 Mar 2023
The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers
Valentin Macé
Raphael Boige
Félix Chalumeau
Thomas Pierrot
Guillaume Richard
Nicolas Perrin-Gilbert
OffRL
32
12
0
27 Mar 2023
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics
Sizhe Li
Zhiao Huang
Tao Chen
Tao Du
Hao Su
J. Tenenbaum
Chuang Gan
78
19
0
27 Mar 2023
A Survey of Demonstration Learning
André Rosa de Sousa Porfírio Correia
Luís A. Alexandre
OffRL
28
17
0
20 Mar 2023
DataLight: Offline Data-Driven Traffic Signal Control
L. Zhang
Yutong Zhang
J. Deng
Chen Li
OffRL
8
0
0
20 Mar 2023
HIVE: Harnessing Human Feedback for Instructional Visual Editing
Shu Zhen Zhang
Xinyi Yang
Yihao Feng
Can Qin
Chia-Chih Chen
...
Haiquan Wang
Silvio Savarese
Stefano Ermon
Caiming Xiong
Ran Xu
18
103
0
16 Mar 2023
A Picture is Worth a Thousand Words: Language Models Plan from Pixels
Anthony Z. Liu
Lajanugen Logeswaran
Sungryull Sohn
Honglak Lee
LM&Ro
14
6
0
16 Mar 2023
Architext: Language-Driven Generative Architecture Design
Theodoros Galanos
Antonios Liapis
Georgios N. Yannakakis
VLM
AI4CE
26
6
0
13 Mar 2023
Transformer-based World Models Are Happy With 100k Interactions
Jan Robine
Marc Höftmann
Tobias Uelwer
Stefan Harmeling
OffRL
16
68
0
13 Mar 2023
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro
OffRL
LRM
AI4CE
90
154
0
07 Mar 2023
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning
Pengqin Wang
Meixin Zhu
Shaojie Shen
OffRL
20
1
0
07 Mar 2023
Graph Decision Transformer
Shengchao Hu
Li Shen
Ya-Qin Zhang
Dacheng Tao
OffRL
28
15
0
07 Mar 2023
Decision Transformer under Random Frame Dropping
Kaizhe Hu
Rachel Zheng
Yang Gao
Huazhe Xu
OffRL
126
12
0
03 Mar 2023
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning
Yuanying Cai
Chuheng Zhang
Wei Shen
Xuyun Zhang
Wenjie Ruan
Longbo Huang
OffRL
32
4
0
03 Mar 2023
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning
Carolin Schmidt
Daniele Gammelli
Francisco Câmara Pereira
Filipe Rodrigues
OffRL
9
4
0
28 Feb 2023
ChatGPT for Robotics: Design Principles and Model Abilities
Sai H. Vemprala
Rogerio Bonatti
A. Bucker
Ashish Kapoor
LM&Ro
28
458
0
20 Feb 2023
Pretraining Language Models with Human Preferences
Tomasz Korbak
Kejian Shi
Angelica Chen
Rasika Bhalerao
C. L. Buckley
Jason Phang
Sam Bowman
Ethan Perez
ALM
SyDa
30
205
0
16 Feb 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Yi-Fan Yao
Zhepeng Cen
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
26
46
0
14 Feb 2023
Transformer models: an introduction and catalog
X. Amatriain
Ananth Sankar
Jie Bing
Praveen Kumar Bodigutla
Timothy J. Hazen
Michaeel Kazi
24
50
0
12 Feb 2023
Language Decision Transformers with Exponential Tilt for Interactive Text Environments
Nicolas Angelard-Gontier
Pau Rodríguez López
I. Laradji
David Vazquez
C. Pal
OffRL
18
1
0
10 Feb 2023
The Wisdom of Hindsight Makes Language Models Better Instruction Followers
Tianjun Zhang
Fangchen Liu
Justin Wong
Pieter Abbeel
Joseph E. Gonzalez
16
43
0
10 Feb 2023
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners
Zhixuan Liang
Yao Mu
Mingyu Ding
Fei Ni
M. Tomizuka
Ping Luo
66
99
0
03 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
34
62
0
02 Feb 2023
Clinical Decision Transformer: Intended Treatment Recommendation through Goal Prompting
Seunghyun Lee
D. Lee
Sujeong Im
N. Kim
Sung-Min Park
14
11
0
01 Feb 2023
Learning Universal Policies via Text-Guided Video Generation
Yilun Du
Mengjiao Yang
Bo Dai
H. Dai
Ofir Nachum
J. Tenenbaum
Dale Schuurmans
Pieter Abbeel
PINN
LM&Ro
13
231
0
31 Jan 2023
Skill Decision Transformer
Shyam Sudhakaran
S. Risi
OffRL
18
5
0
31 Jan 2023
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining
Deyao Zhu
Yuhui Wang
Jürgen Schmidhuber
Mohamed Elhoseiny
OffRL
OnRL
36
8
0
30 Jan 2023
Direct Preference-based Policy Optimization without Reward Modeling
Gaon An
Junhyeok Lee
Xingdong Zuo
Norio Kosaka
KyungHyun Kim
Hyun Oh Song
OffRL
24
26
0
30 Jan 2023
SaFormer: A Conditional Sequence Modeling Approach to Offline Safe Reinforcement Learning
Q. Zhang
Linrui Zhang
Haoran Xu
Li Shen
Bowen Wang
Yongzhe Chang
Xueqian Wang
Bo Yuan
Dacheng Tao
OffRL
15
16
0
28 Jan 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRL
OnRL
24
7
0
27 Jan 2023
SMART: Self-supervised Multi-task pretrAining with contRol Transformers
Yanchao Sun
Shuang Ma
Ratnesh Madaan
Rogerio Bonatti
Furong Huang
Ashish Kapoor
33
39
0
24 Jan 2023
Learning to View: Decision Transformers for Active Object Detection
Wenhao Ding
Nathalie Majcherczyk
Mohit Deshpande
Xuewei Qi
Ding Zhao
R. Madhivanan
Arnie Sen
OffRL
6
12
0
23 Jan 2023