Offline Reinforcement Learning as One Big Sequence Modeling Problem

3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
    OffRL

Papers citing "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

50 / 465 papers shown
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
30
108
0
18 Jan 2023
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling
Yilin Wen
Biao Luo
Yuqian Zhao
13
1
0
06 Jan 2023
Deep Spectral Q-learning with Application to Mobile Health
Yuhe Gao
C. Shi
R. Song
11
0
0
03 Jan 2023
Towards Modeling and Influencing the Dynamics of Human Learning
Ran Tian
M. Tomizuka
Anca Dragan
Andrea V. Bajcsy
21
18
0
02 Jan 2023
Transformer in Transformer as Backbone for Deep Reinforcement Learning
Hangyu Mao
Rui Zhao
Hao Chen
Jianye Hao
Yiqun Chen
Dong Li
Junge Zhang
Zhen Xiao
OffRL
31
8
0
30 Dec 2022
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
23
24
0
29 Dec 2022
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Ziyu Wan
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
J. Wang
AI4CE
16
10
0
24 Dec 2022
Local Policy Improvement for Recommender Systems
Dawen Liang
N. Vlassis
OffRL
11
3
0
22 Dec 2022
Scalable Diffusion Models with Transformers
William S. Peebles
Saining Xie
GNN
14
2,001
0
19 Dec 2022
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem
Chenyi Yu
Weinan Zhang
H. Lai
Zheng Tian
L. Kneip
Jun Wang
23
15
0
18 Dec 2022
Foundation models in brief: A historical, socio-technical focus
Johannes Schneider
VLM
21
9
0
17 Dec 2022
Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer
Hang Lai
Weinan Zhang
Xialin He
Chen Yu
Zheng Tian
Yong Yu
Jun Wang
14
20
0
15 Dec 2022
SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Benjamin Ellis
Jonathan Cook
S. Moalla
Mikayel Samvelyan
Mingfei Sun
Anuj Mahajan
Jakob N. Foerster
Shimon Whiteson
19
83
0
14 Dec 2022
Offline Reinforcement Learning with Closed-Form Policy Improvement Operators
Jiachen Li
Edwin Zhang
Ming Yin
Qinxun Bai
Yu-Xiang Wang
William Yang Wang
OffRL
31
15
0
29 Nov 2022
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
39
359
0
28 Nov 2022
Hypernetworks for Zero-shot Transfer in Reinforcement Learning
S. Rezaei-Shoshtari
Charlotte Morissette
F. Hogan
Gregory Dudek
D. Meger
OffRL
17
14
0
28 Nov 2022
How Crucial is Transformer in Decision Transformer?
Max Siebenborn
Boris Belousov
Junning Huang
Jan Peters
16
15
0
26 Nov 2022
A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation
Hiroki Furuta
Yusuke Iwasawa
Yutaka Matsuo
S. Gu
18
14
0
25 Nov 2022
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu
Hao Liu
Aditya Grover
Pieter Abbeel
OffRL
42
45
0
23 Nov 2022
UniMASK: Unified Inference in Sequential Decision Problems
Micah Carroll
Orr Paradise
Jessy Lin
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
24
21
0
20 Nov 2022
On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement Learning
S. Takagi
OffRL
18
7
0
17 Nov 2022
Contextual Transformer for Offline Meta Reinforcement Learning
Runji Lin
Ye Li
Xidong Feng
Zhaowei Zhang
Xian Hong Wu Fung
Haifeng Zhang
Jun Wang
Yali Du
Yaodong Yang
OffRL
18
6
0
15 Nov 2022
Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling
Daniel Lawson
A. H. Qureshi
17
7
0
11 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRL
OnRL
AI4CE
26
23
0
08 Nov 2022
Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning
D. Elbaz
Gal Novik
Oren Salzman
OffRL
19
0
0
06 Nov 2022
Grafting Vision Transformers
Jong Sung Park
Kumara Kahatapitiya
Donghyun Kim
Shivchander Sudalairaj
Quanfu Fan
Michael S. Ryoo
ViT
21
2
0
28 Oct 2022
Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks
Edwin Zhang
Yujie Lu
William Wang
Amy Zhang
DiffM
LM&Ro
24
16
0
27 Oct 2022
PlanT: Explainable Planning Transformers via Object-Level Representations
Katrin Renz
Kashyap Chitta
Otniel-Bogdan Mercea
A. Sophia Koepke
Zeynep Akata
Andreas Geiger
ViT
33
94
0
25 Oct 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktäschel
Edward Grefenstette
OffRL
53
8
0
23 Oct 2022
Transformers Learn Shortcuts to Automata
Bingbin Liu
Jordan T. Ash
Surbhi Goel
A. Krishnamurthy
Cyril Zhang
OffRL
LRM
32
155
0
19 Oct 2022
From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data
Zichen Jeff Cui
Yibin Wang
Nur Muhammad (Mahi) Shafiullah
Lerrel Pinto
LM&Ro
VGen
OffRL
25
89
0
18 Oct 2022
Boosting Offline Reinforcement Learning via Data Rebalancing
Yang Yue
Bingyi Kang
Xiao Ma
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
18
22
0
17 Oct 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
26
61
0
15 Oct 2022
Behavior Cloned Transformers are Neurosymbolic Reasoners
Ruoyao Wang
Peter Alexander Jansen
Marc-Alexandre Côté
Prithviraj Ammanabrolu
16
11
0
13 Oct 2022
Vision Transformers provably learn spatial structure
Samy Jelassi
Michael E. Sander
Yuanzhi Li
ViT
MLT
32
73
0
13 Oct 2022
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
Qinqing Zheng
Mikael Henaff
Brandon Amos
Aditya Grover
OffRL
18
20
0
12 Oct 2022
Designing Robust Transformers using Robust Kernel Density Estimation
Xing Han
Tongzheng Ren
T. Nguyen
Khai Nguyen
Joydeep Ghosh
Nhat Ho
21
6
0
11 Oct 2022
A Learning-Based Estimation and Control Framework for Contact-Intensive Tight-Tolerance Tasks
Bukun Son
Hyelim Choi
Jeamin Yoon
Dongjun Lee
28
0
0
11 Oct 2022
Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning
Tung Nguyen
Qinqing Zheng
Aditya Grover
OffRL
19
6
0
11 Oct 2022
Are All Vision Models Created Equal? A Study of the Open-Loop to Closed-Loop Causality Gap
Mathias Lechner
Ramin Hasani
Alexander Amini
Tsun-Hsuan Wang
T. Henzinger
Daniela Rus
CML
OOD
21
7
0
09 Oct 2022
State Advantage Weighting for Offline RL
Jiafei Lyu
Aicheng Gong
Le Wan
Zongqing Lu
Xiu Li
OffRL
33
9
0
09 Oct 2022
Large Language Models can Implement Policy Iteration
Ethan A. Brooks
Logan Walls
Richard L. Lewis
Satinder Singh
LM&Ro
OffRL
126
21
0
07 Oct 2022
VIMA: General Robot Manipulation with Multimodal Prompts
Yunfan Jiang
Agrim Gupta
Zichen Zhang
Guanzhi Wang
Yongqiang Dou
Yanjun Chen
Li Fei-Fei
Anima Anandkumar
Yuke Zhu
Linxi Fan
LM&Ro
24
334
0
06 Oct 2022
Learning to Learn with Generative Models of Neural Network Checkpoints
William S. Peebles
Ilija Radosavovic
Tim Brooks
Alexei A. Efros
Jitendra Malik
UQCV
73
64
0
26 Sep 2022
PACT: Perception-Action Causal Transformer for Autoregressive Robotics Pre-Training
Rogerio Bonatti
Sai H. Vemprala
Shuang Ma
Felipe Vieira Frujeri
Shuhang Chen
Ashish Kapoor
33
22
0
22 Sep 2022
Hierarchical Decision Transformer
André Rosa de Sousa Porfírio Correia
L. A. Alexandre
OffRL
90
10
0
21 Sep 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
D. Fox
LM&Ro
155
456
0
12 Sep 2022
Transformers are Sample-Efficient World Models
Vincent Micheli
Eloi Alonso
François Fleuret
VLM
OffRL
11
157
0
01 Sep 2022
Goal-Conditioned Q-Learning as Knowledge Distillation
Alexander Levine
S. Feizi
OffRL
17
2
0
28 Aug 2022
Efficient Planning in a Compact Latent Action Space
Zhengyao Jiang
Tianjun Zhang
Michael Janner
Yueying Li
Tim Rocktäschel
Edward Grefenstette
Yuandong Tian
OffRL
16
36
0
22 Aug 2022