ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.02039
  4. Cited By
Offline Reinforcement Learning as One Big Sequence Modeling Problem

Offline Reinforcement Learning as One Big Sequence Modeling Problem

3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

50 / 465 papers shown
Title
Decision Transformer as a Foundation Model for Partially Observable
  Continuous Control
Decision Transformer as a Foundation Model for Partially Observable Continuous Control
Xiangyuan Zhang
Weichao Mao
Haoran Qiu
Tamer Basar
OffRL
AI4CE
24
5
0
03 Apr 2024
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept,
  Taxonomy, and Methods
Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
Yuji Cao
Huan Zhao
Yuheng Cheng
Ting Shu
Guolong Liu
Gaoqi Liang
Junhua Zhao
Yun Li
LLMAG
KELM
OffRL
LM&Ro
35
48
0
30 Mar 2024
Retentive Decision Transformer with Adaptive Masking for Reinforcement
  Learning based Recommendation Systems
Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems
Siyu Wang
Xiaocong Chen
Lina Yao
OffRL
31
1
0
26 Mar 2024
Diffusion Model for Data-Driven Black-Box Optimization
Diffusion Model for Data-Driven Black-Box Optimization
Zihao Li
Hui Yuan
Kaixuan Huang
Chengzhuo Ni
Yinyu Ye
Minshuo Chen
Mengdi Wang
DiffM
32
9
0
20 Mar 2024
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination
  for Simulated-World Control
MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control
Enshen Zhou
Yiran Qin
Zhen-fei Yin
Yuzhou Huang
Ruimao Zhang
Lu Sheng
Yu Qiao
Jing Shao
LM&Ro
AI4CE
37
32
0
18 Mar 2024
Reinforcement Learning with Token-level Feedback for Controllable Text
  Generation
Reinforcement Learning with Token-level Feedback for Controllable Text Generation
Wendi Li
Wei Wei
Kaihe Xu
Wenfeng Xie
Dangyang Chen
Yu Cheng
41
7
0
18 Mar 2024
Scenario Engineering for Autonomous Transportation: A New Stage in
  Open-Pit Mines
Scenario Engineering for Autonomous Transportation: A New Stage in Open-Pit Mines
Siyu Teng
Xuan Li
Yuchen Li
Zhe Xuanyuan
Yunfeng Ai
Long Chen
29
8
0
15 Mar 2024
Symbiotic Game and Foundation Models for Cyber Deception Operations in
  Strategic Cyber Warfare
Symbiotic Game and Foundation Models for Cyber Deception Operations in Strategic Cyber Warfare
Tao Li
Quanyan Zhu
AAML
32
5
0
14 Mar 2024
In-context Exploration-Exploitation for Reinforcement Learning
In-context Exploration-Exploitation for Reinforcement Learning
Zhenwen Dai
Federico Tomasi
Sina Ghiassian
OffRL
OnRL
38
3
0
11 Mar 2024
tsGT: Stochastic Time Series Modeling With Transformer
tsGT: Stochastic Time Series Modeling With Transformer
Lukasz Kuciñski
Witold Drzewakowski
Mateusz Olko
Piotr Kozakowski
Lukasz Maziarka
Marta Emilia Nowakowska
Lukasz Kaiser
Piotr Milo's
44
1
0
08 Mar 2024
Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence
  Modeling Problem
Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem
Ceyao Zhang
Renjie Li
Cheng Zhang
Zhaoyu Zhang
Feng Yin
29
0
0
08 Mar 2024
Inference via Interpolation: Contrastive Representations Provably Enable
  Planning and Inference
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
Benjamin Eysenbach
Vivek Myers
Ruslan Salakhutdinov
Sergey Levine
AI4TS
39
8
0
06 Mar 2024
Deep Reinforcement Learning for Solving Management Problems: Towards A
  Large Management Mode
Deep Reinforcement Learning for Solving Management Problems: Towards A Large Management Mode
Jinyang Jiang
Xiaotian Liu
Tao Ren
Qinghao Wang
Yi Zheng
Yufu Du
Yijie Peng
Cheng Zhang
OffRL
AI4CE
30
0
0
01 Mar 2024
Intensive Care as One Big Sequence Modeling Problem
Intensive Care as One Big Sequence Modeling Problem
Vadim Liventsev
Tobias Fritz
20
1
0
27 Feb 2024
PIDformer: Transformer Meets Control Theory
PIDformer: Transformer Meets Control Theory
Tam Nguyen
César A. Uribe
Tan-Minh Nguyen
Richard G. Baraniuk
48
7
0
25 Feb 2024
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human
  Racing Gameplay
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver
Chen Tang
Ce Hao
Kenta Kawamoto
Masayoshi Tomizuka
Wei Zhan
OffRL
32
0
0
22 Feb 2024
Beyond A*: Better Planning with Transformers via Search Dynamics
  Bootstrapping
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
Lucas Lehnert
Sainbayar Sukhbaatar
DiJia Su
Qinqing Zheng
Paul Mcvay
Michael Rabbat
Yuandong Tian
27
52
0
21 Feb 2024
Tiny Reinforcement Learning for Quadruped Locomotion using Decision
  Transformers
Tiny Reinforcement Learning for Quadruped Locomotion using Decision Transformers
Orhan Eren Akgün
Néstor Cuevas
Matheus Farias
Daniel Garces
28
0
0
20 Feb 2024
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared
  Semantic Spaces
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces
Tianyu Zheng
Ge Zhang
Xingwei Qu
Ming Kuang
Stephen W. Huang
Zhaofeng He
OffRL
45
1
0
20 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandréa
21
11
0
15 Feb 2024
Large Language Models as Agents in Two-Player Games
Large Language Models as Agents in Two-Player Games
Yang Liu
Peng Sun
Hang Li
LLMAG
37
4
0
12 Feb 2024
Hierarchical Transformers are Efficient Meta-Reinforcement Learners
Hierarchical Transformers are Efficient Meta-Reinforcement Learners
Gresa Shala
André Biedenkapp
Josif Grabocka
OffRL
29
4
0
09 Feb 2024
Offline Risk-sensitive RL with Partial Observability to Enhance
  Performance in Human-Robot Teaming
Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming
Giorgio Angelotti
Caroline Ponzoni Carvalho Chanel
Adam H. M. Pinto
Christophe Lounis
C. Chauffaut
Nicolas Drougard
OffRL
11
2
0
08 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Roland Hafner
N. Heess
Martin Riedmiller
OffRL
LRM
27
11
0
08 Feb 2024
Do Transformer World Models Give Better Policy Gradients?
Do Transformer World Models Give Better Policy Gradients?
Michel Ma
Tianwei Ni
Clement Gehring
P. DÓro
Pierre-Luc Bacon
34
4
0
07 Feb 2024
Return-Aligned Decision Transformer
Return-Aligned Decision Transformer
Tsunehiko Tanaka
Kenshi Abe
Kaito Ariu
Tetsuro Morimura
Edgar Simo-Serra
OffRL
59
1
0
06 Feb 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
SEABO: A Simple Search-Based Method for Offline Imitation Learning
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
19
9
0
06 Feb 2024
Reinforcement Learning from Bagged Reward
Reinforcement Learning from Bagged Reward
Yuting Tang
Xin-Qiang Cai
Yao-Xiang Ding
Qiyu Wu
Guoqing Liu
Masashi Sugiyama
OffRL
24
0
0
06 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for
  Offline Reinforcement Learning
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
39
17
0
05 Feb 2024
Contrastive Diffuser: Planning Towards High Return States via
  Contrastive Learning
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning
Yixiang Shan
Zhengbang Zhu
Ting Long
Qifan Liang
Yi-Ju Chang
Weinan Zhang
Liang Yin
OffRL
34
1
0
05 Feb 2024
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based
  Trajectory Stitching
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
Guanghe Li
Yixiang Shan
Zhengbang Zhu
Ting Long
Weinan Zhang
OffRL
21
9
0
04 Feb 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
Lanqing Li
Hai Zhang
Xinyu Zhang
Shatong Zhu
Junqiao Zhao
Junqiao Zhao
Pheng-Ann Heng
OffRL
31
7
0
04 Feb 2024
NetLLM: Adapting Large Language Models for Networking
NetLLM: Adapting Large Language Models for Networking
Duo Wu
Xianda Wang
Yaqi Qiao
Zhi Wang
Junchen Jiang
Shuguang Cui
Fangxin Wang
32
30
0
04 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement
  Learning and Large Language Models
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
16
7
0
02 Feb 2024
Nested Construction of Polar Codes via Transformers
Nested Construction of Polar Codes via Transformers
S. Ankireddy
Ashwin Hebbar
Heping Wan
Joonyoung Cho
C. Zhang
21
5
0
30 Jan 2024
Multi-Object Navigation in real environments using hybrid policies
Multi-Object Navigation in real environments using hybrid policies
Assem Sadek
G. Bono
Boris Chidlovskii
A. Baskurt
Christian Wolf
45
5
0
24 Jan 2024
TraKDis: A Transformer-based Knowledge Distillation Approach for Visual
  Reinforcement Learning with Application to Cloth Manipulation
TraKDis: A Transformer-based Knowledge Distillation Approach for Visual Reinforcement Learning with Application to Cloth Manipulation
Wei Chen
Nicolás Rojas
38
7
0
24 Jan 2024
Closing the Gap between TD Learning and Supervised Learning -- A
  Generalisation Point of View
Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View
Raj Ghugare
Matthieu Geist
Glen Berseth
Benjamin Eysenbach
OffRL
25
14
0
20 Jan 2024
Simple Hierarchical Planning with Diffusion
Simple Hierarchical Planning with Diffusion
Chang Chen
Fei Deng
Kenji Kawaguchi
Çağlar Gülçehre
Sungjin Ahn
OffRL
DiffM
38
24
0
05 Jan 2024
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam
  Intensity Control in Mu2e
Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e
Chenwei Xu
Jerry Yao-Chieh Hu
A. Narayanan
M. Thieme
V. Nagaslaev
...
Rui Shi
S. Memik
A. Shuping
Kyle Hazelwood
Han Liu
19
2
0
28 Dec 2023
Critic-Guided Decision Transformer for Offline Reinforcement Learning
Critic-Guided Decision Transformer for Offline Reinforcement Learning
Yuanfu Wang
Chao Yang
Yinghong Wen
Yu Liu
Yu Qiao
OffRL
27
11
0
21 Dec 2023
In-Context Reinforcement Learning for Variable Action Spaces
In-Context Reinforcement Learning for Variable Action Spaces
Viacheslav Sinii
Alexander Nikulin
Vladislav Kurenkov
Ilya Zisman
Sergey Kolesnikov
16
14
0
20 Dec 2023
Emergence of In-Context Reinforcement Learning from Noise Distillation
Emergence of In-Context Reinforcement Learning from Noise Distillation
Ilya Zisman
Vladislav Kurenkov
Alexander Nikulin
Viacheslav Sinii
Sergey Kolesnikov
OffRL
33
9
0
19 Dec 2023
Explore 3D Dance Generation via Reward Model from Automatically-Ranked
  Demonstrations
Explore 3D Dance Generation via Reward Model from Automatically-Ranked Demonstrations
Zilin Wang
Hao-Wen Zhuang
Lu Li
Yinmin Zhang
Junjie Zhong
Jun Chen
Yu Yang
Boshi Tang
Zhiyong Wu
45
3
0
18 Dec 2023
Saturn Platform: Foundation Model Operations and Generative AI for
  Financial Services
Saturn Platform: Foundation Model Operations and Generative AI for Financial Services
Antonio Busson
Rennan Gaio
Rafael H. Rocha
Francisco Evangelista
Bruno Rizzi
Luan Carvalho
Rafael Miceli
Marcos Rabaioli
David Favaro
23
1
0
12 Dec 2023
Real-time Network Intrusion Detection via Decision Transformers
Real-time Network Intrusion Detection via Decision Transformers
Jingdi Chen
Hanhan Zhou
Yongsheng Mei
Gina Adam
Nathaniel D. Bastian
Tian-Shing Lan
16
15
0
12 Dec 2023
DiffVL: Scaling Up Soft Body Manipulation using Vision-Language Driven
  Differentiable Physics
DiffVL: Scaling Up Soft Body Manipulation using Vision-Language Driven Differentiable Physics
Zhiao Huang
Feng Chen
Yewen Pu
Chun-Tse Lin
Hao Su
Chuang Gan
16
4
0
11 Dec 2023
Toward Open-ended Embodied Tasks Solving
Toward Open-ended Embodied Tasks Solving
William Wei Wang
Dongqi Han
Xufang Luo
Yifei Shen
Charles X. Ling
Boyu Wang
Dongsheng Li
AI4CE
10
5
0
10 Dec 2023
Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text
  Games
Self-Supervised Behavior Cloned Transformers are Path Crawlers for Text Games
Ruoyao Wang
Peter Alexander Jansen
LRM
12
1
0
07 Dec 2023
PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play
PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play
Lili Chen
Shikhar Bahl
Deepak Pathak
15
41
0
07 Dec 2023
Previous
12345...8910
Next