ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.02039
  4. Cited By
Offline Reinforcement Learning as One Big Sequence Modeling Problem
v1v2v3v4 (latest)

Offline Reinforcement Learning as One Big Sequence Modeling Problem

Neural Information Processing Systems (NeurIPS), 2021
3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
    OffRL
ArXiv (abs)PDFHTMLHuggingFace (2 upvotes)

Papers citing "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

50 / 530 papers shown
Towards Understanding Transformers in Learning Random Walks
Towards Understanding Transformers in Learning Random Walks
Wei Shi
Yuan Cao
133
1
0
28 Nov 2025
Efficient Diffusion Planning with Temporal Diffusion
Efficient Diffusion Planning with Temporal Diffusion
Jiaming Guo
Rui Zhang
Z. Li
Yunkai Gao
Shaohui Peng
Siming Lan
Xing Hu
Zidong Du
Xishan Zhang
Ling Li
DiffM
219
0
0
26 Nov 2025
Dynamical Properties of Tokens in Self-Attention and Effects of Positional Encoding
Dynamical Properties of Tokens in Self-Attention and Effects of Positional Encoding
Duy-Tung Pham
A. Nguyen
Viet-Hoang Tran
Nhan-Phu Chung
Xin T. Tong
T. Nguyen
Thieu N. Vo
117
0
0
25 Nov 2025
SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
Yuxuan Wang
Haobin Jiang
Shiqing Yao
Ziluo Ding
Zongqing Lu
LM&Ro
450
4
0
24 Nov 2025
A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms
A Comparison Between Decision Transformers and Traditional Offline Reinforcement Learning Algorithms
Ali Murtaza Caunhye
Asad Jeewa
190
0
0
20 Nov 2025
Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
Xinming Gao
Shangzhe Li
Yujin Cai
Wenwu Yu
OffRLGP
164
0
0
15 Nov 2025
Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning
Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning
Daniel De Dios Allegue
J. He
F. Oliehoek
OffRL
338
0
0
10 Nov 2025
Towards Reinforcement Learning Based Log Loading Automation
Towards Reinforcement Learning Based Log Loading Automation
Ilya Kurinov
Miroslav Ivanov
Grzegorz Orzechowski
A. Mikkola
103
0
0
30 Oct 2025
Online Optimization for Offline Safe Reinforcement Learning
Online Optimization for Offline Safe Reinforcement Learning
Yassine Chemingui
Aryan Deshwal
Alan Fern
Thanh Nguyen-Tang
J. Doppa
OffRL
179
0
0
24 Oct 2025
Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures
Modeling Expert Interactions in Sparse Mixture of Experts via Graph Structures
Minh Khoi Nguyen Nhat
R. Teo
Laziz U. Abdullaev
Maurice Mok
Viet-Hoang Tran
T. Nguyen
MoE
227
0
0
18 Oct 2025
A New Perspective on Transformers in Online Reinforcement Learning for Continuous Control
A New Perspective on Transformers in Online Reinforcement Learning for Continuous Control
Nikita Kachaev
Daniil Zelezetsky
Egor Cherepanov
Alexey K. Kovelev
Aleksandr I. Panov
OffRL
192
2
0
15 Oct 2025
Robust Adversarial Reinforcement Learning in Stochastic Games via Sequence Modeling
Robust Adversarial Reinforcement Learning in Stochastic Games via Sequence Modeling
Xiaohang Tang
Zhuowen Cheng
Satyabrat Kumar
140
0
0
13 Oct 2025
Learning with Incomplete Context: Linear Contextual Bandits with Pretrained Imputation
Learning with Incomplete Context: Linear Contextual Bandits with Pretrained Imputation
Hao Yan
Heyan Zhang
Yongyi Guo
213
0
0
10 Oct 2025
Expressive Value Learning for Scalable Offline Reinforcement Learning
Expressive Value Learning for Scalable Offline Reinforcement Learning
Nicolas Espinosa-Dice
Kianté Brantley
Wen Sun
OffRL
308
1
0
09 Oct 2025
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
Evgenii Opryshko
Junwei Quan
C. Voelcker
Yilun Du
Igor Gilitschenski
OffRL
171
3
0
08 Oct 2025
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization
Kai Fukazawa
Kunal Mundada
Iman Soltani
OffRL
219
0
0
03 Oct 2025
Integrating Offline Pre-Training with Online Fine-Tuning: A Reinforcement Learning Approach for Robot Social Navigation
Integrating Offline Pre-Training with Online Fine-Tuning: A Reinforcement Learning Approach for Robot Social Navigation
Run Su
Hao Fu
Shuai Zhou
Yingao Fu
OffRLOnRL
264
0
0
01 Oct 2025
CroSTAta: Cross-State Transition Attention Transformer for Robotic Manipulation
CroSTAta: Cross-State Transition Attention Transformer for Robotic Manipulation
Giovanni Minelli
Giulio Turrisi
Victor Barasuol
Claudio Semini
194
0
0
01 Oct 2025
Accelerating Transformers in Online RL
Accelerating Transformers in Online RL
Daniil Zelezetsky
A. Kovalev
Aleksandr I. Panov
OffRL
159
0
0
30 Sep 2025
MUVLA: Learning to Explore Object Navigation via Map Understanding
MUVLA: Learning to Explore Object Navigation via Map Understanding
Peilong Han
Fan Jia
Min Zhang
Yutao Qiu
Hongyao Tang
Yan Zheng
Tiancai Wang
Jianye Hao
154
2
0
30 Sep 2025
Understanding and Enhancing the Planning Capability of Language Models via Multi-Token Prediction
Understanding and Enhancing the Planning Capability of Language Models via Multi-Token Prediction
Qimin Zhong
Hao Liao
Siwei Wang
Mingyang Zhou
X. Wu
Rui Mao
Wei Chen
291
3
0
27 Sep 2025
Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning
Structural Information-based Hierarchical Diffusion for Offline Reinforcement Learning
Xianghua Zeng
Hao Peng
Angsheng Li
Yicheng Pan
OffRL
166
1
0
26 Sep 2025
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
Zongyue Li
Xiao Han
Yusong Li
Niklas Strauss
Matthias Schubert
OffRL
162
0
0
23 Sep 2025
Mental Accounts for Actions: EWA-Inspired Attention in Decision Transformers
Mental Accounts for Actions: EWA-Inspired Attention in Decision Transformers
Zahra Aref
Narayan B. Mandayam
OffRL
179
0
0
19 Sep 2025
An Uncertainty-Weighted Decision Transformer for Navigation in Dense, Complex Driving Scenarios
An Uncertainty-Weighted Decision Transformer for Navigation in Dense, Complex Driving Scenarios
Zhihao Zhang
Chengyang Peng
Minghao Zhu
Ekim Yurtsever
Keith A. Redmill
184
2
0
16 Sep 2025
Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Data
Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Data
Jesse van Remmerden
Zaharah Bukhsh
Yingqian Zhang
OffRLOnRL
290
1
0
12 Sep 2025
floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL
floq: Training Critics via Flow-Matching for Scaling Compute in Value-Based RL
Bhavya Agrawalla
Michal Nauman
Khush Agarwal
Aviral Kumar
OffRL
278
11
0
08 Sep 2025
Wavelet Fourier Diffuser: Frequency-Aware Diffusion Model for Reinforcement Learning
Wavelet Fourier Diffuser: Frequency-Aware Diffusion Model for Reinforcement Learning
Yifu Luo
Yongzhe Chang
Xueqian Wang
195
2
0
04 Sep 2025
Generative Auto-Bidding in Large-Scale Competitive Auctions via Diffusion Completer-Aligner
Generative Auto-Bidding in Large-Scale Competitive Auctions via Diffusion Completer-Aligner
Yewen Li
Jingtong Gao
Nan Jiang
Shuai Mao
Ruyi An
Fei Pan
Xiangyu Zhao
Bo An
Qingpeng Cai
Peng Jiang
183
3
0
03 Sep 2025
Generative Sequential Notification Optimization via Multi-Objective Decision Transformers
Generative Sequential Notification Optimization via Multi-Objective Decision Transformers
Borja Ocejo
Ruofan Wang
Ke Liu
Rohit K. Patra
Haotian Shen
David Liu
Yiwen Yuan
Gokulraj Mohanasundaram
Fedor Borisyuk
Prakruthi Prabhakar
OffRL
285
0
0
02 Sep 2025
Learning to Ask: Decision Transformers for Adaptive Quantitative Group Testing
Learning to Ask: Decision Transformers for Adaptive Quantitative Group Testing
Mahdi Soleymani
Tara Javidi
205
0
0
01 Sep 2025
Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning
Tan Jing
Xiaorui Li
Chao Yao
Xiaojuan Ban
Yuetong Fang
Zhanchen Zhu
Zhaolin Yuan
OffRL
171
0
0
27 Aug 2025
Re:Frame -- Retrieving Experience From Associative Memory
Re:Frame -- Retrieving Experience From Associative Memory
Daniil Zelezetsky
Egor Cherepanov
A. Kovalev
Aleksandr Panov
OffRL
108
1
0
26 Aug 2025
Double Check My Desired Return: Transformer with Target Alignment for Offline Reinforcement Learning
Double Check My Desired Return: Transformer with Target Alignment for Offline Reinforcement Learning
Yue Pei
Hongming Zhang
Chao Gao
Martin Müller
Mengxiao Zhu
Hao Sheng
Ziliang Chen
Liang Lin
Haogang Zhu
OffRL
222
0
0
22 Aug 2025
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation
Yongjie Bai
Zhouxia Wang
Wenshu Fan
Weixing Chen
Ziliang Chen
...
Yongsen Zheng
Lingbo Liu
Guanbin Li
Guanbin Li
Liang Lin
498
1
0
07 Aug 2025
CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation
CLASS: Contrastive Learning via Action Sequence Supervision for Robot Manipulation
Sung-Wook Lee
Xuhui Kang
Brandon Yang
Yen-Ling Kuo
SSL
239
4
0
03 Aug 2025
GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration
GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration
Li Mi
Manon Bechaz
Zeming Chen
Antoine Bosselut
D. Tuia
227
1
0
31 Jul 2025
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
RAD: Retrieval High-quality Demonstrations to Enhance Decision-making
Lu Guo
Yixiang Shan
Zhengbang Zhu
Qifan Liang
Lichang Song
Ting Long
Weinan Zhang
Yi-Ju Chang
OffRL
257
0
0
21 Jul 2025
Towards Bio-Inspired Robotic Trajectory Planning via Self-Supervised RNN
Towards Bio-Inspired Robotic Trajectory Planning via Self-Supervised RNNInternational Conference on Artificial Neural Networks (ICANN), 2025
Miroslav Cibula
Kristína Malinovská
Matthias Kerzel
SSL
212
0
0
02 Jul 2025
TransDreamerV3: Implanting Transformer In DreamerV3
TransDreamerV3: Implanting Transformer In DreamerV3
Shruti Sadanand Dongare
Amun Kharel
Jonathan Samuel
Xiaona Zhou
154
0
0
20 Jun 2025
Scaling Algorithm Distillation for Continuous Control with Mamba
Scaling Algorithm Distillation for Continuous Control with Mamba
Samuel Beaussant
Mehdi Mounsif
269
0
0
16 Jun 2025
SAIL: Faster-than-Demonstration Execution of Imitation Learning Policies
SAIL: Faster-than-Demonstration Execution of Imitation Learning Policies
Nadun Ranawaka Arachchige
Zhenyang Chen
Wonsuhk Jung
Woo Chul Shin
Rohan Bansal
...
Yu Hang He
Yingyang Celine Lin
Benjamin Joffe
Shreyas Kousik
Danfei Xu
312
13
0
13 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TSOffRLAI4CE
395
5
0
10 Jun 2025
How to Provably Improve Return Conditioned Supervised Learning?
Zhishuai Liu
Yu Yang
Ruhan Wang
Pan Xu
Dongruo Zhou
OffRL
274
1
0
10 Jun 2025
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
Hao Hu
Xinqi Wang
Simon S. Du
OffRL
393
0
0
10 Jun 2025
Accelerating Diffusion Planners in Offline RL via Reward-Aware Consistency Trajectory Distillation
Accelerating Diffusion Planners in Offline RL via Reward-Aware Consistency Trajectory Distillation
Xintong Duan
Yutong He
Fahim Tajwar
Ruslan Salakhutdinov
J. Zico Kolter
J. Schneider
OffRL
385
1
0
09 Jun 2025
Local Manifold Approximation and Projection for Manifold-Aware Diffusion Planning
Local Manifold Approximation and Projection for Manifold-Aware Diffusion Planning
Kyowoon Lee
Jaesik Choi
DiffM
396
5
0
01 Jun 2025
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Yilun Kong
Guozheng Ma
Qi Zhao
Haoyu Wang
Li Shen
Xueqian Wang
Dacheng Tao
MoEOffRL
253
6
0
30 May 2025
Normalizing Flows are Capable Models for RL
Normalizing Flows are Capable Models for RL
Raj Ghugare
Benjamin Eysenbach
OffRLAI4CE
421
13
0
29 May 2025
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL
BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RLInternational Conference on Learning Representations (ICLR), 2025
Yu-Heng Hung
Kai-Jie Lin
Yu-Heng Lin
Chien-Yi Wang
Cheng Sun
Ping-Chun Hsieh
381
6
0
28 May 2025
1234...91011
Next
Page 1 of 11
Pageof 11