Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.02039
Cited By
Offline Reinforcement Learning as One Big Sequence Modeling Problem
3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Offline Reinforcement Learning as One Big Sequence Modeling Problem"
50 / 464 papers shown
Title
DSADF: Thinking Fast and Slow for Decision Making
Alex Zhihao Dou
Dongfei Cui
Jun Yan
W. Wang
Benteng Chen
Haoming Wang
Zeke Xie
Shufei Zhang
OffRL
36
0
0
13 May 2025
ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning
Hongyin Zhang
Zifeng Zhuang
H. Zhao
Pengxiang Ding
Hongchao Lu
Donglin Wang
OffRL
44
0
0
12 May 2025
UniCO: Towards a Unified Model for Combinatorial Optimization Problems
Zefang Zong
Xiaochen Wei
Guozhen Zhang
Chen Gao
Huandong Wang
Yong Li
29
0
0
07 May 2025
Latent Adaptive Planner for Dynamic Manipulation
Donghun Noh
Deqian Kong
Minglu Zhao
Andrew Lizarraga
Jianwen Xie
Ying Nian Wu
Dennis W. Hong
100
0
0
06 May 2025
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu
Sili Huang
Z. Yang
Shengchao Hu
Li Shen
H. Chen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
OffRL
109
0
0
03 May 2025
NGENT: Next-Generation AI Agents Must Integrate Multi-Domain Abilities to Achieve Artificial General Intelligence
Zhicong Li
Hangyu Mao
Jiangjin Yin
Mingzhe Xing
Zhiwei Xu
Yuanxing Zhang
Yang Xiao
29
0
0
30 Apr 2025
Offline Learning of Controllable Diverse Behaviors
Mathieu Petitbois
Rémy Portelas
Sylvain Lamprier
Ludovic Denoyer
OffRL
34
0
0
25 Apr 2025
Towards Forceful Robotic Foundation Models: a Literature Survey
William Xie
N. Correll
OffRL
56
1
0
16 Apr 2025
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Wei Huang
Qinying Gu
Nanyang Ye
OffRL
29
1
0
04 Apr 2025
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners
Wen Zheng Terence Ng
Jianda Chen
Yuan Xu
Tianwei Zhang
37
0
0
24 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
41
0
0
20 Mar 2025
Masked Sensory-Temporal Attention for Sensor Generalization in Quadruped Locomotion
Dikai Liu
Tianwei Zhang
Jianxiong Yin
Simon See
85
1
0
13 Mar 2025
Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation
Hiroshi Takahashi
Tomoharu Iwata
Atsutoshi Kumagai
Yuuki Yamanaka
Tomoya Yamashita
DiffM
65
0
0
05 Mar 2025
Target Return Optimizer for Multi-Game Decision Transformer
Kensuke Tatematsu
Akifumi Wachi
OffRL
64
0
0
04 Mar 2025
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
Teng Pang
Bingzheng Wang
Guoqiang Wu
Yilong Yin
OffRL
68
0
0
03 Mar 2025
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz U. Abdullaev
Tan M. Nguyen
39
2
0
02 Mar 2025
What Makes a Good Diffusion Planner for Decision Making?
Haofei Lu
Dongqi Han
Yifei Shen
Dongsheng Li
DiffM
33
3
0
01 Mar 2025
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Baiting Luo
Ava Pettet
Aron Laszka
A. Dubey
Ayan Mukhopadhyay
OffRL
43
1
0
28 Feb 2025
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments
Jinghao Xin
Zhichao Liang
Zihuan Zhang
Peng Wang
Ning Li
59
0
0
27 Feb 2025
Revisiting Kernel Attention with Correlated Gaussian Process Representation
Long Minh Bui
Tho Tran Huu
Duy-Tung Dinh
T. Nguyen
Trong Nghia Hoang
44
2
0
27 Feb 2025
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Jaehyeon Son
Soochan Lee
Gunhee Kim
OffRL
72
1
0
26 Feb 2025
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Yiqin Yang
Quanwei Wang
Chenghao Li
Hao Hu
Chengjie Wu
...
Dianyu Zhong
Ziyou Zhang
Qianchuan Zhao
Chongjie Zhang
Xu Bo
OffRL
47
0
0
26 Feb 2025
PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning
Kun Hu
Muning Wen
X. Wang
S. Zhang
Yiwei Shi
Minne Li
Minglong Li
Ying Wen
42
0
0
23 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
G. Klambauer
Razvan Pascanu
Sepp Hochreiter
73
4
0
21 Feb 2025
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Elias B. Kosmatopoulos
LM&Ro
75
0
0
18 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
62
1
0
17 Feb 2025
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches
D. Elbaz
Oren Salzman
OffRL
32
0
0
13 Feb 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
138
0
0
10 Feb 2025
Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu
Yifei Shen
Dongsheng Li
Junliang Xing
Dongqi Han
62
0
0
10 Feb 2025
Utilizing Novelty-based Evolution Strategies to Train Transformers in Reinforcement Learning
Matyáš Lorenc
OffRL
67
0
0
10 Feb 2025
Towards Robust Spacecraft Trajectory Optimization via Transformers
Yuji Takubo
T. Guffanti
Daniele Gammelli
Marco Pavone
Simone DÁmico
69
4
0
28 Jan 2025
Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning
Zhihao Zhang
Ekim Yurtsever
Keith A. Redmill
33
0
0
28 Jan 2025
Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning
Matyáš Lorenc
38
1
0
23 Jan 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
66
1
0
17 Jan 2025
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Chongming Gao
Kexin Huang
Ziang Fei
Jiaju Chen
J. Chen
Jianshan Sun
Shuchang Liu
Qingpeng Cai
Peng Jiang
OffRL
34
0
0
13 Jan 2025
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors
Niels Justesen
Maria Kaselimi
Sam Snodgrass
Miruna Vozaru
Matthew Schlegel
...
Albert Wang
Christoffer Holmgård
Georgios N. Yannakakis
S. Risi
Julian Togelius
39
0
0
03 Jan 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffM
OffRL
78
33
0
03 Jan 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Z. Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
100
1
0
22 Dec 2024
AdaCred: Adaptive Causal Decision Transformers with Feature Crediting
Hemant Kumawat
Saibal Mukhopadhyay
62
0
0
19 Dec 2024
Embodied CoT Distillation From LLM To Off-the-shelf Agents
Wonje Choi
Woo Kyung Kim
Minjong Yoo
Honguk Woo
OffRL
LM&Ro
108
2
0
16 Dec 2024
Advances in Transformers for Robotic Applications: A Review
Nikunj Sanghai
Nik Bear Brown
AI4CE
75
0
0
13 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
M. K. Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
95
4
0
09 Dec 2024
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep P. Chinchali
Ufuk Topcu
OffRL
89
0
0
02 Dec 2024
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance
Zhe Wang
Haozhu Wang
Yanjun Qi
OffRL
76
0
0
01 Dec 2024
Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
Zhihong Liu
Long Qian
Zeyang Liu
Lipeng Wan
Xingyu Chen
Xuguang Lan
OffRL
75
1
0
18 Nov 2024
One-Layer Transformer Provably Learns One-Nearest Neighbor In Context
Zihao Li
Yuan Cao
Cheng Gao
Yihan He
Han Liu
Jason M. Klusowski
Jianqing Fan
Mengdi Wang
MLT
47
6
0
16 Nov 2024
Evaluating World Models with LLM for Decision Making
Chang Yang
Xinrun Wang
Junzhe Jiang
Qinggang Zhang
Xiao Huang
LLMAG
ELM
36
2
0
13 Nov 2024
Éxplaining RL Decisions with Trajectories': A Reproducibility Study
Karim Abdel Sadek
Matteo Nulli
Joan Velja
Jort Vincenti
38
0
0
11 Nov 2024
CROPS: A Deployable Crop Management System Over All Possible State Availabilities
Jing Wu
Zhixin Lai
Shengjie Liu
Suiyao Chen
Ran Tao
Pan Zhao
Chuyuan Tao
Yikun Cheng
N. Hovakimyan
OffRL
39
0
0
09 Nov 2024
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
Marvin Alles
Philip Becker-Ehmck
Patrick van der Smagt
Maximilian Karl
OffRL
34
0
0
07 Nov 2024
1
2
3
4
...
8
9
10
Next