ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.02039
  4. Cited By
Offline Reinforcement Learning as One Big Sequence Modeling Problem

Offline Reinforcement Learning as One Big Sequence Modeling Problem

3 June 2021
Michael Janner
Qiyang Li
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

50 / 464 papers shown
Title
DSADF: Thinking Fast and Slow for Decision Making
DSADF: Thinking Fast and Slow for Decision Making
Alex Zhihao Dou
Dongfei Cui
Jun Yan
W. Wang
Benteng Chen
Haoming Wang
Zeke Xie
Shufei Zhang
OffRL
36
0
0
13 May 2025
ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning
ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning
Hongyin Zhang
Zifeng Zhuang
H. Zhao
Pengxiang Ding
Hongchao Lu
Donglin Wang
OffRL
44
0
0
12 May 2025
UniCO: Towards a Unified Model for Combinatorial Optimization Problems
UniCO: Towards a Unified Model for Combinatorial Optimization Problems
Zefang Zong
Xiaochen Wei
Guozhen Zhang
Chen Gao
Huandong Wang
Yong Li
29
0
0
07 May 2025
Latent Adaptive Planner for Dynamic Manipulation
Latent Adaptive Planner for Dynamic Manipulation
Donghun Noh
Deqian Kong
Minglu Zhao
Andrew Lizarraga
Jianwen Xie
Ying Nian Wu
Dennis W. Hong
100
0
0
06 May 2025
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
Jifeng Hu
Sili Huang
Z. Yang
Shengchao Hu
Li Shen
H. Chen
Lichao Sun
Yi-Ju Chang
Dacheng Tao
OffRL
109
0
0
03 May 2025
NGENT: Next-Generation AI Agents Must Integrate Multi-Domain Abilities to Achieve Artificial General Intelligence
NGENT: Next-Generation AI Agents Must Integrate Multi-Domain Abilities to Achieve Artificial General Intelligence
Zhicong Li
Hangyu Mao
Jiangjin Yin
Mingzhe Xing
Zhiwei Xu
Yuanxing Zhang
Yang Xiao
29
0
0
30 Apr 2025
Offline Learning of Controllable Diverse Behaviors
Offline Learning of Controllable Diverse Behaviors
Mathieu Petitbois
Rémy Portelas
Sylvain Lamprier
Ludovic Denoyer
OffRL
34
0
0
25 Apr 2025
Towards Forceful Robotic Foundation Models: a Literature Survey
Towards Forceful Robotic Foundation Models: a Literature Survey
William Xie
N. Correll
OffRL
56
1
0
16 Apr 2025
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Wei Huang
Qinying Gu
Nanyang Ye
OffRL
29
1
0
04 Apr 2025
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners
Wen Zheng Terence Ng
Jianda Chen
Yuan Xu
Tianwei Zhang
37
0
0
24 Mar 2025
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
CONTHER: Human-Like Contextual Robot Learning via Hindsight Experience Replay and Transformers without Expert Demonstrations
Maria Makarova
Qian Liu
Dzmitry Tsetserukou
OffRL
41
0
0
20 Mar 2025
Masked Sensory-Temporal Attention for Sensor Generalization in Quadruped Locomotion
Masked Sensory-Temporal Attention for Sensor Generalization in Quadruped Locomotion
Dikai Liu
Tianwei Zhang
Jianxiong Yin
Simon See
85
1
0
13 Mar 2025
Positive-Unlabeled Diffusion Models for Preventing Sensitive Data Generation
Hiroshi Takahashi
Tomoharu Iwata
Atsutoshi Kumagai
Yuuki Yamanaka
Tomoya Yamashita
DiffM
65
0
0
05 Mar 2025
Target Return Optimizer for Multi-Game Decision Transformer
Kensuke Tatematsu
Akifumi Wachi
OffRL
64
0
0
04 Mar 2025
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
DPR: Diffusion Preference-based Reward for Offline Reinforcement Learning
Teng Pang
Bingzheng Wang
Guoqiang Wu
Yilong Yin
OffRL
68
0
0
03 Mar 2025
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz U. Abdullaev
Tan M. Nguyen
39
2
0
02 Mar 2025
What Makes a Good Diffusion Planner for Decision Making?
Haofei Lu
Dongqi Han
Yifei Shen
Dongsheng Li
DiffM
33
3
0
01 Mar 2025
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction
Baiting Luo
Ava Pettet
Aron Laszka
A. Dubey
Ayan Mukhopadhyay
OffRL
43
1
0
28 Feb 2025
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments
ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments
Jinghao Xin
Zhichao Liang
Zihuan Zhang
Peng Wang
Ning Li
59
0
0
27 Feb 2025
Revisiting Kernel Attention with Correlated Gaussian Process Representation
Revisiting Kernel Attention with Correlated Gaussian Process Representation
Long Minh Bui
Tho Tran Huu
Duy-Tung Dinh
T. Nguyen
Trong Nghia Hoang
44
2
0
27 Feb 2025
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Jaehyeon Son
Soochan Lee
Gunhee Kim
OffRL
72
1
0
26 Feb 2025
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Yiqin Yang
Quanwei Wang
Chenghao Li
Hao Hu
Chengjie Wu
...
Dianyu Zhong
Ziyou Zhang
Qianchuan Zhao
Chongjie Zhang
Xu Bo
OffRL
47
0
0
26 Feb 2025
PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning
PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning
Kun Hu
Muning Wen
X. Wang
S. Zhang
Yiwei Shi
Minne Li
Minglong Li
Ying Wen
42
0
0
23 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
G. Klambauer
Razvan Pascanu
Sepp Hochreiter
73
4
0
21 Feb 2025
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution
Emmanuel K. Raptis
Athanasios Ch. Kapoutsis
Elias B. Kosmatopoulos
LM&Ro
75
0
0
18 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
62
1
0
17 Feb 2025
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches
D. Elbaz
Oren Salzman
OffRL
32
0
0
13 Feb 2025
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Model-Based Offline Reinforcement Learning with Reliability-Guaranteed Sequence Modeling
Shenghong He
OffRL
138
0
0
10 Feb 2025
Habitizing Diffusion Planning for Efficient and Effective Decision Making
Haofei Lu
Yifei Shen
Dongsheng Li
Junliang Xing
Dongqi Han
62
0
0
10 Feb 2025
Utilizing Novelty-based Evolution Strategies to Train Transformers in Reinforcement Learning
Matyáš Lorenc
OffRL
67
0
0
10 Feb 2025
Towards Robust Spacecraft Trajectory Optimization via Transformers
Towards Robust Spacecraft Trajectory Optimization via Transformers
Yuji Takubo
T. Guffanti
Daniele Gammelli
Marco Pavone
Simone DÁmico
69
4
0
28 Jan 2025
Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning
Zhihao Zhang
Ekim Yurtsever
Keith A. Redmill
33
0
0
28 Jan 2025
Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning
Utilizing Evolution Strategies to Train Transformers in Reinforcement Learning
Matyáš Lorenc
38
1
0
23 Jan 2025
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
Abdullah Akgul
Manuel Haußmann
M. Kandemir
OffRL
66
1
0
17 Jan 2025
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Future-Conditioned Recommendations with Multi-Objective Controllable Decision Transformer
Chongming Gao
Kexin Huang
Ziang Fei
Jiaju Chen
J. Chen
Jianshan Sun
Shuchang Liu
Qingpeng Cai
Peng Jiang
OffRL
34
0
0
13 Jan 2025
Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors
Niels Justesen
Maria Kaselimi
Sam Snodgrass
Miruna Vozaru
Matthew Schlegel
...
Albert Wang
Christoffer Holmgård
Georgios N. Yannakakis
S. Risi
Julian Togelius
39
0
0
03 Jan 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
MADiff: Offline Multi-agent Learning with Diffusion Models
Zhengbang Zhu
Minghuan Liu
Liyuan Mao
Bingyi Kang
Minkai Xu
Yong Yu
Stefano Ermon
Weinan Zhang
DiffM
OffRL
78
33
0
03 Jan 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Z. Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
100
1
0
22 Dec 2024
AdaCred: Adaptive Causal Decision Transformers with Feature Crediting
AdaCred: Adaptive Causal Decision Transformers with Feature Crediting
Hemant Kumawat
Saibal Mukhopadhyay
62
0
0
19 Dec 2024
Embodied CoT Distillation From LLM To Off-the-shelf Agents
Embodied CoT Distillation From LLM To Off-the-shelf Agents
Wonje Choi
Woo Kyung Kim
Minjong Yoo
Honguk Woo
OffRL
LM&Ro
108
2
0
16 Dec 2024
Advances in Transformers for Robotic Applications: A Review
Advances in Transformers for Robotic Applications: A Review
Nikunj Sanghai
Nik Bear Brown
AI4CE
75
0
0
13 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class
  and Backbone
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
M. K. Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
95
4
0
09 Dec 2024
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations
Cevahir Köprülü
Po-han Li
Tianyu Qiu
Ruihan Zhao
T. Westenbroek
David Fridovich-Keil
Sandeep P. Chinchali
Ufuk Topcu
OffRL
89
0
0
02 Dec 2024
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy
  Generalization with Global and Adaptive Guidance
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance
Zhe Wang
Haozhu Wang
Yanjun Qi
OffRL
76
0
0
01 Dec 2024
Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation
Zhihong Liu
Long Qian
Zeyang Liu
Lipeng Wan
Xingyu Chen
Xuguang Lan
OffRL
75
1
0
18 Nov 2024
One-Layer Transformer Provably Learns One-Nearest Neighbor In Context
Zihao Li
Yuan Cao
Cheng Gao
Yihan He
Han Liu
Jason M. Klusowski
Jianqing Fan
Mengdi Wang
MLT
47
6
0
16 Nov 2024
Evaluating World Models with LLM for Decision Making
Evaluating World Models with LLM for Decision Making
Chang Yang
Xinrun Wang
Junzhe Jiang
Qinggang Zhang
Xiao Huang
LLMAG
ELM
36
2
0
13 Nov 2024
Éxplaining RL Decisions with Trajectories': A Reproducibility Study
Éxplaining RL Decisions with Trajectories': A Reproducibility Study
Karim Abdel Sadek
Matteo Nulli
Joan Velja
Jort Vincenti
38
0
0
11 Nov 2024
CROPS: A Deployable Crop Management System Over All Possible State
  Availabilities
CROPS: A Deployable Crop Management System Over All Possible State Availabilities
Jing Wu
Zhixin Lai
Shengjie Liu
Suiyao Chen
Ran Tao
Pan Zhao
Chuyuan Tao
Yikun Cheng
N. Hovakimyan
OffRL
39
0
0
09 Nov 2024
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning
Marvin Alles
Philip Becker-Ehmck
Patrick van der Smagt
Maximilian Karl
OffRL
34
0
0
07 Nov 2024
1234...8910
Next