Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.09648
Cited By
Prompt-Tuning Decision Transformer with Preference Ranking
16 May 2023
Shengchao Hu
Li Shen
Ya-Qin Zhang
Dacheng Tao
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Prompt-Tuning Decision Transformer with Preference Ranking"
20 / 20 papers shown
Title
Decision SpikeFormer: Spike-Driven Transformer for Decision Making
Wei Huang
Qinying Gu
Nanyang Ye
OffRL
23
1
0
04 Apr 2025
Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance
Zhe Wang
Haozhu Wang
Yanjun Qi
OffRL
71
0
0
01 Dec 2024
Continual Task Learning through Adaptive Policy Self-Composition
Shengchao Hu
Yuhang Zhou
Ziqing Fan
Jifeng Hu
Li Shen
Ya-Qin Zhang
Dacheng Tao
OffRL
69
0
0
18 Nov 2024
Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
Ziqing Fan
Shengchao Hu
Yuhang Zhou
Li Shen
Ya-Qin Zhang
Yanfeng Wang
Dacheng Tao
OffRL
29
0
0
02 Nov 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Zhi Wang
Li Lyna Zhang
Wenhao Wu
Yuanheng Zhu
Dongbin Zhao
C. L. Philip Chen
OffRL
30
6
0
15 Oct 2024
Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer
Yu Yang
Pan Xu
VLM
OffRL
21
1
0
02 Aug 2024
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
Shengchao Hu
Ziqing Fan
Li Shen
Ya-Qin Zhang
Yanfeng Wang
Dacheng Tao
OffRL
28
9
0
28 May 2024
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?
Yang Dai
Oubo Ma
Longfei Zhang
Xingxing Liang
Shengchao Hu
Mengzhu Wang
Shouling Ji
Jincai Huang
Li Shen
Mamba
24
4
0
20 May 2024
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment
Xudong Yu
Chenjia Bai
Haoran He
Changhong Wang
Xuelong Li
19
6
0
07 Apr 2024
Context-Former: Stitching via Latent Conditioned Sequence Modeling
Ziqi Zhang
Jingzehua Xu
Jinxin Liu
Zifeng Zhuang
Donglin Wang
Miao Liu
Shuai Zhang
OffRL
26
4
0
29 Jan 2024
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Ruizhe Shi
Yuyao Liu
Yanjie Ze
Simon S. Du
Huazhe Xu
OffRL
RALM
18
18
0
31 Oct 2023
Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles
Zhiwei Tang
Dmitry Rybin
Tsung-Hui Chang
ALM
DiffM
28
25
0
07 Mar 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya-Qin Zhang
Yixin Chen
Dacheng Tao
OffRL
23
23
0
29 Dec 2022
Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL
Taku Yamagata
Ahmed Khalil
Raúl Santos-Rodríguez
OffRL
142
70
0
08 Sep 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
322
2,108
0
02 Sep 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
278
3,784
0
18 Apr 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
194
412
0
16 Feb 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,662
0
04 May 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
237
11,568
0
09 Mar 2017
1