Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.01448
Cited By
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
4 September 2023
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance"
11 / 11 papers shown
Title
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation
Hongye Cao
Fan Feng
Jing Huo
Shangdong Yang
Meng Fang
Tianpei Yang
Yang Gao
AAML
OffRL
58
0
0
26 Mar 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Z. Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
100
1
0
22 Dec 2024
Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Yifan Pu
Zhuofan Xia
Jiayi Guo
Dongchen Han
Qixiu Li
...
Ji Li
Yizeng Han
Shiji Song
Gao Huang
Xiu Li
56
11
0
11 Aug 2024
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
21
12
0
27 Oct 2023
Rank-DETR for High Quality Object Detection
Yifan Pu
Weicong Liang
Yiduo Hao
Yuhui Yuan
Yukang Yang
Chao Zhang
Hanhua Hu
Gao Huang
36
54
0
13 Oct 2023
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
OffRL
27
16
0
06 Oct 2023
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning
Xiao-Yin Liu
Xiao-Hu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
Hao Li
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Zeng-Guang Hou
OffRL
OOD
16
5
0
16 Sep 2023
Decoupled Prioritized Resampling for Offline RL
Yang Yue
Bingyi Kang
Xiao Ma
Qisen Yang
Gao Huang
S. Song
Shuicheng Yan
OffRL
25
0
0
08 Jun 2023
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
212
838
0
12 Oct 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,951
0
04 May 2020
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
281
11,681
0
09 Mar 2017
1