2202.04478
Cited By
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL
9 February 2022
Rui Yang
Yiming Lu
Wenzhe Li
Hao Sun
Meng Fang
Yali Du
Xiu Li
Lei Han
Chongjie Zhang
OffRL
Papers citing
"Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL"
17 / 17 papers shown
Title
Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning
Hao Sun
M. Schaar
87
14
0
28 Jan 2025
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
46
8
0
26 Oct 2024
Zero-Shot Offline Imitation Learning via Optimal Transport
Thomas Rupf
Marco Bagatella
Nico Gürtler
Jonas Frey
Georg Martius
OffRL
77
0
0
11 Oct 2024
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Shenao Zhang
Zhihan Liu
Boyi Liu
Y. Zhang
Yingxiang Yang
Y. Liu
Liyu Chen
Tao Sun
Z. Wang
87
2
0
10 Oct 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
36
7
0
24 Jun 2024
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
Rui Yang
Ruomeng Ding
Yong Lin
Huan Zhang
Tong Zhang
21
42
0
14 Jun 2024
Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets
Haoran He
C. Chang
Huazhe Xu
Ling Pan
77
6
0
03 Jun 2024
Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
Sungyoon Kim
Yunseon Choi
Daiki E. Matsunaga
Kee-Eung Kim
OffRL
23
6
0
11 Feb 2024
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
23
21
0
09 Oct 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
31
12
0
26 Apr 2023
Goal-Conditioned Imitation Learning using Score-based Diffusion Policies
Moritz Reuss
M. Li
Xiaogang Jia
Rudolf Lioutikov
DiffM
21
156
0
05 Apr 2023
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
J. Pajarinen
Joni-Kristian Kämäräinen
OffRL
OnRL
8
1
0
17 Feb 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
32
17
0
05 Jan 2023
Learning Robotic Navigation from Experience: Principles, Methods, and Recent Results
Sergey Levine
Dhruv Shah
SSL
16
21
0
13 Dec 2022
A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Haoran Xu
Li Jiang
Jianxiong Li
Xianyuan Zhan
OffRL
16
61
0
15 Oct 2022
What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization
Hao Sun
B. V. Breugel
Jonathan Crabbé
Nabeel Seedat
M. Schaar
22
4
0
11 Jul 2022
Offline Reinforcement Learning with Reverse Model-based Imagination
Jianhao Wang
Wenzhe Li
Haozhe Jiang
Guangxiang Zhu
Siyuan Li
Chongjie Zhang
OffRL
96
59
0
01 Oct 2021