Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.15567
Cited By
Foundation Policies with Hilbert Representations
23 February 2024
Seohong Park
Tobias Kreiman
Sergey Levine
SSL
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Foundation Policies with Hilbert Representations"
16 / 16 papers shown
Title
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TS
AIFin
50
0
0
12 Mar 2025
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
39
1
0
11 Nov 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
46
8
0
26 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
54
0
0
23 Oct 2024
Offline-to-online Reinforcement Learning for Image-based Grasping with Scarce Demonstrations
Bryan Chan
Anson Leung
James Bergstra
OffRL
OnRL
52
0
0
19 Oct 2024
LoopSR: Looping Sim-and-Real for Lifelong Policy Adaptation of Legged Robots
Peilin Wu
Weiji Xie
Jiahang Cao
Hang Lai
Weinan Zhang
21
1
0
26 Sep 2024
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
48
3
0
27 Aug 2024
NOLO: Navigate Only Look Once
Mengyu Bu
Shuhao Gu
Yang Feng
EgoV
33
1
0
02 Aug 2024
TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
Junik Bae
Kwanyoung Park
Youngwoon Lee
27
1
0
11 Jul 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
43
7
0
13 Jun 2024
Multi-Agent Transfer Learning via Temporal Contrastive Learning
Weihao Zeng
Joseph Campbell
Simon Stepputtis
Katia P. Sycara
OffRL
33
2
0
03 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
37
1
0
01 Jun 2024
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
24
6
0
28 Oct 2023
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi-An Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRL
OnRL
109
108
0
09 Mar 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
206
832
0
12 Oct 2021
1