1910.11956
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Conference on Robot Learning (CoRL), 2019
25 October 2019
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
Papers citing "Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning"
50 / 345 papers shown
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction
International Conference on Learning Representations (ICLR), 2024
Anthony GX-Chen
Kenneth Marino
Rob Fergus
OCL
514
1
0
21 Aug 2024
Diffusion Model for Planning: A Systematic Literature Review
Toshihide Ubukata
Jialong Li
Kenji Tei
DiffM
MedIm
288
17
0
16 Aug 2024
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
Rafael Rafailov
Kyle Hatch
Anikait Singh
Laura Smith
Aviral Kumar
...
Victor Kolev
Philip J. Ball
Jiajun Wu
Chelsea Finn
Sergey Levine
OffRL
215
11
0
15 Aug 2024
Semi-Supervised One-Shot Imitation Learning
Philipp Wu
Kourosh Hakhamaneshi
Yuqing Du
Igor Mordatch
Aravind Rajeswaran
Pieter Abbeel
SSL
308
3
0
09 Aug 2024
Astra: Efficient Transformer Architecture and Contrastive Dynamics Learning for Embodied Instruction Following
Yueen Ma
Dafeng Chi
Shiguang Wu
Yuecheng Liu
Yuzheng Zhuang
Irwin King
211
9
0
02 Aug 2024
MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery
Pei Zhou
Yanchao Yang
250
2
0
21 Jul 2024
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
Xu-Hui Liu
Tian-Shuo Liu
Shengyi Jiang
Ruifeng Chen
Zhilong Zhang
Xinwei Chen
Yang Yu
OffRL
OnRL
299
9
0
17 Jul 2024
HACMan++: Spatially-Grounded Motion Primitives for Manipulation
Bowen Jiang
Yilin Wu
Wenxuan Zhou
Chris Paxton
David Held
264
6
0
11 Jul 2024
TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
Junik Bae
Kwanyoung Park
Youngwoon Lee
250
10
0
11 Jul 2024
Equivariant Diffusion Policy
Dian Wang
Stephen M. Hart
David Surovik
Tarik Kelestemur
Haojie Huang
Haibo Zhao
Mark Yeatman
Jiuguang Wang
Robin Walters
Robert Platt
DiffM
336
60
0
01 Jul 2024
DEAR: Disentangled Environment and Agent Representations for Reinforcement Learning without Reconstruction
Ameya Pore
Riccardo Muradore
Diego Dall'Alba
DRL
310
6
0
30 Jun 2024
Multimodal foundation world models for generalist embodied agents
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Rameswar Panda
Sai Rajeswar
OffRL
LM&Ro
271
1
0
26 Jun 2024
Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation
Jiaming Zhou
Teli Ma
Kun-Yu Lin
Ronghe Qiu
Zifan Wang
Junwei Liang
430
17
0
20 Jun 2024
Variational Distillation of Diffusion Policies into Mixture of Experts
Neural Information Processing Systems (NeurIPS), 2024
Hongyi Zhou
Denis Blessing
Ge Li
Onur Celik
Xiaogang Jia
Gerhard Neumann
Rudolf Lioutikov
DiffM
245
7
0
18 Jun 2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yi Ma
Pengyi Li
Yan Zheng
DiffM
215
29
0
13 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
350
8
0
11 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
598
3
0
09 Jun 2024
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories
International Conference on Machine Learning (ICML), 2024
Qianlan Yang
Yu-Xiong Wang
OnRL
243
1
0
06 Jun 2024
FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
Yuwei Fu
Haichao Zhang
Di Wu
Wei Xu
Benoit Boulet
VLM
362
25
0
02 Jun 2024
Learning Manipulation by Predicting Interaction
Jia Zeng
Qingwen Bu
Bangjun Wang
Wenke Xia
Li Chen
...
Heming Cui
Bin Zhao
Xuelong Li
Yu Qiao
Hongyang Li
394
38
0
01 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
508
4
0
01 Jun 2024
Causal Action Influence Aware Counterfactual Data Augmentation
Núria Armengol Urpí
Marco Bagatella
Marin Vlastelica
Georg Martius
CML
191
10
0
29 May 2024
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation
Kuo-Han Hung
Pang-Chi Lo
Jia-Fong Yeh
Han-Yuan Hsu
Yi-Ting Chen
Winston H. Hsu
409
2
0
26 May 2024
How to Leverage Diverse Demonstrations in Offline Imitation Learning
Sheng Yue
Jiani Liu
Xingyuan Hua
Ju Ren
Sen Lin
Junshan Zhang
Yaoxue Zhang
OffRL
374
7
0
24 May 2024
DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation
International Conference on Machine Learning (ICML), 2024
Jinxin Liu
Xinghong Guo
Zifeng Zhuang
Xuetao Zhang
DiffM
OffRL
230
3
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
889
166
0
23 May 2024
Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation
Aaditya Prasad
Kevin Qinghong Lin
Jimmy Wu
Linqi Zhou
Jeannette Bohg
331
106
0
13 May 2024
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks
International Conference on Learning Representations (ICLR), 2024
Murtaza Dalal
Tarun Chiruvolu
Devendra Singh Chaplot
Ruslan Salakhutdinov
LM&Ro
370
73
0
02 May 2024
Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations
Puhao Li
Tengyu Liu
Yuyang Li
Muzhi Han
Haoran Geng
Shu Wang
Yixin Zhu
Song-Chun Zhu
Siyuan Huang
263
27
0
26 Apr 2024
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Utsav Singh
Wesley A Suttle
Brian M Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
288
5
0
20 Apr 2024
Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models
Yutao Ouyang
Jinhan Li
Yunfei Li
Zhongyu Li
Chao Yu
Koushil Sreenath
Yi Wu
393
23
0
08 Apr 2024
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
Carmelo Sferrazza
Dun-Ming Huang
Xingyu Lin
Youngwoon Lee
Pieter Abbeel
409
90
0
15 Mar 2024
ALaRM: Align Language Models via Hierarchical Rewards Modeling
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Yuhang Lai
Siyuan Wang
Shujun Liu
Xuanjing Huang
Zhongyu Wei
280
8
0
11 Mar 2024
Spatiotemporal Predictive Pre-training for Robotic Motor Control
Jiange Yang
Bei Liu
Jianlong Fu
Bocheng Pan
Gangshan Wu
Limin Wang
374
20
0
08 Mar 2024
Behavior Generation with Latent Actions
Seungjae Lee
Yibin Wang
Haritheja Etukuru
H. J. Kim
Mahi Shafiullah
Lerrel Pinto
VGen
OffRL
403
121
0
05 Mar 2024
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Shaoteng Liu
Haoqi Yuan
Minda Hu
Yanwei Li
Yukang Chen
Shu Liu
Zongqing Lu
Jiaya Jia
LLMAG
273
26
0
29 Feb 2024
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
Jianxiong Li
Jinliang Zheng
Yinan Zheng
Liyuan Mao
Xiaoming Hu
...
Jihao Liu
Yu Liu
Jingjing Liu
Ya Zhang
Xianyuan Zhan
LM&Ro
OffRL
283
14
0
28 Feb 2024
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning
Zhaoxun Ju
Chao Yang
Hongbo Wang
Yu Qiao
Gang Hua
LM&Ro
301
7
0
27 Feb 2024
Don't Start from Scratch: Behavioral Refinement via Interpolant-based Policy Diffusion
Kaiqi Chen
Eugene Lim
Kelvin Lin
Yiyang Chen
Harold Soh
DiffM
408
19
0
25 Feb 2024
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSL
OffRL
391
50
0
23 Feb 2024
Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations
Xiaogang Jia
Denis Blessing
Xinkai Jiang
Moritz Reuss
Atalay Donat
Rudolf Lioutikov
Gerhard Neumann
245
42
0
22 Feb 2024
DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models
Norman Di Palo
Edward Johns
LM&Ro
231
54
0
20 Feb 2024
PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control
Ruijie Zheng
Ching-An Cheng
Hal Daumé
Furong Huang
Andrey Kolobov
210
16
0
16 Feb 2024
One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill
Sangwoo Shin
Daehee Lee
Minjong Yoo
Woo Kyung Kim
Honguk Woo
352
11
0
13 Feb 2024
Large Language Models as Agents in Two-Player Games
Yang Liu
Yang Liu
Hang Li
LLMAG
180
7
0
12 Feb 2024
SemTra: A Semantic Skill Translator for Cross-Domain Zero-Shot Policy Adaptation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Sangwoo Shin
Minjong Yoo
Jeongwoo Lee
Honguk Woo
233
6
0
12 Feb 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Shuang Ma
Hal Daumé
Huazhe Xu
John Langford
Praveen Palanisamy
Kalyan Shankar Basu
Furong Huang
468
10
0
09 Feb 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
International Conference on Learning Representations (ICLR), 2024
Jiafei Lyu
Xiaoteng Ma
Le Wan
Runze Liu
Xiu Li
Zongqing Lu
OffRL
313
14
0
06 Feb 2024
RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents
Tomoyuki Kagaya
Thong Jing Yuan
Yuxuan Lou
J. Karlekar
Sugiri Pranata
Akira Kinose
Koki Oguri
Felix Wick
Yang You
LLMAG
208
64
0
06 Feb 2024
DiffuserLite: Towards Real-time Diffusion Planning
Neural Information Processing Systems (NeurIPS), 2024
Zibin Dong
Jianye Hao
Yifu Yuan
Fei Ni
Yitian Wang
Pengyi Li
Yan Zheng
465
35
0
27 Jan 2024