Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning

Conference on Robot Learning (CoRL), 2019
25 October 2019
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman

Papers citing "Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning"

50 / 344 papers shown
Title
Efficient Diffusion Planning with Temporal Diffusion
Jiaming Guo
Rui Zhang
Z. Li
Yunkai Gao
Shaohui Peng
Siming Lan
Xing Hu
Zidong Du
Xishan Zhang
Ling Li
DiffM
134
0
0
26 Nov 2025
Dynamic Test-Time Compute Scaling in Control Policy: Difficulty-Aware Stochastic Interpolant Policy
Inkook Chun
Seungjae Lee
M. S. Albergo
Saining Xie
Eric Vanden-Eijnden
108
0
0
25 Nov 2025
SFHand: A Streaming Framework for Language-guided 3D Hand Forecasting and Embodied Manipulation
Ruicong Liu
Yifei Huang
Liangyang Ouyang
Caixin Kang
Yoichi Sato
39
1
0
22 Nov 2025
MagBotSim: Physics-Based Simulation and Reinforcement Learning Environments for Magnetic Robotics
Lara Bergmann
Cedric Grothues
Klaus Neumann
61
0
0
20 Nov 2025
SeFA-Policy: Fast and Accurate Visuomotor Policy Learning with Selective Flow Alignment
Rong Xue
Jiageng Mao
Mingtong Zhang
Y. Wang
160
0
0
11 Nov 2025
Learning Interactive World Model for Object-Centric Reinforcement Learning
Fan Feng
Phillip Lippe
Sara Magliacane
OffRL, OCL
226
0
0
04 Nov 2025
Mixed-Density Diffuser: Efficient Planning with Non-Uniform Temporal Resolution
Crimson Stambaugh
Rajesh P. N. Rao
DiffM
150
0
0
27 Oct 2025
Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach
Sebastian Reboul
Hélène Halconruy
Randal Douc
OffRL
76
0
0
22 Oct 2025
Imitation Learning Policy based on Multi-Step Consistent Integration Shortcut Model
Yu Fang
Xinyu Wang
Xuehe Zhang
Wanli Xue
Mingwei Zhang
Shengyong Chen
Jie Zhao
92
0
0
22 Oct 2025
Towards Robust Zero-Shot Reinforcement Learning
Kexin Zheng
Lauriane Teyssier
Yinan Zheng
Yu Luo
Xiayuan Zhan
OffRL
263
0
0
17 Oct 2025
NEBULA: Do We Evaluate Vision-Language-Action Agents Correctly?
Jierui Peng
Yanyan Zhang
Yicheng Duan
Tuo Liang
Vipin Chaudhary
Yu Yin
129
0
0
17 Oct 2025
Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking
Junhyuk So
Chiwoong Lee
Shinyoung Lee
Jungseul Ok
Eunhyeok Park
AI4CE
110
0
0
14 Oct 2025
Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model
Fuhao Li
Wenxuan Song
Han Zhao
Jingbo Wang
Pengxiang Ding
Donglin Wang
Long Zeng
Haoang Li
150
3
0
14 Oct 2025
Fast Visuomotor Policy for Robotic Manipulation
Jingkai Jia
Tong Yang
Xueyao Chen
Chenhuan Liu
Wenqiang Zhang
76
0
0
14 Oct 2025
CDE: Concept-Driven Exploration for Reinforcement Learning
Le Mao
Andrew H. Liu
Renos Zabounidis
Zachary Kingston
Joseph Campbell
73
0
0
09 Oct 2025
BuilderBench -- A benchmark for generalist agents
Raj Ghugare
Catherine Ji
Kathryn Wantlin
Jin Schofield
Benjamin Eysenbach
84
0
0
07 Oct 2025
VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing
Yixiao Wang
Mingxiao Huo
Zhixuan Liang
Yushi Du
Lingfeng Sun
...
Jinghuan Shang
Chensheng Peng
Mohit Bansal
Mingyu Ding
Masayoshi Tomizuka
116
1
0
06 Oct 2025
Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions
Zhaoyi Li
Jingtao Ding
Yong Li
Shihua Li
172
0
0
02 Oct 2025
Act to See, See to Act: Diffusion-Driven Perception-Action Interplay for Adaptive Policies
Jing Wang
Weiting Peng
Jing Tang
Zeyu Gong
Xihua Wang
B. Tao
Li Cheng
196
0
0
30 Sep 2025
Hybrid Diffusion for Simultaneous Symbolic and Continuous Planning
Sigmund Hennum Høeg
Aksel Vaaler
Chaoqi Liu
Olav Egeland
Yilun Du
113
0
0
26 Sep 2025
SAGE: State-Aware Guided End-to-End Policy for Multi-Stage Sequential Tasks via Hidden Markov Decision Process
BinXu Wu
TengFei Zhang
Chen Yang
JiaHao Wen
HaoCheng Li
JingTian Ma
Zhen Chen
Jingyuan Wang
197
0
0
24 Sep 2025
Policy Compatible Skill Incremental Learning via Lazy Learning Interface
Daehee Lee
Dongsu Lee
TaeYoon Kwack
Wonje Choi
Honguk Woo
CLL
190
0
0
24 Sep 2025
Pure Vision Language Action (VLA) Models: A Comprehensive Survey
Dapeng Zhang
Jin Sun
Chenghui Hu
Xiaoyan Wu
Zhenlong Yuan
R. Zhou
Fei Shen
Qingguo Zhou
LM&Ro
209
13
0
23 Sep 2025
Growing with Your Embodied Agent: A Human-in-the-Loop Lifelong Code Generation Framework for Long-Horizon Manipulation Skills
Y. Meng
Zhenguo Sun
Max Fest
Xukun Li
Zhenshan Bing
Alois Knoll
LM&Ro
114
0
0
23 Sep 2025
Long-Horizon Visual Imitation Learning via Plan and Code Reflection
Quan Chen
Chenrui Shi
Qi Chen
Yuwei Wu
Zhi Gao
Xintong Zhang
Rui Gao
Kun Wu
Yunde Jia
88
1
0
04 Sep 2025
Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback
Zeqiang Zhang
Fabian Wurzberger
Gerrit Schmid
Sebastian Gottwald
Daniel A. Braun
SSL
172
0
0
03 Sep 2025
SafeBimanual: Diffusion-based Trajectory Optimization for Safe Bimanual Manipulation
Haoyuan Deng
Wenkai Guo
Qianzhun Wang
Zhenyu Wu
Ziwei Wang
64
0
0
25 Aug 2025
Survey of Vision-Language-Action Models for Embodied Manipulation
Haoran Li
Yuhui Chen
Wenbo Cui
Weiheng Liu
Kai Liu
Mingcai Zhou
Zhengtao Zhang
Dongbin Zhao
LM&Ro
336
3
0
21 Aug 2025
Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey
Rui Shao
W. Li
Lingsen Zhang
Renshan Zhang
Zhiyang Liu
Ran Chen
Liqiang Nie
LM&Ro
167
19
0
18 Aug 2025
D3P: Dynamic Denoising Diffusion Policy via Reinforcement Learning
Shu-Ang Yu
Feng Gao
Yi Wu
Chao Yu
Yu Wang
80
2
0
09 Aug 2025
VFP: Variational Flow-Matching Policy for Multi-Modal Robot Manipulation
Xuanran Zhai
Ce Hao
Qiaojun Yu
Ce Hao
116
2
0
03 Aug 2025
Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs
Chang Li
Yaren Zhang
Haoran Lv
Qiong Cao
Chao Xue
Xiaodong He
OffRL, LRM
132
0
0
22 Jul 2025
Graph-Assisted Stitching for Offline Hierarchical Reinforcement Learning
Seungho Baek
Taegeon Park
Jongchan Park
Seungjun Oh
Yusung Kim
OffRL
227
1
0
09 Jun 2025
Horizon Reduction Makes RL Scalable
Seohong Park
Kevin Frans
Deepinder Mann
Benjamin Eysenbach
Aviral Kumar
Sergey Levine
OffRL
474
14
0
04 Jun 2025
Diffusion Guidance Is a Controllable Policy Improvement Operator
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
233
9
0
29 May 2025
ReinFlow: Fine-tuning Flow Matching Policy with Online Reinforcement Learning
Tonghe Zhang
Chao Yu
Sichang Su
Yu Wang
417
9
0
28 May 2025
VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning
Guanxing Lu
Wenkai Guo
Chubin Zhang
Yuheng Zhou
Haonan Jiang
Zifeng Gao
Yansong Tang
Ziwei Wang
OffRL
326
49
0
24 May 2025
Flattening Hierarchies with Policy Bootstrapping
John L. Zhou
Jonathan C. Kao
OffRL
316
1
0
20 May 2025
TeleOpBench: A Simulator-Centric Benchmark for Dual-Arm Dexterous Teleoperation
Hangyu Li
Qin Zhao
Haoran Xu
Xinyu Jiang
Qingwei Ben
...
Jia Zeng
Hanqing Wang
Bo Dai
Junting Dong
Jiangmiao Pang
359
3
0
19 May 2025
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning
Dongsu Lee
Minhae Kwon
OffRL
236
3
0
19 May 2025
MTIL: Encoding Full History with Mamba for Temporal Imitation Learning
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Yulin Zhou
Yuankai Lin
Fanzhe Peng
Jiahui Chen
Zhuang Zhou
Kaiji Huang
Hua Yang
Mamba
349
2
0
18 May 2025
Fine-tuning Diffusion Policies with Backpropagation Through Diffusion Timesteps
Ningyuan Yang
Jiaxuan Gao
Feng Gao
Yi Wu
Chao Yu
424
1
0
15 May 2025
Latent Theory of Mind: A Decentralized Diffusion Architecture for Cooperative Manipulation
Chengyang He
Gadiel Sznaier Camps
Xu Liu
Mac Schwager
Guillaume Sartoretti
188
3
0
14 May 2025
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
International Conference on Learning Representations (ICLR), 2025
Caleb Chuck
Fan Feng
Carl Qi
Chang Shi
Siddhant Agarwal
Amy Zhang
S. Niekum
246
2
0
06 May 2025
Generative AI in Embodied Systems: System-Level Analysis of Performance, Efficiency and Scalability
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2025
Zishen Wan
Jiayi Qian
Yuhang Du
Jason J. Jabbour
Yilun Du
Yang Katie Zhao
A. Raychowdhury
Tushar Krishna
Vijay Janapa Reddi
LM&Ro
341
2
0
26 Apr 2025
Diffusion Models for Robotic Manipulation: A Survey
Frontiers in Robotics and AI (Front. Robot. AI), 2025
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
383
22
0
11 Apr 2025
MAPLE: Encoding Dexterous Robotic Manipulation Priors Learned From Egocentric Videos
Alexey Gavryushin
Xi Wang
Robert J. S. Malate
Chenyu Yang
Xiaojun Jia
Shubh Goel
Davide Liconti
René Zurbrugg
173
3
0
08 Apr 2025
RoboAct-CLIP: Video-Driven Pre-training of Atomic Action Understanding for Robotics
Zhiyuan Zhang
Yuxin He
Yong Sun
Junyu Shi
Lijiang Liu
Qiang Nie
VLM
243
0
0
02 Apr 2025
Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models
IEEE International Conference on Robotics and Automation (ICRA), 2025
Haonan Chen
Jiaming Xu
Lily Sheng
Tianchen Ji
Shuijing Liu
Yunzhu Li
Katherine Driggs-Campbell
264
4
0
30 Mar 2025
1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities
Kevin Wang
Ishaan Javali
Michał Bortkiewicz
Tomasz Trzciński
Benjamin Eysenbach
OffRL, SSL
408
10
0
19 Mar 2025