ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.12996
  4. Cited By
RLIF: Interactive Imitation Learning as Reinforcement Learning

RLIF: Interactive Imitation Learning as Reinforcement Learning

21 November 2023
Jianlan Luo
Perry Dong
Yuexiang Zhai
Yi-An Ma
Sergey Levine
    OffRL
ArXivPDFHTML

Papers citing "RLIF: Interactive Imitation Learning as Reinforcement Learning"

13 / 13 papers shown
Title
CubeDAgger: Improved Robustness of Interactive Imitation Learning without Violation of Dynamic Stability
CubeDAgger: Improved Robustness of Interactive Imitation Learning without Violation of Dynamic Stability
Taisuke Kobayashi
70
0
0
08 May 2025
Dexterous Hand Manipulation via Efficient Imitation-Bootstrapped Online Reinforcement Learning
Dongchi Huang
Tianle Zhang
Yihang Li
Ling Zhao
Jiayi Li
Zhirui Fang
Chunhe Xia
Lusong Li
Xiaodong He
OffRL
43
0
0
06 Mar 2025
High-Precision Transformer-Based Visual Servoing for Humanoid Robots in Aligning Tiny Objects
Jialong Xue
Wei Gao
Yu Wang
Chao Ji
Dongdong Zhao
Shi Yan
Shiwu Zhang
43
0
0
06 Mar 2025
MILE: Model-based Intervention Learning
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
83
1
0
21 Feb 2025
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
Yuhui Chen
Shuai Tian
Shugao Liu
Yingting Zhou
Haoran Li
Dongbin Zhao
OffRL
90
1
0
08 Feb 2025
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
Charles Xu
Qiyang Li
Jianlan Luo
Sergey Levine
OffRL
85
5
0
13 Dec 2024
Robot See, Robot Do: Imitation Reward for Noisy Financial Environments
Robot See, Robot Do: Imitation Reward for Noisy Financial Environments
Sven Goluža
Tomislav Kovačević
Stjepan Begušić
Z. Kostanjčar
21
0
0
13 Nov 2024
Reinforcement Learning From Imperfect Corrective Actions And Proxy
  Rewards
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards
Zhaohui Jiang
Xuening Feng
Paul Weng
Yifei Zhu
Yan Song
Tianze Zhou
Yujing Hu
Tangjie Lv
Changjie Fan
31
0
0
08 Oct 2024
VANP: Learning Where to See for Navigation with Self-Supervised
  Vision-Action Pre-Training
VANP: Learning Where to See for Navigation with Self-Supervised Vision-Action Pre-Training
Mohammad Nazeri
Junzhe Wang
Amirreza Payandeh
Xuesu Xiao
SSL
ViT
41
5
0
12 Mar 2024
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Jianlan Luo
Zheyuan Hu
Charles Xu
You Liang Tan
Jacob Berg
Archit Sharma
S. Schaal
Chelsea Finn
Abhishek Gupta
Sergey Levine
OffRL
OnRL
34
40
0
29 Jan 2024
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online
  Fine-Tuning
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi-An Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRL
OnRL
109
108
0
09 Mar 2023
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,881
0
04 Mar 2022
Efficient Learning of Safe Driving Policy via Human-AI Copilot
  Optimization
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
Quanyi Li
Zhenghao Peng
Bolei Zhou
75
35
0
17 Feb 2022
1