ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.09408
  4. Cited By
Guided Online Distillation: Promoting Safe Reinforcement Learning by
  Offline Demonstration

Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration

18 September 2023
Jinning Li
Xinyi Liu
Banghua Zhu
Jiantao Jiao
M. Tomizuka
Chen Tang
Wei Zhan
    OffRL
    OnRL
ArXivPDFHTML

Papers citing "Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration"

8 / 8 papers shown
Title
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning
Zijian Guo
Weichao Zhou
Wenchao Li
OffRL
91
2
0
28 Jan 2025
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online
  Fine-Tuning
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi-An Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRL
OnRL
109
108
0
09 Mar 2023
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement
  Learning
Hierarchical Planning Through Goal-Conditioned Offline Reinforcement Learning
Jinning Li
Chen Tang
M. Tomizuka
Wei Zhan
OffRL
47
54
0
24 May 2022
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
206
832
0
12 Oct 2021
What Matters in Learning from Offline Human Demonstrations for Robot
  Manipulation
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
139
461
0
06 Aug 2021
Large Scale Interactive Motion Forecasting for Autonomous Driving : The
  Waymo Open Motion Dataset
Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Scott Ettinger
Shuyang Cheng
Benjamin Caine
Chenxi Liu
Hang Zhao
...
Jiquan Ngiam
Vijay Vasudevan
Alexander McCauley
Jonathon Shlens
Drago Anguelov
129
528
0
20 Apr 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
Spatio-Temporal Graph Dual-Attention Network for Multi-Agent Prediction
  and Tracking
Spatio-Temporal Graph Dual-Attention Network for Multi-Agent Prediction and Tracking
Jiachen Li
Hengbo Ma
Zhihao Zhang
Jinning Li
M. Tomizuka
50
68
0
18 Feb 2021
1