1805.11240
Cited By
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
29 May 2018
Wen Sun
J. Andrew Bagnell
Byron Boots
Papers citing
"Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning"
50 / 57 papers shown
Title
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
Wenjun Cao
52
0
0
26 Apr 2025
A Simple Approach to Constraint-Aware Imitation Learning with Application to Autonomous Racing
Shengfan Cao
Eunhyek Joa
Francesco Borrelli
39
0
0
10 Mar 2025
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References
Xueyi Liu
Jianibieke Adalibieke
Qianwei Han
Yuzhe Qin
Li Yi
74
3
0
13 Feb 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-An Cheng
Andrey Kolobov
Abhishek Gupta
78
2
0
04 Feb 2025
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach
Riccardo Poiani
Nicole Nobili
Alberto Maria Metelli
Marcello Restelli
29
0
0
17 Oct 2024
Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
Yanxiao Zhao
Yangge Qian
Tianyi Wang
Jingyang Shan
Xiaolin Qin
21
0
0
01 Mar 2024
Accelerating Inverse Reinforcement Learning with Expert Bootstrapping
David Wu
Sanjiban Choudhury
21
0
0
04 Feb 2024
Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Zhiming Zheng
36
0
0
30 Dec 2023
Visual Hindsight Self-Imitation Learning for Interactive Navigation
Kibeom Kim
Kisung Shin
Min Whoo Lee
Moonhoen Lee
Minsu Lee
Byoung-Tak Zhang
26
2
0
05 Dec 2023
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Yijun Yang
Tianyi Zhou
Kanxue Li
Dapeng Tao
Lusong Li
Li Shen
Xiaodong He
Jing Jiang
Yuhui Shi
LLMAG
LM&Ro
24
34
0
28 Nov 2023
RLIF: Interactive Imitation Learning as Reinforcement Learning
Jianlan Luo
Perry Dong
Yuexiang Zhai
Yi Ma
Sergey Levine
OffRL
27
14
0
21 Nov 2023
Blending Imitation and Reinforcement Learning for Robust Policy Improvement
Xuefeng Liu
Takuma Yoneda
Rick L. Stevens
Matthew R. Walter
Yuxin Chen
29
10
0
03 Oct 2023
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making
Hui Niu
Siyuan Li
Jiahao Zheng
Zhou-Chen Lin
Jian Li
Jian Guo
Bo An
27
3
0
17 Aug 2023
Learning to Generate Better Than Your LLM
Jonathan D. Chang
Kianté Brantley
Rajkumar Ramamurthy
Dipendra Kumar Misra
Wen Sun
19
41
0
20 Jun 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
24
2
0
07 May 2023
Learning Graph Search Heuristics
Michal Pándy
Weikang Qiu
Gabriele Corso
Petar Veličković
Rex Ying
J. Leskovec
Pietro Liò
GNN
27
8
0
07 Dec 2022
Real World Offline Reinforcement Learning with Realistic Data Source
G. Zhou
Liyiming Ke
S. Srinivasa
Abhi Gupta
Aravind Rajeswaran
Vikash Kumar
OffRL
40
21
0
12 Oct 2022
MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization
Hui Niu
Siyuan Li
Jian Li
AIFin
24
30
0
01 Sep 2022
Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction
Chia-Chi Chuang
Donglin Yang
Chuan Wen
Yang Gao
SSL
13
12
0
20 Jul 2022
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron C. Courville
Marc G. Bellemare
OffRL
OnRL
26
63
0
03 Jun 2022
Generalizing to New Tasks via One-Shot Compositional Subgoals
Xihan Bian
Oscar Alejandro Mendez Maldonado
Simon Hadfield
CLL
OffRL
32
1
0
16 May 2022
Pareto Conditioned Networks
Mathieu Reymond
Eugenio Bargiacchi
Ann Nowé
4
16
0
11 Apr 2022
Combining imitation and deep reinforcement learning to accomplish human-level performance on a virtual foraging task
Vittorio Giammarino
Matthew F. Dunne
Kylie N. Moore
Michael Hasselmo
Chantal E. Stern
I. Paschalidis
OffRL
24
5
0
11 Mar 2022
Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel
Haoran Zhang
Chenkun Yin
Yanxin Zhang
S. Jin
Zhenxuan Li
OffRL
16
3
0
23 Feb 2022
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Bogdan Mazoure
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OffRL
51
21
0
29 Nov 2021
Leveraging Experience in Lazy Search
M. Bhardwaj
Sanjiban Choudhury
Byron Boots
S. Srinivasa
8
12
0
10 Oct 2021
Reinforced Imitation Learning by Free Energy Principle
Ryoya Ogishima
Izumi Karino
Y. Kuniyoshi
17
0
0
25 Jul 2021
Heuristic-Guided Reinforcement Learning
Ching-An Cheng
Andrey Kolobov
Adith Swaminathan
OffRL
30
61
0
05 Jun 2021
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning
Bogdan Mazoure
Paul Mineiro
Pavithra Srinath
R. S. Sedeh
Doina Precup
Adith Swaminathan
OffRL
18
4
0
01 Jun 2021
The Value of Planning for Infinite-Horizon Model Predictive Control
Nathan Hatch
Byron Boots
13
10
0
07 Apr 2021
Robust Asymmetric Learning in POMDPs
Andrew Warrington
J. Lavington
Adam Scibior
Mark W. Schmidt
Frank D. Wood
6
28
0
31 Dec 2020
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
26
10
0
26 Dec 2020
Blending MPC & Value Function Approximation for Efficient Reinforcement Learning
M. Bhardwaj
Sanjiban Choudhury
Byron Boots
20
30
0
10 Dec 2020
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Vihang Patil
M. Hofmarcher
Marius-Constantin Dinu
Matthias Dorfer
P. Blies
Johannes Brandstetter
Jose A. Arjona-Medina
Sepp Hochreiter
19
42
0
29 Sep 2020
Bridging the Imitation Gap by Adaptive Insubordination
Luca Weihs
Unnat Jain
Iou-Jen Liu
Jordi Salvador
Svetlana Lazebnik
Aniruddha Kembhavi
A. Schwing
24
34
0
23 Jul 2020
Policy Improvement via Imitation of Multiple Oracles
Ching-An Cheng
Andrey Kolobov
Alekh Agarwal
6
5
0
01 Jul 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
25
3
0
18 Jun 2020
Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
6
13
0
01 Apr 2020
Exploration-efficient Deep Reinforcement Learning with Demonstration Guidance for Robot Control
Ke Lin
Liang Gong
Xudong Li
Te Sun
Binhao Chen
Chengliang Liu
Zhengfeng Zhang
Jian Pu
Junping Zhang
14
8
0
27 Feb 2020
Provable Representation Learning for Imitation Learning via Bi-level Optimization
Sanjeev Arora
S. Du
Sham Kakade
Yuping Luo
Nikunj Saunshi
18
60
0
24 Feb 2020
Information Theoretic Model Predictive Q-Learning
M. Bhardwaj
Ankur Handa
D. Fox
Byron Boots
22
23
0
31 Dec 2019
Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance
Mingxuan Jing
Xiaojian Ma
Wenbing Huang
F. Sun
Chao Yang
Bin Fang
Huaping Liu
21
60
0
16 Nov 2019
Learning from Trajectories via Subgoal Discovery
S. Paul
J. Baar
A. Roy-Chowdhury
81
47
0
03 Nov 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Zhilin Wang
Che Wang
Yanqiu Wu
Keith Ross
OffRL
17
120
0
27 Oct 2019
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods
Ching-An Cheng
Xinyan Yan
Byron Boots
14
22
0
08 Aug 2019
Learning to combine primitive skills: A step towards versatile robotic manipulation
Robin Strudel
Alexander Pashevich
Igor Kalevatykh
Ivan Laptev
Josef Sivic
Cordelia Schmid
11
4
0
02 Aug 2019
Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
Yuping Luo
Huazhe Xu
Tengyu Ma
SSL
18
13
0
12 Jul 2019
Imitation-Projected Programmatic Reinforcement Learning
A. Verma
Hoang Minh Le
Yisong Yue
Swarat Chaudhuri
14
2
0
11 Jul 2019
Goal-conditioned Imitation Learning
Yiming Ding
Carlos Florensa
Mariano Phielipp
Pieter Abbeel
19
219
0
13 Jun 2019
Watch, Try, Learn: Meta-Learning from Demonstrations and Reward
Allan Zhou
Eric Jang
Daniel Kappler
Alexander Herzog
Mohi Khansari
Paul Wohlhart
Yunfei Bai
Mrinal Kalakrishnan
Sergey Levine
Chelsea Finn
21
50
0
07 Jun 2019