Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1504.00702
Cited By
End-to-End Training of Deep Visuomotor Policies
2 April 2015
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"End-to-End Training of Deep Visuomotor Policies"
50 / 922 papers shown
Title
Active Improvement of Control Policies with Bayesian Gaussian Mixture Model
Hakan Girgin
Emmanuel Pignat
Noémie Jaquier
Sylvain Calinon
16
7
0
06 Aug 2020
Hindsight for Foresight: Unsupervised Structured Dynamics Models from Physical Interaction
Iman Nematollahi
Oier Mees
Lukás Hermann
Wolfram Burgard
20
15
0
02 Aug 2020
Data-efficient visuomotor policy training using reinforcement learning and generative models
Ali Ghadirzadeh
Petra Poklukar
Ville Kyrki
Danica Kragic
Mårten Björkman
OffRL
46
9
0
26 Jul 2020
Joint Mind Modeling for Explanation Generation in Complex Human-Robot Collaborative Tasks
Xiaofeng Gao
Ran Gong
Yizhou Zhao
Shu Wang
Tianmin Shu
Song-Chun Zhu
18
35
0
24 Jul 2020
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Wenlong Huang
Igor Mordatch
Deepak Pathak
51
167
0
09 Jul 2020
Self-Supervised Policy Adaptation during Deployment
Nicklas Hansen
Rishabh Jangir
Yu Sun
Guillem Alenyà
Pieter Abbeel
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
41
159
0
08 Jul 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke
Joshua Achiam
Pieter Abbeel
31
287
0
08 Jul 2020
FlowControl: Optical Flow Based Visual Servoing
Max Argus
Lukás Hermann
Jon Long
Thomas Brox
25
25
0
01 Jul 2020
Vision-Based Goal-Conditioned Policies for Underwater Navigation in the Presence of Obstacles
Travis Manderson
J. A. G. Higuera
Stefan Wapnick
J. Tremblay
Florian Shkurti
David Meger
Gregory Dudek
19
50
0
29 Jun 2020
Reinforcement Learning Control of Robotic Knee with Human in the Loop by Flexible Policy Iteration
Xiang Gao
J. Si
Yue Wen
Minhan Li
He
H. Huang
13
31
0
16 Jun 2020
Model-based Adversarial Meta-Reinforcement Learning
Zichuan Lin
G. Thomas
Guangwen Yang
Tengyu Ma
OOD
27
52
0
16 Jun 2020
Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling
Russell Mendonca
Xinyang Geng
Chelsea Finn
Sergey Levine
OOD
OffRL
32
41
0
12 Jun 2020
Learning Navigation Costs from Demonstration with Semantic Observations
Tianyu Wang
Vikas Dhiman
Nikolay Atanasov
38
4
0
09 Jun 2020
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory
Yufeng Zhang
Qi Cai
Zhuoran Yang
Yongxin Chen
Zhaoran Wang
OOD
MLT
168
11
0
08 Jun 2020
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
22
34
0
02 Jun 2020
Sim2Real for Peg-Hole Insertion with Eye-in-Hand Camera
Damian Bogunowicz
A. Rybnikov
Komal Vendidandi
Fedor Chervinskii
17
7
0
29 May 2020
LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision
Zheyu Zhuang
Xin Yu
Robert E. Mahony
42
5
0
25 May 2020
Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning
Michelle A. Lee
Carlos Florensa
Jonathan Tremblay
Nathan D. Ratliff
Animesh Garg
Fabio Ramos
Dieter Fox
23
60
0
21 May 2020
TASO: Time and Space Optimization for Memory-Constrained DNN Inference
Yuan Wen
Andrew Anderson
Valentin Radu
Michael F. P. O'Boyle
David Gregg
29
10
0
21 May 2020
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
25
83
0
20 May 2020
Automating Turbulence Modeling by Multi-Agent Reinforcement Learning
G. Novati
Hugues Lascombes de Laroussilhe
Petros Koumoutsakos
AI4CE
34
15
0
18 May 2020
A Distributional View on Multi-Objective Policy Optimization
A. Abdolmaleki
Sandy H. Huang
Leonard Hasenclever
Michael Neunert
H. F. Song
Martina Zambelli
M. Martins
N. Heess
R. Hadsell
Martin Riedmiller
26
74
0
15 May 2020
One-Shot Recognition of Manufacturing Defects in Steel Surfaces
Aditya M. Deshpande
A. Minai
Manish Kumar
16
58
0
12 May 2020
Robotic Arm Control and Task Training through Deep Reinforcement Learning
Andrea Franceschetti
E. Tosello
Nicola Castaman
Stefano Ghidoni
15
32
0
06 May 2020
GCN-RL Circuit Designer: Transferable Transistor Sizing with Graph Neural Networks and Reinforcement Learning
Hanrui Wang
Kuan-Chieh Wang
Jiacheng Yang
Linxiao Shen
Nan Sun
Hae-Seung Lee
Song Han
GNN
21
232
0
30 Apr 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
48
140
0
30 Apr 2020
Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning
Ryan Julian
Benjamin Swanson
Gaurav Sukhatme
Sergey Levine
Chelsea Finn
Karol Hausman
OnRL
CLL
33
43
0
21 Apr 2020
Multi-Task Reinforcement Learning with Soft Modularization
Ruihan Yang
Huazhe Xu
Yi Wu
Xiaolong Wang
27
177
0
30 Mar 2020
When Autonomous Systems Meet Accuracy and Transferability through AI: A Survey
Chongzhen Zhang
Jianrui Wang
Gary G. Yen
Chaoqiang Zhao
Qiyu Sun
Yang Tang
Feng Qian
Jürgen Kurths
AAML
35
20
0
29 Mar 2020
Learning to Fly via Deep Model-Based Reinforcement Learning
Philip Becker-Ehmck
Maximilian Karl
Jan Peters
Patrick van der Smagt
SSL
44
37
0
19 Mar 2020
Visual Task Progress Estimation with Appearance Invariant Embeddings for Robot Control and Planning
Guilherme J. Maeda
Joni Väätäinen
Hironori Yoshida
22
2
0
16 Mar 2020
Machine Learning for Intelligent Optical Networks: A Comprehensive Survey
Rentao Gu
Zeyuan Yang
Yuefeng Ji
27
109
0
11 Mar 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
OffRL
36
17
0
09 Mar 2020
Transferable Task Execution from Pixels through Deep Planning Domain Learning
Kei Kase
Chris Paxton
H. Mazhar
T. Ogata
Dieter Fox
147
45
0
08 Mar 2020
Natural Language Processing Advancements By Deep Learning: A Survey
A. Torfi
Rouzbeh A. Shirvani
Yaser Keneshloo
Nader Tavvaf
Edward A. Fox
AI4CE
VLM
85
216
0
02 Mar 2020
Robust-Adaptive Control of Linear Systems: beyond Quadratic Costs
Edouard Leurent
D. Efimov
Odalric-Ambrym Maillard
22
3
0
25 Feb 2020
Learning to Walk in the Real World with Minimal Human Effort
Sehoon Ha
P. Xu
Zhenyu Tan
Sergey Levine
Jie Tan
31
169
0
20 Feb 2020
Learning Pregrasp Manipulation of Objects from Ungraspable Poses
Zhaole Sun
Kai Yuan
Wenbin Hu
Chuanyu Yang
Zhibin Li
SSL
27
28
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
V. Cevher
31
60
0
14 Feb 2020
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen
Lukás Jendele
Emre Aksan
Otmar Hilliges
OffRL
30
25
0
14 Feb 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
27
35
0
10 Feb 2020
Autonomous quadrotor obstacle avoidance based on dueling double deep recurrent Q-learning with monocular vision
Jiajun Ou
Xiao Guo
Ming Zhu
Wenjie Lou
27
30
0
10 Feb 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
27
15
0
07 Feb 2020
Constrained Upper Confidence Reinforcement Learning
Liyuan Zheng
Lillian J. Ratliff
36
67
0
26 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen
Shengbo Eben Li
Masayoshi Tomizuka
57
226
0
23 Jan 2020
Gradient Surgery for Multi-Task Learning
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
41
1,175
0
19 Jan 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
24
38
0
09 Jan 2020
Aggressive Perception-Aware Navigation using Deep Optical Flow Dynamics and PixelMPC
Keuntaek Lee
Jason Gibson
Evangelos A. Theodorou
33
31
0
07 Jan 2020
A Boolean Task Algebra for Reinforcement Learning
Geraud Nangue Tasse
Steven D. James
Benjamin Rosman
22
54
0
06 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
30
38
0
02 Jan 2020
Previous
1
2
3
...
11
12
13
...
17
18
19
Next