ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1504.00702
  4. Cited By
End-to-End Training of Deep Visuomotor Policies

End-to-End Training of Deep Visuomotor Policies

2 April 2015
Sergey Levine
Chelsea Finn
Trevor Darrell
Pieter Abbeel
    BDL
ArXivPDFHTML

Papers citing "End-to-End Training of Deep Visuomotor Policies"

50 / 922 papers shown
Title
Active Improvement of Control Policies with Bayesian Gaussian Mixture
  Model
Active Improvement of Control Policies with Bayesian Gaussian Mixture Model
Hakan Girgin
Emmanuel Pignat
Noémie Jaquier
Sylvain Calinon
16
7
0
06 Aug 2020
Hindsight for Foresight: Unsupervised Structured Dynamics Models from
  Physical Interaction
Hindsight for Foresight: Unsupervised Structured Dynamics Models from Physical Interaction
Iman Nematollahi
Oier Mees
Lukás Hermann
Wolfram Burgard
20
15
0
02 Aug 2020
Data-efficient visuomotor policy training using reinforcement learning
  and generative models
Data-efficient visuomotor policy training using reinforcement learning and generative models
Ali Ghadirzadeh
Petra Poklukar
Ville Kyrki
Danica Kragic
Mårten Björkman
OffRL
46
9
0
26 Jul 2020
Joint Mind Modeling for Explanation Generation in Complex Human-Robot
  Collaborative Tasks
Joint Mind Modeling for Explanation Generation in Complex Human-Robot Collaborative Tasks
Xiaofeng Gao
Ran Gong
Yizhou Zhao
Shu Wang
Tianmin Shu
Song-Chun Zhu
18
35
0
24 Jul 2020
One Policy to Control Them All: Shared Modular Policies for
  Agent-Agnostic Control
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Wenlong Huang
Igor Mordatch
Deepak Pathak
51
167
0
09 Jul 2020
Self-Supervised Policy Adaptation during Deployment
Self-Supervised Policy Adaptation during Deployment
Nicklas Hansen
Rishabh Jangir
Yu Sun
Guillem Alenyà
Pieter Abbeel
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
41
159
0
08 Jul 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke
Joshua Achiam
Pieter Abbeel
31
287
0
08 Jul 2020
FlowControl: Optical Flow Based Visual Servoing
FlowControl: Optical Flow Based Visual Servoing
Max Argus
Lukás Hermann
Jon Long
Thomas Brox
25
25
0
01 Jul 2020
Vision-Based Goal-Conditioned Policies for Underwater Navigation in the
  Presence of Obstacles
Vision-Based Goal-Conditioned Policies for Underwater Navigation in the Presence of Obstacles
Travis Manderson
J. A. G. Higuera
Stefan Wapnick
J. Tremblay
Florian Shkurti
David Meger
Gregory Dudek
19
50
0
29 Jun 2020
Reinforcement Learning Control of Robotic Knee with Human in the Loop by
  Flexible Policy Iteration
Reinforcement Learning Control of Robotic Knee with Human in the Loop by Flexible Policy Iteration
Xiang Gao
J. Si
Yue Wen
Minhan Li
He
H. Huang
13
31
0
16 Jun 2020
Model-based Adversarial Meta-Reinforcement Learning
Model-based Adversarial Meta-Reinforcement Learning
Zichuan Lin
G. Thomas
Guangwen Yang
Tengyu Ma
OOD
27
52
0
16 Jun 2020
Meta-Reinforcement Learning Robust to Distributional Shift via Model
  Identification and Experience Relabeling
Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling
Russell Mendonca
Xinyang Geng
Chelsea Finn
Sergey Levine
OOD
OffRL
32
41
0
12 Jun 2020
Learning Navigation Costs from Demonstration with Semantic Observations
Learning Navigation Costs from Demonstration with Semantic Observations
Tianyu Wang
Vikas Dhiman
Nikolay Atanasov
38
4
0
09 Jun 2020
Can Temporal-Difference and Q-Learning Learn Representation? A
  Mean-Field Theory
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory
Yufeng Zhang
Qi Cai
Zhuoran Yang
Yongxin Chen
Zhaoran Wang
OOD
MLT
168
11
0
08 Jun 2020
Temporally-Extended ε-Greedy Exploration
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
22
34
0
02 Jun 2020
Sim2Real for Peg-Hole Insertion with Eye-in-Hand Camera
Sim2Real for Peg-Hole Insertion with Eye-in-Hand Camera
Damian Bogunowicz
A. Rybnikov
Komal Vendidandi
Fedor Chervinskii
17
7
0
29 May 2020
LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from
  Monocular Vision
LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision
Zheyu Zhuang
Xin Yu
Robert E. Mahony
42
5
0
25 May 2020
Guided Uncertainty-Aware Policy Optimization: Combining Learning and
  Model-Based Strategies for Sample-Efficient Policy Learning
Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning
Michelle A. Lee
Carlos Florensa
Jonathan Tremblay
Nathan D. Ratliff
Animesh Garg
Fabio Ramos
Dieter Fox
23
60
0
21 May 2020
TASO: Time and Space Optimization for Memory-Constrained DNN Inference
TASO: Time and Space Optimization for Memory-Constrained DNN Inference
Yuan Wen
Andrew Anderson
Valentin Radu
Michael F. P. O'Boyle
David Gregg
29
10
0
21 May 2020
Mirror Descent Policy Optimization
Mirror Descent Policy Optimization
Manan Tomar
Lior Shani
Yonathan Efroni
Mohammad Ghavamzadeh
25
83
0
20 May 2020
Automating Turbulence Modeling by Multi-Agent Reinforcement Learning
Automating Turbulence Modeling by Multi-Agent Reinforcement Learning
G. Novati
Hugues Lascombes de Laroussilhe
Petros Koumoutsakos
AI4CE
34
15
0
18 May 2020
A Distributional View on Multi-Objective Policy Optimization
A Distributional View on Multi-Objective Policy Optimization
A. Abdolmaleki
Sandy H. Huang
Leonard Hasenclever
Michael Neunert
H. F. Song
Martina Zambelli
M. Martins
N. Heess
R. Hadsell
Martin Riedmiller
26
74
0
15 May 2020
One-Shot Recognition of Manufacturing Defects in Steel Surfaces
One-Shot Recognition of Manufacturing Defects in Steel Surfaces
Aditya M. Deshpande
A. Minai
Manish Kumar
16
58
0
12 May 2020
Robotic Arm Control and Task Training through Deep Reinforcement
  Learning
Robotic Arm Control and Task Training through Deep Reinforcement Learning
Andrea Franceschetti
E. Tosello
Nicola Castaman
Stefano Ghidoni
15
32
0
06 May 2020
GCN-RL Circuit Designer: Transferable Transistor Sizing with Graph
  Neural Networks and Reinforcement Learning
GCN-RL Circuit Designer: Transferable Transistor Sizing with Graph Neural Networks and Reinforcement Learning
Hanrui Wang
Kuan-Chieh Wang
Jiacheng Yang
Linxiao Shen
Nan Sun
Hae-Seung Lee
Song Han
GNN
21
232
0
30 Apr 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement
  Learning
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
48
140
0
30 Apr 2020
Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic
  Reinforcement Learning
Never Stop Learning: The Effectiveness of Fine-Tuning in Robotic Reinforcement Learning
Ryan Julian
Benjamin Swanson
Gaurav Sukhatme
Sergey Levine
Chelsea Finn
Karol Hausman
OnRL
CLL
33
43
0
21 Apr 2020
Multi-Task Reinforcement Learning with Soft Modularization
Multi-Task Reinforcement Learning with Soft Modularization
Ruihan Yang
Huazhe Xu
Yi Wu
Xiaolong Wang
27
177
0
30 Mar 2020
When Autonomous Systems Meet Accuracy and Transferability through AI: A
  Survey
When Autonomous Systems Meet Accuracy and Transferability through AI: A Survey
Chongzhen Zhang
Jianrui Wang
Gary G. Yen
Chaoqiang Zhao
Qiyu Sun
Yang Tang
Feng Qian
Jürgen Kurths
AAML
35
20
0
29 Mar 2020
Learning to Fly via Deep Model-Based Reinforcement Learning
Learning to Fly via Deep Model-Based Reinforcement Learning
Philip Becker-Ehmck
Maximilian Karl
Jan Peters
Patrick van der Smagt
SSL
44
37
0
19 Mar 2020
Visual Task Progress Estimation with Appearance Invariant Embeddings for
  Robot Control and Planning
Visual Task Progress Estimation with Appearance Invariant Embeddings for Robot Control and Planning
Guilherme J. Maeda
Joni Väätäinen
Hironori Yoshida
22
2
0
16 Mar 2020
Machine Learning for Intelligent Optical Networks: A Comprehensive
  Survey
Machine Learning for Intelligent Optical Networks: A Comprehensive Survey
Rentao Gu
Zeyuan Yang
Yuefeng Ji
27
109
0
11 Mar 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
OffRL
36
17
0
09 Mar 2020
Transferable Task Execution from Pixels through Deep Planning Domain
  Learning
Transferable Task Execution from Pixels through Deep Planning Domain Learning
Kei Kase
Chris Paxton
H. Mazhar
T. Ogata
Dieter Fox
147
45
0
08 Mar 2020
Natural Language Processing Advancements By Deep Learning: A Survey
Natural Language Processing Advancements By Deep Learning: A Survey
A. Torfi
Rouzbeh A. Shirvani
Yaser Keneshloo
Nader Tavvaf
Edward A. Fox
AI4CE
VLM
85
216
0
02 Mar 2020
Robust-Adaptive Control of Linear Systems: beyond Quadratic Costs
Robust-Adaptive Control of Linear Systems: beyond Quadratic Costs
Edouard Leurent
D. Efimov
Odalric-Ambrym Maillard
22
3
0
25 Feb 2020
Learning to Walk in the Real World with Minimal Human Effort
Learning to Walk in the Real World with Minimal Human Effort
Sehoon Ha
P. Xu
Zhenyu Tan
Sergey Levine
Jie Tan
31
169
0
20 Feb 2020
Learning Pregrasp Manipulation of Objects from Ungraspable Poses
Learning Pregrasp Manipulation of Objects from Ungraspable Poses
Zhaole Sun
Kai Yuan
Wenbin Hu
Chuanyu Yang
Zhibin Li
SSL
27
28
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin
  Dynamics
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
V. Cevher
31
60
0
14 Feb 2020
Learning Functionally Decomposed Hierarchies for Continuous Control
  Tasks with Path Planning
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen
Lukás Jendele
Emre Aksan
Otmar Hilliges
OffRL
30
25
0
14 Feb 2020
Convergence Guarantees of Policy Optimization Methods for Markovian Jump
  Linear Systems
Convergence Guarantees of Policy Optimization Methods for Markovian Jump Linear Systems
Joao Paulo Jansch-Porto
Bin Hu
Geir Dullerud
27
35
0
10 Feb 2020
Autonomous quadrotor obstacle avoidance based on dueling double deep
  recurrent Q-learning with monocular vision
Autonomous quadrotor obstacle avoidance based on dueling double deep recurrent Q-learning with monocular vision
Jiajun Ou
Xiao Guo
Ming Zhu
Wenjie Lou
27
30
0
10 Feb 2020
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic
  with Advantage Weighted Mixture Policy(SAC-AWMP)
Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP)
Zhimin Hou
Kuangen Zhang
Yi Wan
Dongyu Li
Chenglong Fu
Haoyong Yu
27
15
0
07 Feb 2020
Constrained Upper Confidence Reinforcement Learning
Constrained Upper Confidence Reinforcement Learning
Liyuan Zheng
Lillian J. Ratliff
36
67
0
26 Jan 2020
Interpretable End-to-end Urban Autonomous Driving with Latent Deep
  Reinforcement Learning
Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
Jianyu Chen
Shengbo Eben Li
Masayoshi Tomizuka
57
226
0
23 Jan 2020
Gradient Surgery for Multi-Task Learning
Gradient Surgery for Multi-Task Learning
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
41
1,175
0
19 Jan 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
24
38
0
09 Jan 2020
Aggressive Perception-Aware Navigation using Deep Optical Flow Dynamics
  and PixelMPC
Aggressive Perception-Aware Navigation using Deep Optical Flow Dynamics and PixelMPC
Keuntaek Lee
Jason Gibson
Evangelos A. Theodorou
33
31
0
07 Jan 2020
A Boolean Task Algebra for Reinforcement Learning
A Boolean Task Algebra for Reinforcement Learning
Geraud Nangue Tasse
Steven D. James
Benjamin Rosman
22
54
0
06 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via
  Reward Network Distillation
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
30
38
0
02 Jan 2020
Previous
123...111213...171819
Next