ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1610.00633
  4. Cited By
Deep Reinforcement Learning for Robotic Manipulation with Asynchronous
  Off-Policy Updates

Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates

3 October 2016
S. Gu
E. Holly
Timothy Lillicrap
Sergey Levine
    OffRL
    SSL
ArXivPDFHTML

Papers citing "Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates"

50 / 231 papers shown
Title
RMBench: Benchmarking Deep Reinforcement Learning for Robotic
  Manipulator Control
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
29
5
0
20 Oct 2022
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning
Henrique Donancio
L. Vercouter
H. Roclawski
AI4CE
18
1
0
20 Oct 2022
A Concise Introduction to Reinforcement Learning in Robotics
A Concise Introduction to Reinforcement Learning in Robotics
Akash Nagaraj
Mukund Sood
B. Patil
23
22
0
13 Oct 2022
How to Enable Uncertainty Estimation in Proximal Policy Optimization
How to Enable Uncertainty Estimation in Proximal Policy Optimization
Eugene Bykovets
Yannick Metz
Mennatallah El-Assady
Daniel A. Keim
J. M. Buhmann
UQCV
16
1
0
07 Oct 2022
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets
Chen Gong
Zhou Yang
Yunru Bai
Junda He
Jieke Shi
...
Arunesh Sinha
Bowen Xu
Xinwen Hou
David Lo
Guoliang Fan
AAML
OffRL
29
7
0
07 Oct 2022
Efficiently Learning Small Policies for Locomotion and Manipulation
Efficiently Learning Small Policies for Locomotion and Manipulation
Shashank Hegde
Gaurav Sukhatme
37
3
0
30 Sep 2022
Reward Shaping for User Satisfaction in a REINFORCE Recommender
Reward Shaping for User Satisfaction in a REINFORCE Recommender
Konstantina Christakopoulou
Can Xu
Sai Zhang
Sriraj Badam
Trevor Potter
...
Ya Le
Chris Berg
E. B. Dixon
Ed H. Chi
Minmin Chen
OffRL
25
8
0
30 Sep 2022
Backward Reachability Analysis of Neural Feedback Loops: Techniques for
  Linear and Nonlinear Systems
Backward Reachability Analysis of Neural Feedback Loops: Techniques for Linear and Nonlinear Systems
Nicholas Rober
Sydney M. Katz
Chelsea Sidrane
Esen Yel
Michael Everett
Mykel J. Kochenderfer
Jonathan P. How
37
26
0
28 Sep 2022
Design of experiments for the calibration of history-dependent models
  via deep reinforcement learning and an enhanced Kalman filter
Design of experiments for the calibration of history-dependent models via deep reinforcement learning and an enhanced Kalman filter
Ruben Villarreal
Nikolaos N. Vlassis
Nhon N. Phan
Tommie A. Catanach
Reese E. Jones
N. Trask
S. Kramer
WaiChing Sun
OffRL
32
11
0
27 Sep 2022
DEQGAN: Learning the Loss Function for PINNs with Generative Adversarial
  Networks
DEQGAN: Learning the Loss Function for PINNs with Generative Adversarial Networks
Blake Bullwinkel
Dylan Randle
P. Protopapas
David Sondak
26
3
0
15 Sep 2022
Decentralized Coordination in Partially Observable Queueing Networks
Decentralized Coordination in Partially Observable Queueing Networks
Jiekai Jia
Anam Tahir
Heinz Koeppl
39
1
0
29 Aug 2022
Robot Policy Learning from Demonstration Using Advantage Weighting and
  Early Termination
Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination
A. Mohtasib
Gerhard Neumann
Heriberto Cuayáhuitl
OffRL
44
2
0
31 Jul 2022
Deep Active Visual Attention for Real-time Robot Motion Generation:
  Emergence of Tool-body Assimilation and Adaptive Tool-use
Deep Active Visual Attention for Real-time Robot Motion Generation: Emergence of Tool-body Assimilation and Adaptive Tool-use
Hyogo Hiruma
Hiroshi Ito
Hiroki Mori
Tetsuya Ogata
28
5
0
29 Jun 2022
Low Emission Building Control with Zero-Shot Reinforcement Learning
Low Emission Building Control with Zero-Shot Reinforcement Learning
Scott Jeen
Alessandro Abate
Jonathan M. Cullen
AI4CE
25
5
0
28 Jun 2022
Meta Reinforcement Learning with Finite Training Tasks -- a Density
  Estimation Approach
Meta Reinforcement Learning with Finite Training Tasks -- a Density Estimation Approach
Zohar Rimon
Aviv Tamar
Gilad Adler
OOD
OffRL
36
8
0
21 Jun 2022
GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU
  Spatial Multiplexing
GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU Spatial Multiplexing
Yuke Wang
Boyuan Feng
Ziyi Wang
Tong Geng
Ang Li
Yufei Ding
AI4CE
49
0
0
16 Jun 2022
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning
  Framework
BulletArm: An Open-Source Robotic Manipulation Benchmark and Learning Framework
Dian Wang
Colin Kohler
Xu Zhu
Ming Jia
Robert Platt
32
9
0
28 May 2022
Neighborhood Mixup Experience Replay: Local Convex Interpolation for
  Improved Sample Efficiency in Continuous Control Tasks
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
Ryan M Sander
Wilko Schwarting
Tim Seyde
Igor Gilitschenski
S. Karaman
Daniela Rus
43
2
0
18 May 2022
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in
  Human Environments
Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments
Jakob Thumm
Matthias Althoff
58
34
0
12 May 2022
Training and Evaluation of Deep Policies using Reinforcement Learning
  and Generative Models
Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models
Ali Ghadirzadeh
Petra Poklukar
Karol Arndt
Chelsea Finn
Ville Kyrki
Danica Kragic
Mårten Björkman
OffRL
24
1
0
18 Apr 2022
Learning Design and Construction with Varying-Sized Materials via
  Prioritized Memory Resets
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li
Tao Kong
Lei Li
Yi Wu
58
4
0
12 Apr 2022
Configuration Path Control
Configuration Path Control
S. Pankov
27
1
0
05 Apr 2022
Asynchronous Reinforcement Learning for Real-Time Control of Physical
  Robots
Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Yufeng Yuan
Rupam Mahmood
OffRL
36
19
0
23 Mar 2022
Real Robot Challenge 2021: Cartesian Position Control with Triangle
  Grasp and Trajectory Interpolation
Real Robot Challenge 2021: Cartesian Position Control with Triangle Grasp and Trajectory Interpolation
Rishabh Madan
Harshit S. Sikchi
E. Gordon
Tapomayukh Bhattacharjee
24
0
0
16 Mar 2022
On-Robot Learning With Equivariant Models
On-Robot Learning With Equivariant Models
Dian Wang
Ming Jia
Xu Zhu
Robin Walters
Robert Platt
OffRL
SSL
33
36
0
09 Mar 2022
Multi-Agent Broad Reinforcement Learning for Intelligent Traffic Light
  Control
Multi-Agent Broad Reinforcement Learning for Intelligent Traffic Light Control
Ruijie Zhu
Lulu Li
Shuning Wu
Pei Lv
Yafai Li
Mingliang Xu
28
50
0
08 Mar 2022
Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement
  Learning
Cloud-Edge Training Architecture for Sim-to-Real Deep Reinforcement Learning
Hongpeng Cao
Mirco Theile
Federico G. Wyrwal
Marco Caccamo
43
6
0
04 Mar 2022
Graph Lifelong Learning: A Survey
Graph Lifelong Learning: A Survey
F. Febrinanto
Feng Xia
Kristen Moore
Chandra Thapa
Charu C. Aggarwal
CLL
AI4CE
44
51
0
22 Feb 2022
Intelligent Autonomous Intersection Management
Intelligent Autonomous Intersection Management
Udesh Gunarathna
S. Karunasekera
Renata Borovica-Gajic
E. Tanin
31
3
0
09 Feb 2022
Malleable Agents for Re-Configurable Robotic Manipulators
Malleable Agents for Re-Configurable Robotic Manipulators
Athindran Ramesh Kumar
Gurudutt Hosangadi
32
0
0
04 Feb 2022
Practical Imitation Learning in the Real World via Task Consistency Loss
Practical Imitation Learning in the Real World via Task Consistency Loss
Mohi Khansari
Daniel Ho
Yuqing Du
Armando Fuentes
Matthew Bennice
Nicolas Sievers
Sean Kirmani
Yunfei Bai
Eric Jang
SSL
24
8
0
03 Feb 2022
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal
  Point Processes
Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes
Chao Qu
Jue Chen
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
OffRL
30
17
0
29 Jan 2022
Can Wikipedia Help Offline Reinforcement Learning?
Can Wikipedia Help Offline Reinforcement Learning?
Machel Reid
Yutaro Yamada
S. Gu
3DV
RALM
OffRL
140
95
0
28 Jan 2022
Generative Planning for Temporally Coordinated Exploration in
  Reinforcement Learning
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei Xu
Haonan Yu
38
10
0
24 Jan 2022
Look Closer: Bridging Egocentric and Third-Person Views with
  Transformers for Robotic Manipulation
Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation
Rishabh Jangir
Nicklas Hansen
Sambaran Ghosal
Mohit Jain
Xiaolong Wang
32
66
0
19 Jan 2022
Modified DDPG car-following model with a real-world human driving
  experience with CARLA simulator
Modified DDPG car-following model with a real-world human driving experience with CARLA simulator
Dian-Tao Li
Ostap Okhrin
43
37
0
29 Dec 2021
Multiagent Model-based Credit Assignment for Continuous Control
Multiagent Model-based Credit Assignment for Continuous Control
Dongge Han
Chris Xiaoxuan Lu
Tomasz P. Michalak
Michael Wooldridge
27
5
0
27 Dec 2021
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Tianhao Wu
Yunchang Yang
Han Zhong
Liwei Wang
S. Du
Jiantao Jiao
55
14
0
21 Dec 2021
Tool as Embodiment for Recursive Manipulation
Tool as Embodiment for Recursive Manipulation
Yukiyasu Noguchi
T. Matsushima
Y. Matsuo
S. Gu
38
7
0
01 Dec 2021
Independent Learning in Stochastic Games
Independent Learning in Stochastic Games
Asuman Ozdaglar
M. O. Sayin
Kaipeng Zhang
21
22
0
23 Nov 2021
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Dealing with the Unknown: Pessimistic Offline Reinforcement Learning
Jinning Li
Chen Tang
Masayoshi Tomizuka
Wei Zhan
OffRL
19
21
0
09 Nov 2021
Time Discretization-Invariant Safe Action Repetition for Policy Gradient
  Methods
Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods
Seohong Park
Jaekyeom Kim
Gunhee Kim
38
23
0
06 Nov 2021
Causal versus Marginal Shapley Values for Robotic Lever Manipulation
  Controlled using Deep Reinforcement Learning
Causal versus Marginal Shapley Values for Robotic Lever Manipulation Controlled using Deep Reinforcement Learning
Sindre Benjamin Remman
Inga Strümke
A. Lekkas
CML
17
7
0
04 Nov 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State
  Covering and Goal Reaching
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
45
18
0
27 Oct 2021
Optimistic Policy Optimization is Provably Efficient in Non-stationary
  MDPs
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs
Han Zhong
Zhuoran Yang
Zhaoran Wang
Csaba Szepesvári
47
21
0
18 Oct 2021
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative
  Survey
Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey
Amjad Yousef Majid
Serge Saaybi
Tomas van Rietbergen
Vincent François-Lavet
R. V. Prasad
Chris Verhoeven
OffRL
64
55
0
28 Sep 2021
Deep Reinforcement Learning with Adjustments
Deep Reinforcement Learning with Adjustments
H. Khorasgani
Haiyan Wang
Chetan Gupta
Susumu Serita
23
2
0
28 Sep 2021
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement
  Learning
Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning
Nikita Rudin
David Hoeller
Philipp Reist
Marco Hutter
117
547
0
24 Sep 2021
Hierarchical Policy for Non-prehensile Multi-object Rearrangement with
  Deep Reinforcement Learning and Monte Carlo Tree Search
Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tree Search
Fan Bai
Fei Meng
Jianbang Liu
Jiankun Wang
Max Meng
23
6
0
18 Sep 2021
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual
  Patterns
Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns
Prasanth Buddareddygari
Travis Zhang
Yezhou Yang
Yi Ren
AAML
37
13
0
16 Sep 2021
Previous
12345
Next