
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation

27 June 2018
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
Eric Jang
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine

Papers citing "QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation"

50 / 321 papers shown
Example-Driven Model-Based Reinforcement Learning for Solving Long-Horizon Visuomotor Tasks
Bohan Wu
Suraj Nair
Li Fei-Fei
Chelsea Finn
OffRL
LM&Ro
38
24
0
21 Sep 2021
Carl-Lead: Lidar-based End-to-End Autonomous Driving with Contrastive Deep Reinforcement Learning
Peide Cai
Sukai Wang
Hengli Wang
Ming-Yu Liu
AI4TS
22
15
0
17 Sep 2021
ThriftyDAgger: Budget-Aware Novelty and Risk Gating for Interactive Imitation Learning
Ryan Hoque
Ashwin Balakrishna
Ellen R. Novoseller
Albert Wilcox
Daniel S. Brown
Ken Goldberg
32
84
0
17 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
35
77
0
16 Sep 2021
Multi-Task Learning with Sequence-Conditioned Transporter Networks
M. H. Lim
Andy Zeng
Brian Ichter
Maryam Bandari
Erwin Coumans
Claire Tomlin
S. Schaal
Aleksandra Faust
37
14
0
15 Sep 2021
Implicit Behavioral Cloning
Peter R. Florence
Corey Lynch
Andy Zeng
Oscar Ramirez
Ayzaan Wahid
Laura Downs
Adrian S. Wong
Johnny Lee
Igor Mordatch
Jonathan Tompson
OffRL
49
368
0
01 Sep 2021
Investigating Vulnerabilities of Deep Neural Policies
Ezgi Korkmaz
AAML
18
33
0
30 Aug 2021
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
Chen Wang
Claudia Pérez-D'Arpino
Danfei Xu
Li Fei-Fei
C. Karen Liu
Silvio Savarese
42
33
0
13 Aug 2021
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
Xiaofei Wang
Kimin Lee
Kourosh Hakhamaneshi
Pieter Abbeel
Michael Laskin
17
42
0
11 Aug 2021
ManiSkill: Generalizable Manipulation Skill Benchmark with Large-Scale Demonstrations
Tongzhou Mu
Z. Ling
Fanbo Xiang
Derek Yang
Xuanlin Li
Stone Tao
Zhiao Huang
Zhiwei Jia
Hao Su
39
130
0
30 Jul 2021
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings
Shengpu Tang
Jenna Wiens
OffRL
26
78
0
23 Jul 2021
Playful Interactions for Representation Learning
Sarah Young
Jyothish Pari
Pieter Abbeel
Lerrel Pinto
SSL
44
14
0
19 Jul 2021
Model-free Reinforcement Learning for Robust Locomotion using Demonstrations from Trajectory Optimization
Miroslav Bogdanovic
Majid Khadiv
Ludovic Righetti
114
30
0
14 Jul 2021
Hierarchical Neural Dynamic Policies
Shikhar Bahl
Abhinav Gupta
Deepak Pathak
BDL
25
27
0
12 Jul 2021
LS3: Latent Space Safe Sets for Long-Horizon Visuomotor Control of Sparse Reward Iterative Tasks
Albert Wilcox
Ashwin Balakrishna
Brijen Thananjeyan
Joseph E. Gonzalez
Ken Goldberg
26
11
0
10 Jul 2021
Hierarchical Policies for Cluttered-Scene Grasping with Latent Plans
Lirui Wang
Xiangyun Meng
Yu Xiang
D. Fox
3DPC
DRL
21
27
0
04 Jul 2021
Learning to See before Learning to Act: Visual Pre-training for Manipulation
Yen-Chen Lin
Andy Zeng
Shuran Song
Phillip Isola
Nayeon Lee
SSL
16
87
0
01 Jul 2021
Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation
Nicklas Hansen
H. Su
Xiaolong Wang
OffRL
26
134
0
01 Jul 2021
SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo
Thomas Kollar
Michael Laskey
Kevin Stone
Brijen Thananjeyan
Mark Tjersland
48
25
0
30 Jun 2021
Hierarchically Integrated Models: Learning to Navigate from Heterogeneous Robots
Katie Kang
G. Kahn
Sergey Levine
37
5
0
24 Jun 2021
Which Mutual-Information Representation Learning Objectives are Sufficient for Control?
Kate Rakelly
Abhishek Gupta
Carlos Florensa
Sergey Levine
SSL
26
38
0
14 Jun 2021
PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning
Tao Yu
Cuiling Lan
Wenjun Zeng
Mingxiao Feng
Zhizheng Zhang
Zhibo Chen
OffRL
20
46
0
08 Jun 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDL
OffRL
OnRL
21
184
0
17 May 2021
Robotic Surgery With Lean Reinforcement Learning
Yotam Barnoy
Molly O'Brien
Luu Anh Tuan
Gregory D. Hager
OffRL
41
20
0
03 May 2021
InsertionNet -- A Scalable Solution for Insertion
Oren Spector
Dotan Di Castro
24
51
0
29 Apr 2021
MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale
Dmitry Kalashnikov
Jacob Varley
Yevgen Chebotar
Benjamin Swanson
Rico Jonschkowski
Chelsea Finn
Sergey Levine
Karol Hausman
OffRL
47
270
0
16 Apr 2021
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills
Yevgen Chebotar
Karol Hausman
Yao Lu
Ted Xiao
Dmitry Kalashnikov
...
A. Irpan
Benjamin Eysenbach
Ryan Julian
Chelsea Finn
Sergey Levine
SSL
OffRL
32
146
0
15 Apr 2021
Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos
Annie S. Chen
Suraj Nair
Chelsea Finn
32
137
0
31 Mar 2021
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
35
100
0
30 Mar 2021
Visionary: Vision architecture discovery for robot learning
Iretiayo Akinola
A. Angelova
Yao Lu
Yevgen Chebotar
Dmitry Kalashnikov
Jacob Varley
Julian Ibarz
Michael S. Ryoo
24
10
0
26 Mar 2021
Self-Imitation Learning by Planning
Junhyuk Oh
Yijie Guo
Satinder Singh
SSL
26
86
0
25 Mar 2021
CLAMGen: Closed-Loop Arm Motion Generation via Multi-view Vision-Based RL
Iretiayo Akinola
Zizhao Wang
Peter K. Allen
37
2
0
24 Mar 2021
Lyapunov Barrier Policy Optimization
Harshit S. Sikchi
Wenxuan Zhou
David Held
26
14
0
16 Mar 2021
Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model
Thanh Nguyen
Tung M. Luu
Thang Vu
Chang D. Yoo
15
17
0
15 Mar 2021
Training a Resilient Q-Network against Observational Interference
Chao-Han Huck Yang
I-Te Danny Hung
Ouyang Yi
Pin-Yu Chen
OOD
26
14
0
18 Feb 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
219
413
0
16 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
18
18
0
13 Feb 2021
Decoupled Exploration and Exploitation Policies for Sample-Efficient Reinforcement Learning
William F. Whitney
Michael Bloesch
Jost Tobias Springenberg
A. Abdolmaleki
Kyunghyun Cho
Martin Riedmiller
OffRL
29
13
0
23 Jan 2021
Memory-Augmented Reinforcement Learning for Image-Goal Navigation
Lina Mezghani
Sainbayar Sukhbaatar
Thibaut Lavril
Oleksandr Maksymets
Dhruv Batra
Piotr Bojanowski
Alahari Karteek
26
69
0
13 Jan 2021
The Distracting Control Suite -- A Challenging Benchmark for Reinforcement Learning from Pixels
Austin Stone
Oscar Ramirez
K. Konolige
Rico Jonschkowski
137
101
0
07 Jan 2021
Self-supervised Visual Reinforcement Learning with Object-centric Representations
Andrii Zadaianchuk
Maximilian Seitzer
Georg Martius
SSL
OCL
27
41
0
29 Nov 2020
From Pixels to Legs: Hierarchical Learning of Quadruped Locomotion
Deepali Jain
Atil Iscen
Ken Caluwaerts
21
35
0
23 Nov 2020
Distributed Deep Reinforcement Learning: An Overview
Mohammad Reza Samsami
Hossein Alimadad
OffRL
14
27
0
22 Nov 2020
Learning Dense Rewards for Contact-Rich Manipulation Tasks
Zheng Wu
Wenzhao Lian
Vaibhav Unhelkar
M. Tomizuka
S. Schaal
8
37
0
17 Nov 2020
A Geometric Perspective on Self-Supervised Policy Adaptation
Cristian Bodnar
Karol Hausman
Gabriel Dulac-Arnold
Rico Jonschkowski
SSL
44
5
0
14 Nov 2020
Joint Space Control via Deep Reinforcement Learning
Visak C. V. Kumar
David Hoeller
Balakumar Sundaralingam
Jonathan Tremblay
Stan Birchfield
DRL
25
15
0
12 Nov 2020
RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer
Daniel Ho
Kanishka Rao
Zhuo Xu
Eric Jang
Mohi Khansari
Yunfei Bai
GAN
LM&Ro
45
97
0
06 Nov 2020
Learning Robot Trajectories subject to Kinematic Joint Constraints
Jonas C. Kiemel
Torsten Kröger
9
7
0
01 Nov 2020
COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
Avi Singh
Albert Yu
Jonathan Yang
Jesse Zhang
Aviral Kumar
Sergey Levine
SSL
OffRL
OnRL
35
103
0
27 Oct 2020
High Acceleration Reinforcement Learning for Real-World Juggling with Binary Rewards
Kai Ploeger
M. Lutter
Jan Peters
8
29
0
26 Oct 2020