ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1806.10293
  4. Cited By
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic
  Manipulation

QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation

27 June 2018
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
Eric Jang
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
ArXivPDFHTML

Papers citing "QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation"

50 / 321 papers shown
Title
Guiding Data Collection via Factored Scaling Curves
Guiding Data Collection via Factored Scaling Curves
Lihan Zha
Apurva Badithela
Michael Zhang
Justin Lidard
Jeremy Bao
Emily Zhou
David Snyder
Allen Z. Ren
Dhruv Shah
Anirudha Majumdar
OffRL
34
0
0
12 May 2025
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions
Qingwen Bu
Yanting Yang
Jisong Cai
Shenyuan Gao
Guanghui Ren
Maoqing Yao
Ping Luo
Hongyang Li
119
0
0
09 May 2025
Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation
Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation
Abdulaziz Almuzairee
Rohan Patil
Dwait Bhatt
Henrik I. Christensen
34
0
0
07 May 2025
Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning
Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning
Malte Mosbach
Sven Behnke
31
0
0
04 May 2025
Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control
Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control
Haoran Wang
Zhiwei Shi
Chengxi Zhu
Yafei Qiao
Cheng Zhang
Fan Yang
Pengjie Ren
Lan Lu
D. Xuan
64
1
0
24 Apr 2025
PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation
PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation
Wenxuan Li
Hang Zhao
Zhiyuan Yu
Yu Du
Qin Zou
Ruizhen Hu
K. Xu
SSL
78
1
0
23 Apr 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
38
0
0
09 Apr 2025
Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs
Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs
Nicolas Le Roux
Marc G. Bellemare
Jonathan Lebensold
Arnaud Bergeron
Joshua Greaves
Alex Fréchette
Carolyne Pelletier
Eric Thibodeau-Laufer
Sándor Toth
Sam Work
OffRL
89
2
0
18 Mar 2025
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
Jiaming Liu
Hao Chen
Pengju An
Zhuoyang Liu
Renrui Zhang
...
Chengkai Hou
Mengdi Zhao
KC alex Zhou
Pheng-Ann Heng
S. Zhang
72
8
0
13 Mar 2025
PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM
Alan Dao
Dinh Bach Vu
Tuan Le Duc Anh
Bui Quang Huy
46
0
0
10 Mar 2025
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
T. Lee
Donghwan Lee
35
0
0
28 Feb 2025
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
SALSA-RL: Stability Analysis in the Latent Space of Actions for Reinforcement Learning
Xuyang Li
Romit Maulik
46
0
0
24 Feb 2025
MILE: Model-based Intervention Learning
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
88
2
0
21 Feb 2025
COMBO-Grasp: Learning Constraint-Based Manipulation for Bimanual Occluded Grasping
COMBO-Grasp: Learning Constraint-Based Manipulation for Bimanual Occluded Grasping
Jun Yamada
Alexander L. Mitchell
Jack Collins
Ingmar Posner
OffRL
90
0
0
17 Feb 2025
Adaptive Grasping of Moving Objects in Dense Clutter via Global-to-Local Detection and Static-to-Dynamic Planning
Adaptive Grasping of Moving Objects in Dense Clutter via Global-to-Local Detection and Static-to-Dynamic Planning
Hao Chen
Takuya Kiyokawa
Weiwei Wan
Kensuke Harada
56
0
0
09 Feb 2025
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation
Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation
Ziyang Xie
Zhizheng Liu
Zhenghao Peng
Wayne Wu
Bolei Zhou
VGen
48
3
0
12 Jan 2025
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning
Utsav Singh
Souradip Chakraborty
Wesley A Suttle
Brian M. Sadler
Vinay P. Namboodiri
Amrit Singh Bedi
OffRL
53
0
0
03 Jan 2025
Perception Stitching: Zero-Shot Perception Encoder Transfer for Visuomotor Robot Policies
Perception Stitching: Zero-Shot Perception Encoder Transfer for Visuomotor Robot Policies
Pingcheng Jian
Easop Lee
Zachary I. Bell
Michael M. Zavlanos
Boyuan Chen
77
1
0
03 Jan 2025
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
100
1
0
22 Dec 2024
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
Xin Liu
Yaran Chen
Haoran Li
SSL
94
0
0
14 Dec 2024
Environment as Policy: Learning to Race in Unseen Tracks
Environment as Policy: Learning to Race in Unseen Tracks
Hongze Wang
Jiaxu Xing
Nico Messikommer
Davide Scaramuzza
29
1
0
29 Oct 2024
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive
  Revaluation
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
Jaehyun Park
Yunho Kim
Sejin Kim
Byung-Jun Lee
Sundong Kim
OffRL
30
1
0
15 Oct 2024
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Songming Liu
Lingxuan Wu
Bangguo Li
Hengkai Tan
Huayu Chen
Zhengyi Wang
Ke Xu
Hang Su
Jun Zhu
34
77
0
10 Oct 2024
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Claire Chen
Shuze Liu
Shangtong Zhang
OffRL
100
1
0
08 Oct 2024
Synthesizing Interpretable Control Policies through Large Language Model Guided Search
Synthesizing Interpretable Control Policies through Large Language Model Guided Search
Carlo Bosio
Mark W. Mueller
26
0
0
07 Oct 2024
Predictive Coding for Decision Transformer
Predictive Coding for Decision Transformer
Tung M. Luu
Donghoon Lee
Chang D. Yoo
OffRL
58
2
0
04 Oct 2024
Doubly Optimal Policy Evaluation for Reinforcement Learning
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu
Claire Chen
Shangtong Zhang
OffRL
32
2
0
03 Oct 2024
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
Ricardo Garcia
Shizhe Chen
Cordelia Schmid
LM&Ro
39
7
0
02 Oct 2024
Autonomous loading of ore piles with Load-Haul-Dump machines using Deep
  Reinforcement Learning
Autonomous loading of ore piles with Load-Haul-Dump machines using Deep Reinforcement Learning
Rodrigo Salas
Francisco Leiva
Javier Ruiz-del-Solar
OffRL
18
0
0
11 Sep 2024
Points2Plans: From Point Clouds to Long-Horizon Plans with Composable Relational Dynamics
Points2Plans: From Point Clouds to Long-Horizon Plans with Composable Relational Dynamics
Yixuan Huang
Christopher Agia
Jimmy Wu
Tucker Hermans
Jeannette Bohg
3DPC
49
1
0
27 Aug 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
76
1
0
22 Aug 2024
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
89
3
0
20 Aug 2024
Answerability Fields: Answerable Location Estimation via Diffusion
  Models
Answerability Fields: Answerable Location Estimation via Diffusion Models
Daich Azuma
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
M. Kawanabe
DiffM
48
0
0
26 Jul 2024
GET-Zero: Graph Embodiment Transformer for Zero-shot Embodiment
  Generalization
GET-Zero: Graph Embodiment Transformer for Zero-shot Embodiment Generalization
Austin Patel
Shuran Song
LM&Ro
40
3
0
20 Jul 2024
Robotic Control via Embodied Chain-of-Thought Reasoning
Robotic Control via Embodied Chain-of-Thought Reasoning
Michał Zawalski
William Chen
Karl Pertsch
Oier Mees
Chelsea Finn
Sergey Levine
LRM
LM&Ro
34
54
0
11 Jul 2024
Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation
Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation
Jiaming Zhou
Teli Ma
Kun-Yu Lin
Ronghe Qiu
Zifan Wang
Junwei Liang
52
4
0
20 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
47
1
0
09 Jun 2024
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary
  Trajectories
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories
Qianlan Yang
Yu-Xiong Wang
OnRL
39
1
0
06 Jun 2024
Data Efficient Behavior Cloning for Fine Manipulation via
  Continuity-based Corrective Labels
Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels
Abhay Deshpande
Liyiming Ke
Quinn Pfeifer
Abhishek Gupta
S. Srinivasa
47
1
0
29 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Jianye Hao
Mingsheng Long
VGen
46
22
0
24 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
42
0
23 May 2024
Learning Future Representation with Synthetic Observations for
  Sample-efficient Reinforcement Learning
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
43
1
0
20 May 2024
On Robust Reinforcement Learning with Lipschitz-Bounded Policy Networks
On Robust Reinforcement Learning with Lipschitz-Bounded Policy Networks
Nicholas H. Barbara
Ruigang Wang
I. Manchester
37
4
0
19 May 2024
Policy Learning with a Language Bottleneck
Policy Learning with a Language Bottleneck
Megha Srivastava
Cédric Colas
Dorsa Sadigh
Jacob Andreas
40
3
0
07 May 2024
Rank2Reward: Learning Shaped Reward Functions from Passive Video
Rank2Reward: Learning Shaped Reward Functions from Passive Video
Daniel Yang
Davin Tjia
Jacob Berg
Dima Damen
Pulkit Agrawal
Abhishek Gupta
OffRL
37
5
0
23 Apr 2024
Lyapunov-stable Neural Control for State and Output Feedback: A Novel
  Formulation
Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation
Lujie Yang
Hongkai Dai
Zhouxing Shi
Cho-Jui Hsieh
Russ Tedrake
Huan Zhang
52
14
0
11 Apr 2024
AdaDemo: Data-Efficient Demonstration Expansion for Generalist Robotic
  Agent
AdaDemo: Data-Efficient Demonstration Expansion for Generalist Robotic Agent
Tongzhou Mu
Yijie Guo
Jie Xu
Ankit Goyal
Hao Su
Dieter Fox
Animesh Garg
LM&Ro
42
0
0
11 Apr 2024
STITCH: Augmented Dexterity for Suture Throws Including Thread
  Coordination and Handoffs
STITCH: Augmented Dexterity for Suture Throws Including Thread Coordination and Handoffs
Kush Hari
Hansoul Kim
Will Panitch
Kishore Srinivas
Vincent Schorp
K. Dharmarajan
Shreya Ganti
Tara Sadjadpour
Kenneth Y. Goldberg
33
6
0
08 Apr 2024
Entity-Centric Reinforcement Learning for Object Manipulation from
  Pixels
Entity-Centric Reinforcement Learning for Object Manipulation from Pixels
Dan Haramati
Tal Daniel
Aviv Tamar
LM&Ro
OffRL
OCL
37
10
0
01 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active
  Online Exploration
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
25
0
0
31 Mar 2024
1234567
Next