Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2203.06173
Cited By
Masked Visual Pre-training for Motor Control
11 March 2022
Tete Xiao
Ilija Radosavovic
Trevor Darrell
Jitendra Malik
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Masked Visual Pre-training for Motor Control"
50 / 218 papers shown
Title
Dexterity from Smart Lenses: Multi-Fingered Robot Manipulation with In-the-Wild Human Demonstrations
Irmak Güzey
Haozhi Qi
Julen Urain
Changhao Wang
Jessica Yin
...
A. Rai
Jitendra Malik
Tingfan Wu
Akash Sharma
Homanga Bharadhwaj
36
1
0
20 Nov 2025
Uni-Hand: Universal Hand Motion Forecasting in Egocentric Views
Junyi Ma
Wentao Bao
Jingyi Xu
Guanzhong Sun
Yu Zheng
Erhang Zhang
Xieyuanli Chen
Hesheng Wang
EgoV
235
0
0
17 Nov 2025
ViPRA: Video Prediction for Robot Actions
Sandeep Routray
Hengkai Pan
Unnat Jain
Shikhar Bahl
Deepak Pathak
162
0
0
11 Nov 2025
DynaRend: Learning 3D Dynamics via Masked Future Rendering for Robotic Manipulation
Jingyi Tian
Le Wang
Sanping Zhou
Sen Wang
Jiayi Li
Gang Hua
72
0
0
28 Oct 2025
Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Qixiu Li
Yu Deng
Yaobo Liang
L. Luo
Lei Zhou
...
Hao Chen
Lily Sun
Dong Chen
J. Yang
B. Guo
101
3
0
24 Oct 2025
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
GigaBrain Team
Angen Ye
Boyuan Wang
Chaojun Ni
Guan Huang
...
Yukun Zhou
Z. Dong
Z. J. Wang
Zhichao Liu
Zheng Hua Zhu
LM&Ro
VLM
325
4
0
22 Oct 2025
Learning to Grasp Anything by Playing with Random Toys
Dantong Niu
Yuvan Sharma
Baifeng Shi
Rachel Ding
Matteo Gioia
...
Anirudh Pai
Shankar Shastry
Trevor Darrell
Jitendra Malik
Roei Herzig
89
0
0
14 Oct 2025
Actron3D: Learning Actionable Neural Functions from Videos for Transferable Robotic Manipulation
Anran Zhang
Hanzhi Chen
Yannick Burkhardt
Yao Zhong
Johannes Betz
Helen Oleynikova
Stefan Leutenegger
60
1
0
14 Oct 2025
Population-Coded Spiking Neural Networks for High-Dimensional Robotic Control
Kanishkha Jaisankar
Xiaoyang Jiang
Feifan Liao
Jeethu Sreenivas Amuthan
68
0
0
12 Oct 2025
VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing
Yixiao Wang
Mingxiao Huo
Zhixuan Liang
Yushi Du
Lingfeng Sun
...
Jinghuan Shang
Chensheng Peng
Mohit Bansal
Mingyu Ding
Masayoshi Tomizuka
112
1
0
06 Oct 2025
StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation
Mingyu Liu
Jiuhe Shu
Hui Chen
Zeju Li
Canyu Zhao
J. Yang
Shenyuan Gao
Hao Chen
Chunhua Shen
76
1
0
06 Oct 2025
Best of Sim and Real: Decoupled Visuomotor Manipulation via Learning Control in Simulation and Perception in Real
Jialei Huang
Zhaoheng Yin
Yingdong Hu
S. Wang
Xingyu Lin
Yang Gao
60
0
0
30 Sep 2025
GLUE: Global-Local Unified Encoding for Imitation Learning via Key-Patch Tracking
Ye Chen
Zichen Zhou
Jianyu Dou
Te Cui
Yi Yang
Yufeng Yue
64
0
0
27 Sep 2025
exUMI: Extensible Robot Teaching System with Action-aware Task-agnostic Tactile Representation
Yue Xu
Litao Wei
Pengyu An
Qingyu Zhang
Yong-Lu Li
81
0
0
18 Sep 2025
Pre-trained Visual Representations Generalize Where it Matters in Model-Based Reinforcement Learning
Scott Jones
Liyou Zhou
Sebastian W. Pattinson
112
0
0
16 Sep 2025
4D Visual Pre-training for Robot Learning
Chengkai Hou
Yanjie Ze
Y. Fu
Zeyu Gao
Songbo Hu
Yue Yu
Shanghang Zhang
Huazhe Xu
144
3
0
24 Aug 2025
Robotic Manipulation via Imitation Learning: Taxonomy, Evolution, Benchmark, and Challenges
Zezeng Li
Alexandre Chapin
Enda Xiang
Rui Yang
Bruno Machado
Na Lei
Emmanuel Dellandrea
Di Huang
Liming Chen
191
2
0
24 Aug 2025
Video Generators are Robot Policies
Junbang Liang
P. Tokmakov
Ruoshi Liu
Sruthi Sudhakar
Paarth Shah
Rares Andrei Ambrus
Carl Vondrick
VGen
187
8
0
01 Aug 2025
FMimic: Foundation Models are Fine-grained Action Learners from Human Videos
The international journal of robotics research (IJRR), 2025
Guangyan Chen
Meiling Wang
Te Cui
Yao Mu
Haoyang Lu
...
Mengxiao Hu
Tianxing Zhou
M. Fu
Yi Yang
Yufeng Yue
LM&Ro
VLM
97
4
0
28 Jul 2025
GR-3 Technical Report
Chilam Cheang
S. Chen
Zhongren Cui
Yingdong Hu
Liqun Huang
...
Hongtao Wu
Xin Xiao
Yuyang Xiao
Jiafeng Xu
Yichu Yang
240
41
0
21 Jul 2025
Multimodal Visual Transformer for Sim2real Transfer in Visual Reinforcement Learning
Zichun Xu
Yuntao Li
Zhaomin Wang
Lei Zhuang
Guocai Yang
Jingdong Zhao
MDE
193
0
0
12 Jul 2025
Demonstrating Multi-Suction Item Picking at Scale via Multi-Modal Learning of Pick Success
Robotics (RAS), 2025
Che Wang
Jeroen van Baar
Chaitanya Mitash
Shuai-Peng Li
Dylan Randle
Weiyao Wang
Sumedh Sontakke
Kostas E. Bekris
Kapil Katyal
SSL
254
2
0
12 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TS
OffRL
AI4CE
212
2
0
10 Jun 2025
UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation
IEEE International Conference on Robotics and Automation (ICRA), 2025
Yihe Tang
Wenlong Huang
Yingke Wang
Chengshu Li
Roy Yuan
Ruohan Zhang
Jiajun Wu
Li Fei-Fei
212
12
0
10 Jun 2025
BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models
Peiyan Li
Yixiang Chen
Hongtao Wu
Xiao Ma
Xiangnan Wu
Y. Huang
Liang Wang
Tao Kong
Tieniu Tan
182
21
0
09 Jun 2025
Grounding Bodily Awareness in Visual Representations for Efficient Policy Learning
Junlin Wang
Zhiyun Lin
1.4K
0
0
24 May 2025
GLOVER++: Unleashing the Potential of Affordance Learning from Human Behaviors for Robotic Manipulation
Teli Ma
Jia Zheng
Zifan Wang
Ziyao Gao
Jiaming Zhou
Junwei Liang
255
4
0
17 May 2025
Efficient Robotic Policy Learning via Latent Space Backward Planning
Dongxiu Liu
Haoyi Niu
Zhihao Wang
Jinliang Zheng
Yinan Zheng
Zhonghong Ou
Jianming Hu
Jianxiong Li
Xianyuan Zhan
249
4
0
11 May 2025
π
0.5
π_{0.5}
π
0.5
: a Vision-Language-Action Model with Open-World Generalization
Physical Intelligence
Kevin Black
Noah Brown
James Darpinian
Karan Dhabalia
...
Homer Walke
Anna Walling
Haohuan Wang
Lili Yu
Ury Zhilinsky
LM&Ro
VLM
5.4K
300
0
22 Apr 2025
ViTaMIn: Learning Contact-Rich Tasks Through Robot-Free Visuo-Tactile Manipulation Interface
Fangchen Liu
Chuanyu Li
Yihua Qin
Ankit Shaw
Jinfeng Xu
Pieter Abbeel
338
14
0
08 Apr 2025
MAPLE: Encoding Dexterous Robotic Manipulation Priors Learned From Egocentric Videos
Alexey Gavryushin
Xi Wang
Robert J. S. Malate
Chenyu Yang
Xiaojun Jia
Shubh Goel
Davide Liconti
René Zurbrugg
173
3
0
08 Apr 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
484
0
0
05 Apr 2025
R900: Understanding the Cost-Effectiveness of Random Exploration from 900 Hours of Robotic Data Collection
Shutong Jin
Axel Kaliff
Ruiyu Wang
Muhammad Zahid
Florian T. Pokorny
VGen
161
0
0
30 Mar 2025
What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning
Chi-Hsi Kung
Frangil Ramirez
Juhyung Ha
Yi-Ting Chen
David J. Crandall
Yi-Hsuan Tsai
610
2
0
27 Mar 2025
VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation
Computer Vision and Pattern Recognition (CVPR), 2025
Hanzhi Chen
Boyang Sun
Anran Zhang
Marc Pollefeys
Stefan Leutenegger
LM&Ro
360
24
0
10 Mar 2025
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Xin Wen
Bingchen Zhao
Yilun Chen
Jiangmiao Pang
Xiaojuan Qi
LM&Ro
412
3
0
10 Mar 2025
SRSA: Skill Retrieval and Adaptation for Robotic Assembly Tasks
International Conference on Learning Representations (ICLR), 2025
Yijie Guo
Bingjie Tang
Iretiayo Akinola
Dieter Fox
Abhishek Gupta
Yashraj S. Narang
197
11
0
06 Mar 2025
A comparison of visual representations for real-world reinforcement learning in the context of vacuum gripping
Nico Sutter
Valentin N. Hartmann
Stelian Coros
OffRL
211
0
0
04 Mar 2025
Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Sicheng Xie
Haidong Cao
Zejia Weng
Zhen Xing
Shiwei Shen
Shiwei Shen
Jiaqi Leng
Yanwei Fu
Zuxuan Wu
339
8
0
23 Feb 2025
Pre-training Auto-regressive Robotic Models with 4D Representations
Dantong Niu
Yuvan Sharma
Haoru Xue
Giscard Biamby
Junyi Zhang
Ziteng Ji
Trevor Darrell
Roei Herzig
353
17
0
18 Feb 2025
Efficient Reinforcement Learning Through Adaptively Pretrained Visual Encoder
AAAI Conference on Artificial Intelligence (AAAI), 2025
Yuhan Zhang
Guoqing Ma
Guangfu Hao
Liangxuan Guo
Yang Chen
S. Yu
OnRL
331
1
0
08 Feb 2025
Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
International Conference on Learning Representations (ICLR), 2024
Yang Tian
Sizhe Yang
Jia Zeng
P. Wang
Dahua Lin
Hao Dong
Jiangmiao Pang
309
70
0
19 Dec 2024
Learning from Massive Human Videos for Universal Humanoid Pose Control
Jiageng Mao
Siheng Zhao
Siqi Song
Tianheng Shi
Junjie Ye
Mingtong Zhang
Haoran Geng
Jitendra Malik
Vitor Campagnolo Guizilini
Yue Wang
283
21
0
18 Dec 2024
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
IEEE International Conference on Robotics and Automation (ICRA), 2024
Xin Liu
Yaran Chen
Haoran Li
SSL
462
3
0
14 Dec 2024
Reinforcement Learning from Wild Animal Videos
Elliot Chane-Sane
Constant Roux
O. Stasse
Nicolas Mansard
877
1
0
05 Dec 2024
Stem-OB: Generalizable Visual Imitation Learning with Stem-Like Convergent Observation through Diffusion Inversion
International Conference on Learning Representations (ICLR), 2024
Kaizhe Hu
Zihang Rui
Yao He
Yuyao Liu
Pu Hua
Huazhe Xu
237
4
0
07 Nov 2024
DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning
G. Zhou
Hengkai Pan
Yann LeCun
Lerrel Pinto
VGen
LM&Ro
OffRL
298
94
0
07 Nov 2024
Pre-trained Visual Dynamics Representations for Efficient Policy Learning
European Conference on Computer Vision (ECCV), 2024
Hao Luo
Bohan Zhou
Zongqing Lu
207
2
0
05 Nov 2024
Sparsh: Self-supervised touch representations for vision-based tactile sensing
Conference on Robot Learning (CoRL), 2024
Carolina Higuera
Akash Sharma
Chaithanya Krishna Bodduluri
Taosha Fan
Patrick E. Lancaster
...
Michael Kaess
Byron Boots
Mike Lambeta
Tingfan Wu
Mustafa Mukadam
218
45
0
31 Oct 2024
Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets
International Conference on Learning Representations (ICLR), 2024
Guangqi Jiang
Yifei Sun
Tao Huang
Huanyu Li
Yongyuan Liang
Huazhe Xu
256
17
0
29 Oct 2024
1
2
3
4
5
Next