2406.09246
Cited By
v1
v2 (latest)
OpenVLA: An Open-Source Vision-Language-Action Model
13 June 2024
Moo Jin Kim
Karl Pertsch
Siddharth Karamcheti
Ted Xiao
Ashwin Balakrishna
Suraj Nair
Rafael Rafailov
Ethan P. Foster
Grace Lam
Pannag R Sanketi
Quan Vuong
Thomas Kollar
Benjamin Burchfiel
Russ Tedrake
Dorsa Sadigh
Sergey Levine
Percy Liang
Chelsea Finn
LM&Ro
VLM
HuggingFace (40 upvotes)
Papers citing
"OpenVLA: An Open-Source Vision-Language-Action Model"
50 / 676 papers shown
Title
VacuumVLA: Boosting VLA Capabilities via a Unified Suction and Gripping Tool for Complex Robotic Manipulation
Hui Zhou
Siyuan Huang
Minxing Li
Hao Zhang
Lue Fan
Shaoshuai Shi
123
0
0
26 Nov 2025
TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos
Seungjae Lee
Yoonkyo Jung
Inkook Chun
Yao-Chih Lee
Zikui Cai
...
Aayush Talreja
Tan Dat Dao
Yongyuan Liang
Jia-Bin Huang
Furong Huang
28
0
0
26 Nov 2025
On the Feasibility of Hijacking MLLMs' Decision Chain via One Perturbation
Changyue Li
Jiaying Li
Youliang Yuan
Jiaming He
Zhicong Huang
Pinjia He
AAML
124
0
0
25 Nov 2025
Reinforcing Action Policies by Prophesying
Jiahui Zhang
Ze Huang
Chun Gu
Zipei Ma
Li Zhang
104
0
0
25 Nov 2025
DeeAD: Dynamic Early Exit of Vision-Language Action for Efficient Autonomous Driving
Haibo Hu
Lianming Huang
Nan Guan
Chun Jason Xue
VLM
133
0
0
25 Nov 2025
Unifying Perception and Action: A Hybrid-Modality Pipeline with Implicit Visual Chain-of-Thought for Robotic Action Generation
Xiangkai Ma
Lekai Xing
Han Zhang
Wenzhong Li
Sanglu Lu
LM&Ro
VGen
111
0
0
25 Nov 2025
LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight
Yunze Man
S. S. Wang
Guowen Zhang
Johan Bjorck
Zhiqi Li
Liang-Yan Gui
Jim Fan
Jan Kautz
Yu Wang
Zhiding Yu
60
0
0
25 Nov 2025
Dynamic Test-Time Compute Scaling in Control Policy: Difficulty-Aware Stochastic Interpolant Policy
Inkook Chun
Seungjae Lee
M. S. Albergo
Saining Xie
Eric Vanden-Eijnden
60
0
0
25 Nov 2025
Discover, Learn, and Reinforce: Scaling Vision-Language-Action Pretraining with Diverse RL-Generated Trajectories
Rushuai Yang
Zhiyuan Feng
Tianxiang Zhang
Kaixin Wang
Chuheng Zhang
Li Zhao
Xiu Su
Yi-Ling Chen
Jiang Bian
OffRL
149
0
0
24 Nov 2025
Mixture of Horizons in Action Chunking
Dong Jing
Gang Wang
Jiaqi Liu
Weiliang Tang
Zelong Sun
Yunchao Yao
Zhenyu Wei
Y. Liu
Zhiwu Lu
Mingyu Ding
114
0
0
24 Nov 2025
EchoVLA: Robotic Vision-Language-Action Model with Synergistic Declarative Memory for Mobile Manipulation
Min Lin
Xiwen Liang
Bingqian Lin
Liu Jingzhi
Zijian Jiao
...
Yuhan Ma
Yuecheng Liu
Shen Zhao
Yuzheng Zhuang
Xiaodan Liang
LM&Ro
175
0
0
22 Nov 2025
MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots
Ting Huang
Dongjian Li
Rui Yang
Zeyu Zhang
Zida Yang
Hao Tang
LRM
44
1
0
22 Nov 2025
VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation
Hanyu Zhou
Chuanhao Ma
Gim Hee Lee
88
0
0
21 Nov 2025
METIS: Multi-Source Egocentric Training for Integrated Dexterous Vision-Language-Action Model
Y. Fu
Ning Chen
Junkai Zhao
Shaozhe Shan
Guocai Yao
Pengwei Wang
Zhongyuan Wang
Shanghang Zhang
124
0
0
21 Nov 2025
SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding
Nikolay Nikolov
Giuliano Albanese
Sombit Dey
Aleksandar Yanev
Luc Van Gool
Jan-Nico Zaech
D. Paudel
LM&Ro
220
0
0
21 Nov 2025
IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation
Y. Li
Lichi Li
Anh Dao
Xinyu Zhou
Yicheng Qiao
...
Daeun Lee
Z. Chen
Zhen Tan
Mohit Bansal
Yu Kong
93
0
0
21 Nov 2025
H-GAR: A Hierarchical Interaction Framework via Goal-Driven Observation-Action Refinement for Robotic Manipulation
Yijie Zhu
Rui Shao
Ziyang Liu
Jie He
Jizhihui Liu
Jiuru Wang
Zitong Yu
110
1
0
21 Nov 2025
RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation
Shihan Wu
Xuecheng Liu
Shaoxuan Xie
Pengwei Wang
Xinghang Li
...
Tiejun Huang
Shanghang Zhang
Yonghua Lin
Zhongyuan Wang
Guocai Yao
132
0
0
21 Nov 2025
RynnVLA-002: A Unified Vision-Language-Action and World Model
Jun Cen
Siteng Huang
Yuqian Yuan
Kehan Li
Hangjie Yuan
...
Xin Li
Hao Luo
Fan Wang
Deli Zhao
H. Chen
VGen
SyDa
209
0
0
21 Nov 2025
Learning Diffusion Policies for Robotic Manipulation of Timber Joinery under Fabrication Uncertainty
Salma Mozaffari
Daniel Ruan
W. V. D. Bogert
Nima Fazeli
Sigrid Adriaenssens
Arash Adel
24
0
0
21 Nov 2025
Stable Offline Hand-Eye Calibration for any Robot with Just One Mark
Sicheng Xie
Lingchen Meng
Zhiying Du
Shuyuan Tu
Haidong Cao
Jiaqi Leng
Z. F. Wu
Yu-Gang Jiang
112
0
0
21 Nov 2025
When Alignment Fails: Multimodal Adversarial Attacks on Vision-Language-Action Models
Yuping Yan
Yuhan Xie
Yinxin Zhang
Lingjuan Lyu
Yaochu Jin
AAML
104
0
0
20 Nov 2025
Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight
Yi Yang
X. Li
Yiyang Chen
Jin Song
Yihan Wang
Zipeng Xiao
Jiadi Su
You Qiaoben
Pengfei Liu
Zhijie Deng
VLM
137
0
0
20 Nov 2025
BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
V. Bhat
Sungsu Kim
Valts Blukis
Greg Heinrich
Prashanth Krishnamurthy
Ramesh Karri
Stan Birchfield
Farshad Khorrami
Jonathan Tremblay
VLM
133
1
0
20 Nov 2025
InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy
Yang Tian
Yuyin Yang
Yiman Xie
Zetao Cai
Xu Shi
...
Ping Wang
Junhao Cai
Jia Zeng
Hao Dong
Jiangmiao Pang
72
0
0
20 Nov 2025
FT-NCFM: An Influence-Aware Data Distillation Framework for Efficient VLA Models
Kewei Chen
Yayu Long
Shuai Li
Mingsheng Shang
32
0
0
20 Nov 2025
VLA-Pruner: Temporal-Aware Dual-Level Visual Token Pruning for Efficient Vision-Language-Action Inference
Ziyan Liu
Y. Chen
Hongyi Cai
Tao Lin
Shuo Yang
Zheng Liu
Bo Zhao
VLM
211
0
0
20 Nov 2025
Theoretical Closed-loop Stability Bounds for Dynamical System Coupled with Diffusion Policies
Gabriel Lauzier
Alexandre Girard
François Ferland
44
0
0
19 Nov 2025
In-N-On: Scaling Egocentric Manipulation with in-the-wild and on-task Data
Xiongyi Cai
Ri-Zhao Qiu
Geng Chen
Lai Wei
Isabella Liu
Tianshu Huang
Xuxin Cheng
Xiaolong Wang
EgoV
229
1
0
19 Nov 2025
HMC: Learning Heterogeneous Meta-Control for Contact-Rich Loco-Manipulation
Lai Wei
Xuanbin Peng
Ri-Zhao Qiu
Tianshu Huang
Xuxin Cheng
Xiaolong Wang
40
1
0
18 Nov 2025
FlexiCup: Wireless Multimodal Suction Cup with Dual-Zone Vision-Tactile Sensing
Junhao Gong
Shoujie Li
Kit-Wa Sou
Changqing Guo
Hourong Huang
...
Yifan Xie
Chenxin Liang
Chuqiao Lyu
Xiaojun Liang
Wenbo Ding
88
0
0
18 Nov 2025
VLA-R: Vision-Language Action Retrieval toward Open-World End-to-End Autonomous Driving
Hyunki Seong
Seongwoo Moon
Hojin Ahn
Jehun Kang
David Hyunchul Shim
VLM
108
0
0
16 Nov 2025
AttackVLA: Benchmarking Adversarial and Backdoor Attacks on Vision-Language-Action Models
Jiayu Li
Yunhan Zhao
Xiang Zheng
Zonghuan Xu
Yige Li
Xingjun Ma
Yu-Gang Jiang
AAML
228
0
0
15 Nov 2025
Decoupled Action Head: Confining Task Knowledge to Conditioning Layers
Jian Zhou
Sihao Lin
Shuai Fu
Qi Wu
OffRL
56
0
0
15 Nov 2025
Learning a Thousand Tasks in a Day
Science Robotics (Sci. Robot.), 2025
Kamil Dreczkowski
Pietro Vitiello
Vitalis Vosylius
Edward Johns
OffRL
256
1
0
13 Nov 2025
Audio-VLA: Adding Contact Audio Perception to Vision-Language-Action Model for Robotic Manipulation
Xiangyi Wei
Haotian Zhang
Xinyi Cao
Siyu Xie
Weifeng Ge
Yang Li
C. Wang
156
0
0
13 Nov 2025
SemanticVLA: Semantic-Aligned Sparsification and Enhancement for Efficient Robotic Manipulation
W. Li
Renshan Zhang
Rui Shao
Zhijian Fang
Kaiwen Zhou
Zhuotao Tian
Liqiang Nie
235
1
0
13 Nov 2025
ViPRA: Video Prediction for Robot Actions
Sandeep Routray
Hengkai Pan
Unnat Jain
Shikhar Bahl
Deepak Pathak
162
0
0
11 Nov 2025
How Do VLAs Effectively Inherit from VLMs?
Chuheng Zhang
Rushuai Yang
Xiaoyu Chen
Kaixin Wang
Li Zhao
Yi-Ling Chen
Jiang Bian
LM&Ro
214
0
0
10 Nov 2025
SlotVLA: Towards Modeling of Object-Relation Representations in Robotic Manipulation
Taisei Hanyu
Nhat Chung
H. Le
T. Nguyen
Yuki Ikebe
...
Tung Kieu
Kashu Yamazaki
Chase Rainwater
A. Nguyen
Ngan Le
188
1
0
10 Nov 2025
ExpReS-VLA: Specializing Vision-Language-Action Models Through Experience Replay and Retrieval
Shahram Najam Syed
Yatharth Ahuja
Arthur Jakobsson
Jeff Ichnowski
VLM
61
0
0
09 Nov 2025
Towards Human-AI-Robot Collaboration and AI-Agent based Digital Twins for Parkinson's Disease Management: Review and Outlook
Hassan Hizeh
Rim Chighri
Muhammad Mahboob Ur Rahman
Mohamed A. Bahloul
Ali Muqaibel
Tareq Y. Al-Naffouri
64
0
0
08 Nov 2025
From Words to Safety: Language-Conditioned Safety Filtering for Robot Navigation
Zeyuan Feng
Haimingyue Zhang
Somil Bansal
48
0
0
08 Nov 2025
10 Open Challenges Steering the Future of Vision-Language-Action Models
Soujanya Poria
Navonil Majumder
Chia-Yu Hung
Amir Ali Bagherzadeh
Chuan Li
Kenneth Kwok
Z. Wang
Cheston Tan
Jiajun Wu
David Hsu
LM&Ro
VLM
243
0
0
08 Nov 2025
Visual Spatial Tuning
Rui Yang
Ziyu Zhu
Yanwei Li
Jingjia Huang
Shen Yan
...
Xiangtai Li
S. Li
Wenqian Wang
Yi Lin
Hengshuang Zhao
VLM
273
3
0
07 Nov 2025
Let Me Show You: Learning by Retrieving from Egocentric Video for Robotic Manipulation
Yichen Zhu
Feifei Feng
56
1
0
07 Nov 2025
EveryDayVLA: A Vision-Language-Action Model for Affordable Robotic Manipulation
Samarth Chopra
Alex McMoil
Ben Carnovale
Evan Sokolson
Rajkumar Kubendran
Samuel Dickerson
54
0
0
07 Nov 2025
GraSP-VLA: Graph-based Symbolic Action Representation for Long-Horizon Planning with VLA Policies
Maelic Neau
Zoe Falomir
Paulo E. Santos
Anne-Gwenn Bosser
Cédric Buche
52
0
0
06 Nov 2025
Real-to-Sim Robot Policy Evaluation with Gaussian Splatting Simulation of Soft-Body Interactions
Kaifeng Zhang
Shuo Sha
Hanxiao Jiang
M. Loper
Hyunjong Song
Guangyan Cai
Zhuo Xu
Xiaochen Hu
Changxi Zheng
Yunzhu Li
188
0
0
06 Nov 2025
Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment
Tao Lin
Yilei Zhong
Yuxin Du
Jingjing Zhang
Jiting Liu
...
Yanwen Zou
Lixing Zou
Zhaoye Zhou
Gen Li
Bo Zhao
VLM
98
2
0
06 Nov 2025