Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.09246
Cited By
v1
v2 (latest)
OpenVLA: An Open-Source Vision-Language-Action Model
13 June 2024
Moo Jin Kim
Karl Pertsch
Siddharth Karamcheti
Ted Xiao
Ashwin Balakrishna
Suraj Nair
Rafael Rafailov
Ethan P. Foster
Grace Lam
Pannag R Sanketi
Quan Vuong
Thomas Kollar
Benjamin Burchfiel
Russ Tedrake
Dorsa Sadigh
Sergey Levine
Percy Liang
Chelsea Finn
LM&Ro
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (40 upvotes)
Papers citing
"OpenVLA: An Open-Source Vision-Language-Action Model"
50 / 723 papers shown
COMMET: A System for Human-Induced Conflicts in Mobile Manipulation of Everyday Tasks
Dongping Li
Shaoting Peng
John Pohovey
Katherine Rose Driggs-Campbell
117
0
0
05 Sep 2025
FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies
Moritz Reuss
Hongyi Zhou
Marcel Rühle
Ömer Erdinç Yagmurlu
Fabian Otto
Rudolf Lioutikov
LM&Ro
VLM
179
17
0
05 Sep 2025
FPC-VLA: A Vision-Language-Action Framework with a Supervisor for Failure Prediction and Correction
Yifan Yang
Zhixiang Duan
Tianshi Xie
Fuyu Cao
Pinxi Shen
...
Piaopiao Jin
Guokang Sun
Shaoqing Xu
Yangwei You
Jingtai Liu
218
6
0
04 Sep 2025
RL's Razor: Why Online Reinforcement Learning Forgets Less
Idan Shenfeld
Jyothish Pari
Pulkit Agrawal
CLL
194
43
0
04 Sep 2025
Long-Horizon Visual Imitation Learning via Plan and Code Reflection
Quan Chen
Chenrui Shi
Qi Chen
Yuwei Wu
Zhi Gao
Xintong Zhang
Rui Gao
Kun Wu
Yunde Jia
175
1
0
04 Sep 2025
EMMA: Scaling Mobile Manipulation via Egocentric Human Data
Lawrence Y. Zhu
Pranav Kuppili
Ryan Punamiya
Patcharapong Aphiwetsa
Dhruv Patel
Simar Kareer
Sehoon Ha
Danfei Xu
155
6
0
04 Sep 2025
Balancing Signal and Variance: Adaptive Offline RL Post-Training for VLA Flow Models
Hongyin Zhang
Shiyuan Zhang
Junxi Jin
Qixin Zeng
Yifan Qiao
Hongchao Lu
Xuetao Zhang
OffRL
142
6
0
04 Sep 2025
ANNIE: Be Careful of Your Robots
Yiyang Huang
Zixuan Wang
Zishen Wan
Yapeng Tian
Haobo Xu
Yinhe Han
Yiming Gan
AAML
147
0
0
03 Sep 2025
OmniActor: A Generalist GUI and Embodied Agent for 2D&3D Worlds
Longrong Yang
Zhixiong Zeng
Yufeng Zhong
Jing Huang
Liming Zheng
Lei Chen
Haibo Qiu
Zequn Qin
Lin Ma
Xi Li
LLMAG
LM&Ro
141
3
0
02 Sep 2025
U-ARM : Ultra low-cost general teleoperation interface for robot manipulation
Yanwen Zou
Zhaoye Zhou
Chenyang Shi
Zewei Ye
Junda Huang
Yan Ding
Bo Zhao
213
0
0
02 Sep 2025
Align-Then-stEer: Adapting the Vision-Language Action Models through Unified Latent Guidance
Y. Zhang
C. Wang
Ouyang Lu
Yuan Zhao
Yunfei Ge
Zhenglong Sun
Xiu Li
Chi Zhang
Chenjia Bai
Xuelong Li
247
6
0
02 Sep 2025
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots
Minghuan Liu
Zhengbang Zhu
Xiaoshen Han
Peng Hu
Haotong Lin
...
Xinghang Li
Yong Yu
Weinan Zhang
Tao Kong
Bingyi Kang
130
4
0
02 Sep 2025
MoTo: A Zero-shot Plug-in Interaction-aware Navigation for General Mobile Manipulation
Zhenyu Wu
Angyuan Ma
Xiuwei Xu
Hang Yin
Yinan Liang
Ziwei Wang
Jiwen Lu
Haibin Yan
LM&Ro
176
3
0
01 Sep 2025
Articulated Object Estimation in the Wild
Abdelrhman Werby
Martin Buchner
Adrian Rofer
Chenguang Huang
Wolfram Burgard
Abhinav Valada
204
6
0
01 Sep 2025
Mechanistic interpretability for steering vision-language-action models
Bear Häon
Kaylene C. Stocking
Ian Chuang
Claire Tomlin
LLMSV
175
2
0
30 Aug 2025
Galaxea Open-World Dataset and G0 Dual-System VLA Model
Tao Jiang
Tianyuan Yuan
Yicheng Liu
Chenhao Lu
Jianning Cui
Xiao Liu
Shuiqi Cheng
Jiyang Gao
Huazhe Xu
Hang Zhao
LM&Ro
129
24
0
30 Aug 2025
ManipDreamer3D : Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory
Ying Li
Xiaobao Wei
Yatian Wang
Yuming Li
Zhongyu Zhao
Hao Wang
Ningning MA
Ming Lu
Shanghang Zhang
Shanghang Zhang
VGen
378
9
0
29 Aug 2025
RoboInspector: Unveiling the Unreliability of Policy Code for LLM-enabled Robotic Manipulation
Chenduo Ying
L. Du
Peng Cheng
Yuanchao Shu
160
0
0
29 Aug 2025
Prompt-to-Product: Generative Assembly via Bimanual Manipulation
Ruixuan Liu
Philip Huang
Ava Pun
Kangle Deng
Shobhit Aggarwal
...
M. Liu
Deva Ramanan
Jun-Yan Zhu
Jiaoyang Li
Changliu Liu
100
0
0
28 Aug 2025
Learning Primitive Embodied World Models: Towards Scalable Robotic Learning
Qiao Sun
Liujia Yang
Wei Tang
Wei Huang
Kaixin Xu
...
Tong He
Yilun Chen
Xili Dai
Nanyang Ye
Qinying Gu
VGen
LM&Ro
421
1
0
28 Aug 2025
CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
Wei Li
Renshan Zhang
Rui Shao
Jie He
Liqiang Nie
VLM
224
23
0
28 Aug 2025
EO-1: Interleaved Vision-Text-Action Pretraining for General Robot Control
Delin Qu
Haoming Song
Qizhi Chen
Zhaoqing Chen
Xianqiang Gao
...
Maoqing Yao
Haoran Yang
Jiacheng Bao
Jiangwei Zhong
Dong Wang
LM&Ro
335
5
0
28 Aug 2025
Embodied AI: Emerging Risks and Opportunities for Policy Action
Jared Perlo
Alexander Robey
Fazl Barez
Luciano Floridi
Jakob Mokander
315
2
0
28 Aug 2025
Ego-centric Predictive Model Conditioned on Hand Trajectories
Binjie Zhang
Mike Zheng Shou
EgoV
317
0
0
27 Aug 2025
Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies
Zhixuan Liang
Yizhuo Li
Tianshuo Yang
Chengyue Wu
Sitong Mao
...
Jiangmiao Pang
Xiaokang Yang
Ping Luo
Yao Mu
Ping Luo
179
30
0
27 Aug 2025
Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation
Yiguo Fan
Pengxiang Ding
Shuanghao Bai
Xinyang Tong
Yuyang Zhu
...
Yang Liu
Siteng Huang
Zhaoxin Fan
Badong Chen
Xuetao Zhang
207
12
0
27 Aug 2025
HyperTASR: Hypernetwork-Driven Task-Aware Scene Representations for Robust Manipulation
Li Sun
Jiefeng Wu
Feng Chen
Ruizhe Liu
Yanchao Yang
211
1
0
26 Aug 2025
MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
Hao Shi
Bin Xie
Yingfei Liu
Lin Sun
Fengrong Liu
Tiancai Wang
Erjin Zhou
Haoqiang Fan
Xiangyu Zhang
Gao Huang
LM&Ro
132
29
0
26 Aug 2025
SEBVS: Synthetic Event-based Visual Servoing for Robot Navigation and Manipulation
Krishna Vinod
Prithvi Jai Ramesh
Pavan Kumar B N
Bharatesh Chakravarthi
88
1
0
25 Aug 2025
HLG: Comprehensive 3D Room Construction via Hierarchical Layout Generation
Xiping Wang
Yuxi Wang
Mengqi Zhou
Junsong Fan
Zhaoxiang Zhang
3DV
150
0
0
25 Aug 2025
FlowVLA: Visual Chain of Thought-based Motion Reasoning for Vision-Language-Action Models
Zhide Zhong
Haodong Yan
Junfeng Li
Xiangchen Liu
Xin Gong
...
Wenxuan Song
Jiayi Chen
Xinhu Zheng
Hesheng Wang
Haoang Li
LRM
VGen
234
3
0
25 Aug 2025
GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Guanxing Lu
Baoxiong Jia
Puhao Li
Yixin Chen
Ziwei Wang
Yansong Tang
Siyuan Huang
3DGS
220
10
0
25 Aug 2025
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Shouwei Ruan
Liyuan Wang
Caixin Kang
Qihui Zhu
Songming Liu
Xingxing Wei
Hang Su
LM&Ro
163
5
0
24 Aug 2025
Robotic Manipulation via Imitation Learning: Taxonomy, Evolution, Benchmark, and Challenges
Zezeng Li
Alexandre Chapin
Enda Xiang
Rui Yang
Bruno Machado
Na Lei
Emmanuel Dellandrea
Di Huang
Liming Chen
267
3
0
24 Aug 2025
NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Nikita Lyubaykin
Andrei Polubarov
Alexander Derevyagin
Vladislav Kurenkov
AI4CE
218
2
0
23 Aug 2025
Do What? Teaching Vision-Language-Action Models to Reject the Impossible
Wen-Han Hsieh
Elvis Hsieh
Dantong Niu
Trevor Darrell
Roei Herzig
David M. Chan
LM&Ro
178
2
0
22 Aug 2025
Survey of Vision-Language-Action Models for Embodied Manipulation
Haoran Li
Yuhui Chen
Wenbo Cui
Weiheng Liu
Kai Liu
Mingcai Zhou
Zhengtao Zhang
Dongbin Zhao
LM&Ro
476
4
0
21 Aug 2025
TransLLM: A Unified Multi-Task Foundation Framework for Urban Transportation via Learnable Prompting
Jiaming Leng
Yunying Bi
Chuan Qin
Bing Yin
Yanyong Zhang
Chao Wang
AI4TS
101
0
0
20 Aug 2025
The Social Context of Human-Robot Interactions
Sydney Thompson
Kate Candon
Marynel Vázquez
97
2
0
19 Aug 2025
CAST: Counterfactual Labels Improve Instruction Following in Vision-Language-Action Models
Catherine Glossop
William Chen
Arjun Bhorkar
Dhruv Shah
Sergey Levine
LM&Ro
203
6
0
19 Aug 2025
Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation
Yifu Yuan
Haiqin Cui
Yaoting Huang
Yibin Chen
Fei Ni
Zibin Dong
Pengyi Li
Yan Zheng
Jianye Hao
LM&Ro
209
16
0
19 Aug 2025
Train Once, Deploy Anywhere: Realize Data-Efficient Dynamic Object Manipulation
Zhuoling Li
Xiaoyang Wu
Zhenhua Xu
Hengshuang Zhao
122
1
0
19 Aug 2025
Grounding Actions in Camera Space: Observation-Centric Vision-Language-Action Policy
Tianyi Zhang
Haonan Duan
Haoran Hao
Yu Qiao
Jifeng Dai
Zhi Hou
147
3
0
18 Aug 2025
Large VLM-based Vision-Language-Action Models for Robotic Manipulation: A Survey
Rui Shao
W. Li
Lingsen Zhang
Renshan Zhang
Zhiyang Liu
Ran Chen
Liqiang Nie
LM&Ro
249
31
0
18 Aug 2025
Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
Zhongang Cai
Yubo Wang
Qingping Sun
Ruisi Wang
Chenyang Gu
...
Quan-ding Wang
Dahua Lin
Lei Yang
Dahua Lin
L. Yang
ELM
272
0
0
18 Aug 2025
Improving Pre-Trained Vision-Language-Action Policies with Model-Based Search
Cyrus Neary
Omar G. Younis
Artur Kuramshin
Ozgur Aslan
Glen Berseth
140
6
0
17 Aug 2025
Human Centric General Physical Intelligence for Agile Manufacturing Automation
Sandeep Kanta
Mehrdad Tavassoli
Varun Teja Chirkuri
Venkata Akhil Kumar
Santhi Bharath Punati
Praveen Damacharla
S. Katyara
AI4CE
137
1
0
16 Aug 2025
OmniD: Generalizable Robot Manipulation Policy via Image-Based BEV Representation
Jilei Mao
Jiarui Guan
Yingjuan Tang
Qirui Hu
Zhihang Li
Junjie Yu
Yongjie Mao
Yunzhe Sun
Shuang Liu
Xiaozhu Ju
105
2
0
16 Aug 2025
Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation
Hongbin Lin
Juan Rojas
K. W. S. Au
170
1
0
15 Aug 2025
ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving
Jingyu Li
Bozhou Zhang
Jianfeng Dong
Jiankang Deng
Xiatian Zhu
Li Zhang
163
2
0
15 Aug 2025
Previous
1
2
3
...
6
7
8
...
13
14
15
Next
Page 7 of 15
Page
of 15
Go