Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.09246
Cited By
v1
v2 (latest)
OpenVLA: An Open-Source Vision-Language-Action Model
13 June 2024
Moo Jin Kim
Karl Pertsch
Siddharth Karamcheti
Ted Xiao
Ashwin Balakrishna
Suraj Nair
Rafael Rafailov
Ethan P. Foster
Grace Lam
Pannag R Sanketi
Quan Vuong
Thomas Kollar
Benjamin Burchfiel
Russ Tedrake
Dorsa Sadigh
Sergey Levine
Percy Liang
Chelsea Finn
LM&Ro
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (40 upvotes)
Papers citing
"OpenVLA: An Open-Source Vision-Language-Action Model"
50 / 723 papers shown
DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models
Cheng Yin
Yankai Lin
Wang Xu
Sikyuen Tam
Xiangrui Zeng
Zhiyuan Liu
Zhouping Yin
LRM
188
1
0
31 Oct 2025
Learning Generalizable Visuomotor Policy through Dynamics-Alignment
Dohyeok Lee
Jung Min Lee
Munkyung Kim
Seokhun Ju
Jin Woo Koo
Kyungjae Lee
Dohyeong Kim
Taehyun Cho
Jungwoo Lee
111
0
0
31 Oct 2025
Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model
John Won
Kyungmin Lee
Huiwon Jang
Dongyoung Kim
Jinwoo Shin
209
4
0
31 Oct 2025
Towards a Multi-Embodied Grasping Agent
Roman Freiberg
Alexander Qualmann
Ngo Anh Vien
Gerhard Neumann
168
0
0
31 Oct 2025
A Step Toward World Models: A Survey on Robotic Manipulation
Peng-Fei Zhang
Ying Cheng
Xiaofan Sun
S. Wang
Lei Zhu
Lei Zhu
Heng Tao Shen
LM&Ro
757
3
0
31 Oct 2025
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks
Xu Zheng
Zihao Dongfang
Lutao Jiang
Boyuan Zheng
Yulong Guo
...
L. Zhang
Danda Pani Paudel
Nicu Sebe
Luc Van Gool
Xuming Hu
LRM
VLM
731
5
0
29 Oct 2025
Language-Conditioned Representations and Mixture-of-Experts Policy for Robust Multi-Task Robotic Manipulation
Xiucheng Zhang
Yang Jiang
Hongwei Qing
Jiashuo Bai
LM&Ro
157
0
0
28 Oct 2025
BLM
1
_1
1
: A Boundless Large Model for Cross-Space, Cross-Task, and Cross-Embodiment Learning
Wentao Tan
Bowen Wang
Heng Zhi
Chenyu Liu
Z. Li
...
Chen Xu
Zhibin Wang
Tianshi Wang
Lei Zhu
Heng Tao Shen
LM&Ro
175
0
0
28 Oct 2025
Reliable Robotic Task Execution in the Face of Anomalies
Bharath Santhanam
Alex Mitrevski
Santosh Thoduka
Sebastian Houben
Teena Hassan
115
0
0
27 Oct 2025
OmniDexGrasp: Generalizable Dexterous Grasping via Foundation Model and Force Feedback
Yi-Lin Wei
Zhexi Luo
Yuhao Lin
Mu Lin
Zhizhao Liang
Shuoyu Chen
Wei-Shi Zheng
106
1
0
27 Oct 2025
RoboOmni: Proactive Robot Manipulation in Omni-modal Context
Siyin Wang
Jinlan Fu
Feihong Liu
Xinzhe He
Huangxuan Wu
...
Z. F. Wu
Yugang Jiang
See-Kiong Ng
Tat-Seng Chua
Xipeng Qiu
LM&Ro
302
1
0
27 Oct 2025
UrbanVLA: A Vision-Language-Action Model for Urban Micromobility
Anqi Li
Z. T. Wang
JIazhao Zhang
Minghan Li
Y. Qi
Zhibo Chen
Zhizheng Zhang
He Wang
148
1
0
27 Oct 2025
RobotArena
∞
\infty
∞
: Scalable Robot Benchmarking via Real-to-Sim Translation
Yash Jangir
Yidi Zhang
Kashu Yamazaki
Chenyu Zhang
Kuan-Hsun Tu
Tsung-Wei Ke
Lei Ke
Yonatan Bisk
Katerina Fragkiadaki
154
3
0
27 Oct 2025
Dexbotic: Open-Source Vision-Language-Action Toolbox
Bin Xie
Erjin Zhou
Fan Jia
Hao Shi
Haoqiang Fan
...
Zhao Wu
Ziheng Zhang
Ziming Liu
Ziwei Yan
Z. Zhang
LM&Ro
VLM
204
3
0
27 Oct 2025
ACG: Action Coherence Guidance for Flow-based VLA models
Minho Park
Kinam Kim
J. Hyung
Hyojin Jang
Hoiyeong Jin
Jooyeol Yun
Hojoon Lee
Jaegul Choo
135
0
0
25 Oct 2025
Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Qixiu Li
Yu Deng
Yaobo Liang
L. Luo
Lei Zhou
...
Hao Chen
Lily Sun
Dong Chen
J. Yang
B. Guo
130
8
0
24 Oct 2025
Generalizable Hierarchical Skill Learning via Object-Centric Representation
Haibo Zhao
Yu Qi
Boce Hu
Yizhe Zhu
Ziyan Chen
...
Owen Howell
Haojie Huang
Robin Walters
Dian Wang
Robert Platt
151
0
0
24 Oct 2025
PointMapPolicy: Structured Point Cloud Processing for Multi-Modal Imitation Learning
Xiaogang Jia
Qian Wang
Anrui Wang
Han A. Wang
B. Gyenes
...
Xi Huang
Maximilian Beck
Moritz Reuss
Rudolf Lioutikov
Gerhard Neumann
3DPC
213
1
0
23 Oct 2025
Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning
Kevin Huang
Rosario Scalise
Cleah Winston
Ayush Agrawal
Yunchu Zhang
...
Byron Boots
Benjamin Burchfiel
Hongkai Dai
Masha Itkina
Paarth Shah
OffRL
301
0
0
22 Oct 2025
Semantic World Models
Jacob Berg
Chuning Zhu
Yanda Bao
Ishan Durugkar
Abhishek Gupta
LM&Ro
VGen
150
1
0
22 Oct 2025
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
GigaBrain Team
Angen Ye
Boyuan Wang
Chaojun Ni
Guan Huang
...
Yukun Zhou
Z. Dong
Z. J. Wang
Zhichao Liu
Zheng Hua Zhu
LM&Ro
VLM
458
1
0
22 Oct 2025
Seeing Across Views: Benchmarking Spatial Reasoning of Vision-Language Models in Robotic Scenes
Zhiyuan Feng
Zhaolu Kang
Qijie Wang
Zhiying Du
Jiongrui Yan
...
Shawn Chen
Sicheng Xu
Yaobo Liang
Jiaolong Yang
B. Guo
160
1
0
22 Oct 2025
EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval
Zebin Yang
Sunjian Zheng
Tong Xie
Tianshi Xu
Bo Yu
Fan Wang
Jie Tang
Shaoshan Liu
Meng Li
127
0
0
21 Oct 2025
MoTVLA: A Vision-Language-Action Model with Unified Fast-Slow Reasoning
Wenhui Huang
Changhe Chen
Han Qi
Chen Lv
Yilun Du
Heng Yang
LM&Ro
LRM
360
2
0
21 Oct 2025
A Compositional Paradigm for Foundation Models: Towards Smarter Robotic Agents
Luigi Quarantiello
Elia Piccoli
Jack Bell
Malio Li
Giacomo Carfì
...
Gerlando Gramaglia
Lanpei Li
Mauro Madeddu
Irene Testa
Vincenzo Lomonaco
LM&Ro
141
0
0
21 Oct 2025
From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors
Z. Zhang
Hao Li
Yalun Dai
Zhengbang Zhu
Lei Zhou
...
S. Chen
Ziwei Liu
Y. Liu
Xinghang Li
Pan Zhou
109
2
0
20 Oct 2025
RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation
Yuquan Xue
Guanxing Lu
Zhenyu Wu
Chuanrui Zhang
Bofang Jia
Zhengyi Gu
Yansong Tang
Ziwei Wang
216
0
0
20 Oct 2025
Efficient Vision-Language-Action Models for Embodied Manipulation: A Systematic Survey
Weifan Guan
Qinghao Hu
Aosheng Li
Jian Cheng
LM&Ro
371
9
0
20 Oct 2025
RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies
Adina Yakefu
Bin Xie
C. Xu
Enwen Zhang
Erjin Zhou
...
Z. Chen
Zhengyuan Du
Ziheng Zhang
Ziming Liu
Ziwei Yan
OffRL
101
1
0
20 Oct 2025
Learning to play: A Multimodal Agent for 3D Game-Play
Yuguang Yue
Irakli Salia
Samuel Hunt
Christopher Green
Wenzhe Shi
Jonathan J. Hunt
DiffM
LM&Ro
155
0
0
19 Oct 2025
Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models
Chenrui Tie
Shengxiang Sun
Yudi Lin
Yanbo Wang
Zhongrui Li
...
Yiman Pang
Haonan Chen
Junting Chen
Ruihai Wu
Lin Shao
115
1
0
18 Oct 2025
MoS-VLA: A Vision-Language-Action Model with One-Shot Skill Adaptation
Ruihan Zhao
Tyler Ingebrand
Sandeep Chinchali
Ufuk Topcu
VLM
112
0
0
18 Oct 2025
DexCanvas: Bridging Human Demonstrations and Robot Learning for Dexterous Manipulation
Xinyue Xu
Jieqiang Sun
Jing
Siyuan Chen
Lanjie Ma
...
Bin Zhao
Jianbo Yuan
Sheng Yi
Haohua Zhu
Yiwen Lu
193
1
0
17 Oct 2025
GOPLA: Generalizable Object Placement Learning via Synthetic Augmentation of Human Arrangement
Yao Zhong
Hanzhi Chen
Simon Schaefer
Anran Zhang
Stefan Leutenegger
268
0
0
16 Oct 2025
RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks
Mingxuan Yan
Yuping Wang
Zechun Liu
Jiachen Li
150
1
0
16 Oct 2025
VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation
Han Zhao
Jiaxuan Zhang
Wenxuan Song
Pengxiang Ding
Donglin Wang
122
2
0
16 Oct 2025
RM-RL: Role-Model Reinforcement Learning for Precise Robot Manipulation
Xiangyu Chen
Chuhao Zhou
Yuxi Liu
Jianfei Yang
OffRL
152
0
0
16 Oct 2025
From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails
Ravi Pandya
Madison Bland
D. Nguyen
Changliu Liu
J. F. Fisac
Andrea V. Bajcsy
143
1
0
15 Oct 2025
LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models
Senyu Fei
Siyin Wang
Junhao Shi
Zihao Dai
Jikun Cai
...
Shiduo Zhang
Zhaoye Fei
Jinlan Fu
Jingjing Gong
Xipeng Qiu
AAML
228
11
0
15 Oct 2025
Reasoning in Space via Grounding in the World
Yiming Chen
Zekun Qi
Wenyao Zhang
Xin Jin
Li Zhang
Peidong Liu
LRM
188
3
0
15 Oct 2025
Dedelayed: Deleting remote inference delay via on-device correction
Dan G. Jacobellis
Mateen Ulhaq
Fabien Racapé
Hyomin Choi
N. Yadwadkar
179
0
0
15 Oct 2025
DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning
Tianyuan Yuan
Yicheng Liu
Chenhao Lu
Zhuoguang Chen
Tao Jiang
Hang Zhao
VLM
142
1
0
15 Oct 2025
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
Xinyi Chen
Yilun Chen
Y. Fu
Ning Gao
Jiaya Jia
...
Jinyu Zhang
Shi Zhang
Feng Zheng
Bowen Zhou
Y. Zhu
LM&Ro
LRM
178
10
0
15 Oct 2025
Learning to Grasp Anything by Playing with Random Toys
Dantong Niu
Yuvan Sharma
Baifeng Shi
Rachel Ding
Matteo Gioia
...
Anirudh Pai
Shankar Shastry
Trevor Darrell
Jitendra Malik
Roei Herzig
173
0
0
14 Oct 2025
Reflection-Based Task Adaptation for Self-Improving VLA
Baicheng Li
Dong Wu
Zike Yan
Xinchen Liu
Zecui Zeng
Lusong Li
Hongbin Zha
142
1
0
14 Oct 2025
Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking
Junhyuk So
Chiwoong Lee
Shinyoung Lee
Jungseul Ok
Eunhyeok Park
AI4CE
154
0
0
14 Oct 2025
EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making
Zixing Lei
Sheng Yin
Yichen Xiong
Yuanzhuo Ding
Wenhao Huang
...
Qingyao Xu
Yiming Li
Weixin Li
Yunhong Wang
Siheng Chen
LM&Ro
AI4CE
146
1
0
14 Oct 2025
A Survey on Agentic Multimodal Large Language Models
Huanjin Yao
Ruifei Zhang
Jiaxing Huang
Jingyi Zhang
Yibo Wang
...
Ruolin Zhu
Yongcheng Jing
Shunyu Liu
Guanbin Li
Dacheng Tao
LM&Ro
AIFin
AI4TS
LRM
AI4CE
250
6
0
13 Oct 2025
ManiAgent: An Agentic Framework for General Robotic Manipulation
Yi Yang
Kefan Gu
Yuqing Wen
Hebei Li
Yucheng Zhao
Tiancai Wang
Xudong Liu
LM&Ro
231
0
0
13 Oct 2025
HiMaCon: Discovering Hierarchical Manipulation Concepts from Unlabeled Multi-Modal Data
Ruizhe Liu
Pei Zhou
Qian Luo
Li Sun
Jun Cen
Yibing Song
Yanchao Yang
SSL
406
0
0
13 Oct 2025
Previous
1
2
3
4
5
6
...
13
14
15
Next
Page 3 of 15
Page
of 15
Go