Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2406.09246
Cited By
v1
v2 (latest)
OpenVLA: An Open-Source Vision-Language-Action Model
13 June 2024
Moo Jin Kim
Karl Pertsch
Siddharth Karamcheti
Ted Xiao
Ashwin Balakrishna
Suraj Nair
Rafael Rafailov
Ethan P. Foster
Grace Lam
Pannag R Sanketi
Quan Vuong
Thomas Kollar
Benjamin Burchfiel
Russ Tedrake
Dorsa Sadigh
Sergey Levine
Percy Liang
Chelsea Finn
LM&Ro
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (40 upvotes)
Papers citing
"OpenVLA: An Open-Source Vision-Language-Action Model"
50 / 723 papers shown
Towards Fast, Memory-based and Data-Efficient Vision-Language Policy
Haoxuan Li
Sixu Yan
Yongqian Li
Xinggang Wang
LM&Ro
335
2
0
13 Mar 2025
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
Jiaming Liu
Hao Chen
Pengju An
Zhuoyang Liu
Renrui Zhang
...
Chengkai Hou
Mengdi Zhao
KC alex Zhou
Pheng-Ann Heng
Shanghang Zhang
633
100
0
13 Mar 2025
Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework
Jian-Jian Jiang
Xiao-Ming Wu
Yi-Xiang He
Ling-an Zeng
Yi-Lin Wei
Dandan Zhang
Wei-Shi Zheng
415
6
0
12 Mar 2025
Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter
IEEE Transactions on Automation Science and Engineering (T-ASE), 2025
Kechun Xu
Xunlong Xia
Kaixuan Wang
Yifei Yang
Yunxuan Mao
Bing Deng
R. Xiong
Longji Xu
Yue Wang
OffRL
506
2
0
12 Mar 2025
Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds
Dikai Liu
Tianwei Zhang
Jianxiong Yin
Simon See
OffRL
337
1
0
12 Mar 2025
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
Dongping Li
Tielong Cai
Tianci Tang
Wenhao Chai
Katherine Rose Driggs-Campbell
Gaoang Wang
LM&Ro
596
2
0
11 Mar 2025
MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models
IEEE International Conference on Robotics and Automation (ICRA), 2025
Han Zhao
Wenxuan Song
Donglin Wang
Xinyang Tong
Pengxiang Ding
Xuelian Cheng
Zongyuan Ge
429
22
0
11 Mar 2025
TLA: Tactile-Language-Action Model for Contact-Rich Manipulation
Peng Hao
Chaofan Zhang
Dingzhe Li
Xiaoge Cao
Xiaoshuai Hao
Shaowei Cui
Shuo Wang
LM&Ro
321
31
0
11 Mar 2025
Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies
Chen Xu
Tony Nguyen
Emma Dixon
Christopher Rodriguez
Patrick Miller
Robert Lee
Paarth Shah
Rares Andrei Ambrus
Haruki Nishimura
Masha Itkina
OffRL
569
13
0
11 Mar 2025
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
Computer Vision and Pattern Recognition (CVPR), 2025
Xin Wen
Bingchen Zhao
Yilun Chen
Jiangmiao Pang
Xiaojuan Qi
LM&Ro
503
4
0
10 Mar 2025
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning
Bo Jiang
Shaoyu Chen
Qian Zhang
Wenyu Liu
Xinggang Wang
OffRL
LRM
VLM
351
48
0
10 Mar 2025
iManip: Skill-Incremental Learning for Robotic Manipulation
Zexin Zheng
Jia-Feng Cai
Xiao-Ming Wu
Yi-Lin Wei
Yu-Ming Tang
Wei-Shi Zheng
CLL
268
4
0
10 Mar 2025
Towards Safe Robot Foundation Models
Maximilian Tölle
Theo Gruner
Daniel Palenicek
Jonas Günster
Puze Liu
Joe Watson
Davide Tateo
Jan Peters
OffRL
371
0
0
10 Mar 2025
System 0/1/2/3: Quad-process theory for multi-timescale embodied collective cognitive systems
Tadahiro Taniguchi
Yasushi Hirai
Masahiro Suzuki
Shingo Murata
Takato Horii
Kazutoshi Tanaka
AI4CE
357
4
0
08 Mar 2025
Object-Centric World Model for Language-Guided Manipulation
Youngjoon Jeong
Junha Chun
S. Cha
Taesup Kim
OCL
VGen
838
8
0
08 Mar 2025
BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities
Yunfan Jiang
Ruohan Zhang
J. Wong
Chen Wang
Yanjie Ze
Hang Yin
Cem Gokmen
Shuran Song
Jiajun Wu
L. Fei-Fei
384
29
0
07 Mar 2025
Refined Policy Distillation: From VLA Generalists to RL Experts
Tobias Jülg
Wolfram Burgard
Florian Walter
OffRL
299
12
0
06 Mar 2025
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning
Borong Zhang
Yuhao Zhang
Yalan Qin
Yingshan Lei
Josef Dai
Yuanpei Chen
Yaodong Yang
537
4
0
05 Mar 2025
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Kun Zhang
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
674
15
0
05 Mar 2025
AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons
Hongjie Fang
Chenxi Wang
Yiming Wang
J. Chen
Shangning Xia
...
Xinyu Zhan
Lixin Yang
Weiming Wang
Cewu Lu
Hao-Shu Fang
529
15
0
05 Mar 2025
RaceVLA: VLA-based Racing Drone Navigation with Human-like Behaviour
Valerii Serpiva
Artem Lykov
Artyom Myshlyaev
Muhammad Haris Khan
Ali Alridha Abdulkarim
Oleg Sautenkov
Dzmitry Tsetserukou
372
16
0
04 Mar 2025
ArticuBot: Learning Universal Articulated Object Manipulation Policy via Large Scale Simulation
Yufei Wang
Ziyu Wang
Mino Nakura
Pratik Bhowal
Chia-Liang Kuo
Yi-Ting Chen
Zackory M. Erickson
David Held
408
3
0
04 Mar 2025
UAV-VLPA*: A Vision-Language-Path-Action System for Optimal Route Generation on a Large Scales
Oleg Sautenkov
Aibek Akhmetkazy
Malaika Zafar
Muhammad Ahsan Mustafa
Grik Tadevosyan
Artem Lykov
Dzmitry Tsetserukou
343
2
0
04 Mar 2025
FLAME: A Federated Learning Benchmark for Robotic Manipulation
Santiago Bou Betran
Alberta Longhini
Miguel Vasco
Yuchong Zhang
Jens Lundell
338
2
0
03 Mar 2025
Action Tokenizer Matters in In-Context Imitation Learning
An Vuong
M. Vu
Dong An
Ian Reid
471
3
0
03 Mar 2025
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete
Computer Vision and Pattern Recognition (CVPR), 2025
Yuheng Ji
Huajie Tan
Jiayu Shi
Xiaoshuai Hao
Yuan Zhang
...
Huaihai Lyu
Xiaolong Zheng
Jiaming Liu
Zhongyuan Wang
Shanghang Zhang
496
88
0
28 Feb 2025
Unified Video Action Model
Shuang Li
Yihuai Gao
Dorsa Sadigh
Shuran Song
VGen
691
67
0
28 Feb 2025
Physics-Driven Data Generation for Contact-Rich Manipulation via Trajectory Optimization
Lujie Yang
H.J. Terry Suh
Tong Zhao
B. P. Graesdal
Tarik Kelestemur
Jiuguang Wang
Tao Pang
Russ Tedrake
401
17
0
27 Feb 2025
Data-Efficient Multi-Agent Spatial Planning with LLMs
Huangyuan Su
Aaron Walsman
Daniel Garces
Sham Kakade
Stephanie Gil
LLMAG
461
1
0
26 Feb 2025
Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models
Lucy Xiaoyang Shi
Brian Ichter
Michael Equi
Liyiming Ke
Karl Pertsch
...
Adrian Li-Bell
Danny Driess
Lachy Groom
Sergey Levine
Chelsea Finn
LM&Ro
LRM
494
111
0
26 Feb 2025
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling
Florent Bartoccioni
Elias Ramzi
Victor Besnier
Shashanka Venkataramanan
Tuan-Hung Vu
...
Mickael Chen
Éloi Zablocki
Andrei Bursuc
Eduardo Valle
Matthieu Cord
VGen
328
14
0
24 Feb 2025
BOSS: Benchmark for Observation Space Shift in Long-Horizon Task
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Yue Yang
Linfeng Zhao
Mingyu Ding
Gedas Bertasius
D. Szafir
260
2
0
24 Feb 2025
COMPASS: Cross-embodiment Mobility Policy via Residual RL and Skill Synthesis
Wei Liu
Huihua Zhao
Chenran Li
Joydeep Biswas
Joydeep Biswas
Soha Pouya
Yan Chang
295
3
0
22 Feb 2025
Towards Fusing Point Cloud and Visual Representations for Imitation Learning
Atalay Donat
Xiaogang Jia
Xi Huang
Aleksandar Taranovic
Denis Blessing
Ge Li
Hongyi Zhou
Hanyi Zhang
Rudolf Lioutikov
Gerhard Neumann
3DPC
SSL
304
7
0
20 Feb 2025
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
Zekun Qi
Wenyao Zhang
Yufei Ding
Runpei Dong
Xinqiang Yu
...
Xin Jin
Kaisheng Ma
Zhizheng Zhang
He Wang
Li Yi
LM&Ro
459
15
0
18 Feb 2025
Magma: A Foundation Model for Multimodal AI Agents
Computer Vision and Pattern Recognition (CVPR), 2025
Jianwei Yang
Reuben Tan
Qianhui Wu
Ruijie Zheng
Baolin Peng
...
Seonghyeon Ye
Joel Jang
Yuquan Deng
Lars Liden
Jianfeng Gao
VLM
AI4TS
371
95
0
18 Feb 2025
RHINO: Learning Real-Time Humanoid-Human-Object Interaction from Human Demonstrations
Jingxiao Chen
Xinyao Li
Jiahang Cao
Zhengbang Zhu
Wentao Dong
Minghuan Liu
Ying Wen
Yong Yu
Li Zhang
Weinan Zhang
406
3
0
18 Feb 2025
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization
Shuo Xing
Peiran Li
Peiran Li
Ruizheng Bai
Longji Xu
Chan-wei Hu
Chengxuan Qian
Huaxiu Yao
Zhengzhong Tu
520
20
0
18 Feb 2025
Efficient Evaluation of Multi-Task Robot Policies With Active Experiment Selection
Abrar Anwar
Rohan Gupta
Zain Merchant
Sayan Ghosh
Willie Neiswanger
Jesse Thomason
OffRL
455
3
0
14 Feb 2025
ImitDiff: Transferring Foundation-Model Priors for Distraction Robust Visuomotor Policy
IEEE Robotics and Automation Letters (IEEE RA-L), 2025
Yuhang Dong
Haizhou Ge
Yupei Zeng
Jing Zhang
Beiwen Tian
...
Ruixiang Wang
Ruixiang Wang
Ran Yi
Longhua Ma
Longhua Ma
327
1
0
11 Feb 2025
Discovery of skill switching criteria for learning agile quadruped locomotion
Wanming Yu
Fernando Acero
Vassil Atanassov
Chuanyu Yang
Ioannis Havoutis
Dimitrios Kanoulas
Zhibin Li
246
2
0
10 Feb 2025
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control
Junjie Wen
Yinlin Zhu
Jinming Li
Zhibin Tang
Yaxin Peng
Feifei Feng
VLM
505
113
0
09 Feb 2025
Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following
Vivek Myers
Bill Chunyuan Zheng
Anca Dragan
Kuan Fang
Sergey Levine
503
6
0
08 Feb 2025
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
Yuhui Chen
Shuai Tian
Shugao Liu
Yingting Zhou
Haoran Li
Dongbin Zhao
OffRL
697
64
0
08 Feb 2025
HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation
International Conference on Learning Representations (ICLR), 2025
Yi Li
Yuquan Deng
Jing Zhang
Joel Jang
Marius Memme
...
Fabio Ramos
Dieter Fox
Anqi Li
Abhishek Gupta
Ankit Goyal
LM&Ro
756
68
0
08 Feb 2025
Large Language Models for Multi-Robot Systems: A Survey
Peihan Li
Zijian An
Shams Abrar
Lifeng Zhou
LRM
LM&Ro
532
31
0
06 Feb 2025
AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Hongxin Li
Jingfan Chen
Jingran Su
Yuntao Chen
Qing Li
Rundong Wang
996
9
0
04 Feb 2025
The AI Agent Index
Stephen Casper
Luke Bailey
Rosco Hunter
Carson Ezell
Emma Cabalé
...
Phillip J. K. Christoffersen
A. Pinar Ozisik
Rakshit Trivedi
Dylan Hadfield-Menell
Noam Kolt
463
21
0
03 Feb 2025
Scalable, Training-Free Visual Language Robotics: A Modular Multi-Model Framework for Consumer-Grade GPUs
IEEE/SICE International Symposium on System Integration (SII), 2025
Marie Samson
Bastien Muraccioli
Fumio Kanehiro
517
5
0
03 Feb 2025
Strengthening Generative Robot Policies through Predictive World Modeling
Han Qi
Haocheng Yin
Aris Zhu
Yilun Du
Heng Yang
637
18
0
02 Feb 2025
Previous
1
2
3
...
12
13
14
15
Next
Page 13 of 15
Page
of 15
Go