ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.09246
  4. Cited By
OpenVLA: An Open-Source Vision-Language-Action Model
v1v2 (latest)

OpenVLA: An Open-Source Vision-Language-Action Model

13 June 2024
Moo Jin Kim
Karl Pertsch
Siddharth Karamcheti
Ted Xiao
Ashwin Balakrishna
Suraj Nair
Rafael Rafailov
Ethan P. Foster
Grace Lam
Pannag R Sanketi
Quan Vuong
Thomas Kollar
Benjamin Burchfiel
Russ Tedrake
Dorsa Sadigh
Sergey Levine
Percy Liang
Chelsea Finn
    LM&RoVLM
ArXiv (abs)PDFHTMLHuggingFace (40 upvotes)

Papers citing "OpenVLA: An Open-Source Vision-Language-Action Model"

50 / 723 papers shown
Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review
Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review
Matthew Lisondra
B. Benhabib
G. Nejat
LM&Ro
285
3
0
26 May 2025
RFTF: Reinforcement Fine-tuning for Embodied Agents with Temporal Feedback
RFTF: Reinforcement Fine-tuning for Embodied Agents with Temporal Feedback
Junyang Shu
Zhiwei Lin
Yongtao Wang
296
12
0
26 May 2025
Benign-to-Toxic Jailbreaking: Inducing Harmful Responses from Harmless Prompts
Benign-to-Toxic Jailbreaking: Inducing Harmful Responses from Harmless Prompts
H. Kim
Minbeom Kim
Wonjun Lee
Kihyun Kim
Changick Kim
173
0
0
26 May 2025
RetroMotion: Retrocausal Motion Forecasting Models are Instructable
RetroMotion: Retrocausal Motion Forecasting Models are Instructable
Royden Wagner
Ömer Sahin Tas
Felix Hauser
Marlon Steiner
Dominik Strutz
Abhishek Vivekanandan
Carlos Fernandez
Christoph Stiller
310
0
0
26 May 2025
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving
Xueyi Liu
Zuodong Zhong
Yuxin Guo
Yun-Fu Liu
Zhiguo Su
...
Yinfeng Gao
Yupeng Zheng
Qiao Lin
Huiyong Chen
Dongbin Zhao
LRM
284
11
0
26 May 2025
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects
Yixin Cui
Haotian Lin
Shuo Yang
Yixiao Wang
Yanjun Huang
Hong Chen
LM&RoLRMELM
367
6
0
26 May 2025
What Can RL Bring to VLA Generalization? An Empirical Study
What Can RL Bring to VLA Generalization? An Empirical Study
Jijia Liu
Feng Gao
Bingwen Wei
Xinlei Chen
Qingmin Liao
Yi Wu
Xinlei Chen
Yu Wang
OffRL
905
37
0
26 May 2025
HAND Me the Data: Fast Robot Adaptation via Hand Path Retrieval
HAND Me the Data: Fast Robot Adaptation via Hand Path Retrieval
Matthew Hong
Anthony Liang
Kevin Kim
Harshitha Rajaprakash
Jesse Thomason
Erdem Bıyık
Jesse Zhang
523
5
0
26 May 2025
WorldEval: World Model as Real-World Robot Policies Evaluator
WorldEval: World Model as Real-World Robot Policies Evaluator
Yaxuan Li
Yichen Zhu
Junjie Wen
Chaomin Shen
Yi Xu
OffRLVGen
198
0
0
25 May 2025
ReFineVLA: Reasoning-Aware Teacher-Guided Transfer Fine-Tuning
ReFineVLA: Reasoning-Aware Teacher-Guided Transfer Fine-Tuning
Tuan V. Vo
T. Nguyen
Khang Nguyen
Duy Ho Minh Nguyen
Minh Nhat Vu
LRM
189
4
0
25 May 2025
VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning
VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning
Guanxing Lu
Wenkai Guo
Chubin Zhang
Yuheng Zhou
Haonan Jiang
Zifeng Gao
Yansong Tang
Ziwei Wang
OffRL
432
67
0
24 May 2025
Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance
Wenhao Wang
Jianheng Song
Chiming Liu
Jiayao Ma
Siyuan Feng
...
Modi Shi
Xindong He
Guanghui Ren
Yang Yang
Maoqing Yao
OffRL
246
6
0
24 May 2025
HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning
Chuhao Zhou
Jianfei Yang
VLM
490
0
0
23 May 2025
One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration
One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration
Jinbang Huang
Yixin Xiao
Zhanguang Zhang
Mark Coates
Jianye Hao
Yingxue Zhang
LM&RoLRM
466
0
0
23 May 2025
Bootstrapping Imitation Learning for Long-horizon Manipulation via Hierarchical Data Collection Space
Bootstrapping Imitation Learning for Long-horizon Manipulation via Hierarchical Data Collection Space
Jinrong Yang
Kexun Chen
Zhuoling Li
Shengkai Wu
Yong Zhao
...
Chaohui Shang
Meiyu Zhi
Linfeng Gao
Mingshan Sun
Hui Cheng
272
1
0
23 May 2025
ScanBot: Towards Intelligent Surface Scanning in Embodied Robotic Systems
Zhiling Chen
Yang Zhang
Fardin Jalil Piran
Qianyu Zhou
Jiong Tang
Farhad Imani
LM&Ro
256
0
0
22 May 2025
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Runsen Xu
Weiyao Wang
Hao Tang
Xingyu Chen
Xiaodong Wang
Fu-Jen Chu
Dahua Lin
Matt Feiszli
Kevin J. Liang
LRM
358
24
0
22 May 2025
SEM: Enhancing Spatial Understanding for Robust Robot Manipulation
SEM: Enhancing Spatial Understanding for Robust Robot Manipulation
Xuewu Lin
Tianwei Lin
Lichao Huang
Hongyu Xie
Yiwei Jin
Keyu Li
Zhizhong Su
307
4
0
22 May 2025
VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving
VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving
Yansong Qu
Zilin Huang
Zihao Sheng
Jiancong Chen
Sikai Chen
Samuel Labi
OffRL
248
3
0
22 May 2025
Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation
Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation
Yihang Li
Tianle Zhang
Xuelong Wei
Jiayi Li
Lin Zhao
Dongchi Huang
Zhirui Fang
Minhua Zheng
Wenjun Dai
Xiaodong He
321
0
0
21 May 2025
Robo-DM: Data Management For Large Robot Datasets
Robo-DM: Data Management For Large Robot DatasetsIEEE International Conference on Robotics and Automation (ICRA), 2025
Kaiyuan Chen
Letian Fu
David Huang
Yanxiang Zhang
Lawrence Yunliang Chen
...
Ashwin Balakrishna
Ted Xiao
Pannag R Sanketi
John Kubiatowicz
Ken Goldberg
198
0
0
21 May 2025
AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation
AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation
Meenal Parakh
Alexandre Kirchmeyer
Beining Han
Jia Deng
LM&Ro
443
0
0
21 May 2025
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
Jiaming Zhou
Ke Ye
Jiayi Liu
Teli Ma
Zifang Wang
Ronghe Qiu
Kun-Yu Lin
Zhilin Zhao
Junwei Liang
422
17
0
21 May 2025
Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control
Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control
Seongmin Park
Hyungmin Kim
Sangwoo kim
Wonseok Jeon
Juyoung Yang
Byeongwook Jeon
Yoonseon Oh
Jungwook Choi
439
1
0
21 May 2025
APEX: Empowering LLMs with Physics-Based Task Planning for Real-time Insight
APEX: Empowering LLMs with Physics-Based Task Planning for Real-time Insight
Wanjing Huang
Weixiang Yan
Zhen Zhang
Ambuj Singh
LRM
357
0
0
20 May 2025
GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation
GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation
Abhay Deshpande
Yuquan Deng
Arijit Ray
Jordi Salvador
Winson Han
Jiafei Duan
Kuo-Hao Zeng
Yuke Zhu
Ranjay Krishna
Rose Hendrix
459
6
0
19 May 2025
Policy Contrastive Decoding for Robotic Foundation Models
Policy Contrastive Decoding for Robotic Foundation Models
Shihan Wu
Ji Zhang
Xu Luo
Junlin Xie
Jingkuan Song
Heng Tao Shen
Lianli Gao
OffRL
859
4
0
19 May 2025
RoboFAC: A Comprehensive Framework for Robotic Failure Analysis and Correction
RoboFAC: A Comprehensive Framework for Robotic Failure Analysis and Correction
Weifeng Lu
Minghao Ye
Zewei Ye
Ruihan Tao
Shuo Yang
Bo Zhao
322
12
0
18 May 2025
OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning
OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning
Fanqi Lin
Ruiqian Nai
Yingdong Hu
Jiacheng You
Junming Zhao
Yang Gao
LRM
232
49
0
17 May 2025
Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions
Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions
Wei Zhao
Gongsheng Li
Zhefei Gong
Pengxiang Ding
Han Zhao
Donglin Wang
LM&Ro
265
9
0
16 May 2025
ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations
ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations
Jiahui Zhang
Yusen Luo
Abrar Anwar
Sumedh Anand Sontakke
Joseph J Lim
Jesse Thomason
Erdem Biyik
Jesse Zhang
OffRLLM&Ro
426
19
0
16 May 2025
Search-TTA: A Multimodal Test-Time Adaptation Framework for Visual Search in the Wild
Search-TTA: A Multimodal Test-Time Adaptation Framework for Visual Search in the Wild
Derek Ming Siang Tan
Shailesh
Boyang Liu
Alok Raj
Qi Xuan Ang
...
Tanishq Duhan
Jimmy Chiun
Yuhong Cao
Florian Shkurti
Guillaume Sartoretti
683
1
0
16 May 2025
Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning
Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning
Milan Ganai
Rohan Sinha
Christopher Agia
Daniel Morton
Marco Pavone
Marco Pavone
OffRLLRMAI4CE
464
5
0
15 May 2025
TransDiffuser: Diverse Trajectory Generation with Decorrelated Multi-modal Representation for End-to-end Autonomous Driving
TransDiffuser: Diverse Trajectory Generation with Decorrelated Multi-modal Representation for End-to-end Autonomous Driving
Xuefeng Jiang
Yuan Ma
Pengxiang Li
Leimeng Xu
Xin Wen
Kun Zhan
Zhongpu Xia
Fu Liu
Xianpeng Lang
Sheng Sun
DiffM
389
2
0
14 May 2025
ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation
ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation
Enyu Zhao
Vedant Raval
Hejia Zhang
Jiageng Mao
Zeyu Shangguan
Stefanos Nikolaidis
Yun Wang
Daniel Seita
LM&RoCoGe
376
13
0
14 May 2025
Mini Diffuser: Fast Multi-task Diffusion Policy Training Using Two-level Mini-batches
Mini Diffuser: Fast Multi-task Diffusion Policy Training Using Two-level Mini-batches
Yutong Hu
Pinhao Song
Kehan Wen
Renaud Detry
VLM
295
0
0
14 May 2025
VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation
VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation
Chaofan Zhang
Peng Hao
Xiaoge Cao
Xiaoshuai Hao
Shaowei Cui
Shuo Wang
294
25
0
14 May 2025
RT-Cache: Training-Free Retrieval for Real-Time Manipulation
RT-Cache: Training-Free Retrieval for Real-Time Manipulation
Owen Kwon
Abraham George
Alison Bartsch
A. Farimani
390
1
0
14 May 2025
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
Yifu Yuan
Haiqin Cui
Yibin Chen
Zibin Dong
Fei Ni
Longxin Kou
Jinyi Liu
Pengyi Li
Yan Zheng
Jianye Hao
487
14
0
13 May 2025
Training Strategies for Efficient Embodied Reasoning
Training Strategies for Efficient Embodied Reasoning
William Chen
Suneel Belkhale
Suvir Mirchandani
Oier Mees
Danny Driess
Karl Pertsch
Sergey Levine
OffRLLRM
434
28
0
13 May 2025
LaDi-WM: A Latent Diffusion-based World Model for Predictive Manipulation
LaDi-WM: A Latent Diffusion-based World Model for Predictive Manipulation
Yuhang Huang
JIazhao Zhang
SHilong Zou
Xinwang Liu
Ruizhen Hu
Kai Xu
539
7
0
13 May 2025
Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness
Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness
Reihaneh Mirjalili
Tobias Jülg
Florian Walter
Wolfram Burgard
414
5
0
13 May 2025
Pixel Motion as Universal Representation for Robot Control
Pixel Motion as Universal Representation for Robot Control
Kanchana Ranasinghe
Xiang Li
Cristina Mata
Cristina Mata
Michael S. Ryoo
Michael Ryoo
VGen
404
8
0
12 May 2025
DexWild: Dexterous Human Interactions for In-the-Wild Robot Policies
DexWild: Dexterous Human Interactions for In-the-Wild Robot PoliciesRobotics (RAS), 2025
Tony Tao
Mohan Kumar Srirama
Jason Jingzhou Liu
Kenneth Shaw
Deepak Pathak
223
15
0
12 May 2025
X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real
X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real
Prithwish Dan
Kushal Kedia
Angela Chao
Edward Weiyi Duan
Maximus Adrian Pace
Wei-Chiu Ma
Sanjiban Choudhury
673
9
0
11 May 2025
Efficient Robotic Policy Learning via Latent Space Backward Planning
Efficient Robotic Policy Learning via Latent Space Backward Planning
Dongxiu Liu
Haoyi Niu
Zhihao Wang
Jinliang Zheng
Yinan Zheng
Zhonghong Ou
Jianming Hu
Jianxiong Li
Xianyuan Zhan
316
5
0
11 May 2025
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions
UniVLA: Learning to Act Anywhere with Task-centric Latent ActionsRobotics (RAS), 2025
Qingwen Bu
Yanting Yang
Jisong Cai
Shenyuan Gao
Guanghui Ren
Maoqing Yao
Ping Luo
Hongyang Li
901
109
0
09 May 2025
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments
Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments
Pranav Guruprasad
Yangyue Wang
Sudipta Chowdhury
Harshvardhan Sikka
Paul Pu Liang
LM&RoVLM
1.1K
4
0
08 May 2025
Multi-agent Embodied AI: Advances and Future Directions
Multi-agent Embodied AI: Advances and Future Directions
Zhaohan Feng
Ruiqi Xue
Lei Yuan
Yang Yu
Ning Ding
M. Liu
Bingzhao Gao
Jian Sun
Xinhu Zheng
Gang Wang
AI4CE
550
24
0
08 May 2025
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant
Haibo Wang
Bo Feng
Zhengfeng Lai
Mingze Xu
Shiyu Li
Weifeng Ge
Afshin Dehghan
Meng Cao
Ping Huang
OffRL
623
7
0
08 May 2025
Previous
123...101112131415
Next
Page 11 of 15
Pageof 15