v1v2 (latest)

OpenVLA: An Open-Source Vision-Language-Action Model

13 June 2024

Quan Vuong

Dorsa Sadigh

Percy Liang

Chelsea Finn

LM&Ro

VLM

ArXiv (abs)PDF HTML HuggingFace (40 upvotes)

Papers citing "OpenVLA: An Open-Source Vision-Language-Action Model"

50 / 727 papers shown

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

219

31 Oct 2025

DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models

200

31 Oct 2025

Learning Generalizable Visuomotor Policy through Dynamics-Alignment

124

31 Oct 2025

Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model

232

31 Oct 2025

Towards a Multi-Embodied Grasping Agent

184

31 Oct 2025

A Step Toward World Models: A Survey on Robotic Manipulation

797

31 Oct 2025

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

...

770

29 Oct 2025

Language-Conditioned Representations and Mixture-of-Experts Policy for Robust Multi-Task Robotic Manipulation

175

28 Oct 2025

BLM$_1$: A Boundless Large Model for Cross-Space, Cross-Task, and Cross-Embodiment Learning

BLM

_1

: A Boundless Large Model for Cross-Space, Cross-Task, and Cross-Embodiment Learning

...

198

28 Oct 2025

Reliable Robotic Task Execution in the Face of Anomalies

129

27 Oct 2025

OmniDexGrasp: Generalizable Dexterous Grasping via Foundation Model and Force Feedback

113

27 Oct 2025

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

...

307

27 Oct 2025

UrbanVLA: A Vision-Language-Action Model for Urban Micromobility

154

27 Oct 2025

$RobotArena $\infty$: Scalable Robot Benchmarking via Real-to-Sim Translation$

RobotArena

\infty

: Scalable Robot Benchmarking via Real-to-Sim Translation

168

27 Oct 2025

Dexbotic: Open-Source Vision-Language-Action Toolbox

...

223

27 Oct 2025

ACG: Action Coherence Guidance for Flow-based VLA models

154

25 Oct 2025

Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

...

146

24 Oct 2025

Generalizable Hierarchical Skill Learning via Object-Centric Representation

...

151

24 Oct 2025

PointMapPolicy: Structured Point Cloud Processing for Multi-Modal Imitation Learning

...

242

23 Oct 2025

Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning

...

305

22 Oct 2025

Semantic World Models

155

22 Oct 2025

Seeing Across Views: Benchmarking Spatial Reasoning of Vision-Language Models in Robotic Scenes

...

172

22 Oct 2025

GigaBrain-0: A World Model-Powered Vision-Language-Action Model

...

494

22 Oct 2025

A Compositional Paradigm for Foundation Models: Towards Smarter Robotic Agents

...

141

21 Oct 2025

EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval

133

21 Oct 2025

MoTVLA: A Vision-Language-Action Model with Unified Fast-Slow Reasoning

369

21 Oct 2025

From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors

...

125

20 Oct 2025

RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation

224

20 Oct 2025

Efficient Vision-Language-Action Models for Embodied Manipulation: A Systematic Survey

383

20 Oct 2025

RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies

...

104

20 Oct 2025

Learning to play: A Multimodal Agent for 3D Game-Play

161

19 Oct 2025

Manual2Skill++: Connector-Aware General Robotic Assembly from Instruction Manuals via Vision-Language Models

...

122

18 Oct 2025

MoS-VLA: A Vision-Language-Action Model with One-Shot Skill Adaptation

118

18 Oct 2025

DexCanvas: Bridging Human Demonstrations and Robot Learning for Dexterous Manipulation

...

212

17 Oct 2025

GOPLA: Generalizable Object Placement Learning via Synthetic Augmentation of Human Arrangement

277

16 Oct 2025

RDD: Retrieval-Based Demonstration Decomposer for Planner Alignment in Long-Horizon Tasks

153

16 Oct 2025

VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation

134

16 Oct 2025

RM-RL: Role-Model Reinforcement Learning for Precise Robot Manipulation

190

16 Oct 2025

From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails

160

15 Oct 2025

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

...

243

15 Oct 2025

Reasoning in Space via Grounding in the World

209

15 Oct 2025

Dedelayed: Deleting remote inference delay via on-device correction

187

15 Oct 2025

DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning

153

15 Oct 2025

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

...

191

15 Oct 2025

Learning to Grasp Anything by Playing with Random Toys

...

176

14 Oct 2025

Reflection-Based Task Adaptation for Self-Improving VLA

169

14 Oct 2025

Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking

167

14 Oct 2025

EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making

...

163

14 Oct 2025

A Survey on Agentic Multimodal Large Language Models

...

LM&Ro AIFin AI4TS LRM AI4CE

269

13 Oct 2025

ManiAgent: An Agentic Framework for General Robotic Manipulation

240

13 Oct 2025