OpenVLA: An Open-Source Vision-Language-Action Model

13 June 2024

Quan Vuong

Dorsa Sadigh

Percy Liang

Chelsea Finn

LM&Ro

VLM

Papers citing "OpenVLA: An Open-Source Vision-Language-Action Model"

50 / 723 papers shown

Towards Fast, Memory-based and Data-Efficient Vision-Language Policy

335

13 Mar 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

...

633

100

13 Mar 2025

Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework

415

12 Mar 2025

Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter

IEEE Transactions on Automation Science and Engineering (T-ASE), 2025

506

12 Mar 2025

Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds

337

12 Mar 2025

EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments

Katherine Rose Driggs-Campbell

Gaoang Wang

LM&Ro

596

11 Mar 2025

MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models

IEEE International Conference on Robotics and Automation (ICRA), 2025

429

11 Mar 2025

TLA: Tactile-Language-Action Model for Contact-Rich Manipulation

321

11 Mar 2025

Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies

Chen Xu

Tony Nguyen

Emma Dixon

Christopher Rodriguez

569

11 Mar 2025

A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning

Computer Vision and Pattern Recognition (CVPR), 2025

503

10 Mar 2025

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

351

10 Mar 2025

iManip: Skill-Incremental Learning for Robotic Manipulation

268

10 Mar 2025

Towards Safe Robot Foundation Models

371

10 Mar 2025

System 0/1/2/3: Quad-process theory for multi-timescale embodied collective cognitive systems

357

08 Mar 2025

Object-Centric World Model for Language-Guided Manipulation

838

08 Mar 2025

BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities

384

07 Mar 2025

Refined Policy Distillation: From VLA Generalists to RL Experts

299

06 Mar 2025

SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning

537

05 Mar 2025

Generative Artificial Intelligence in Robotic Manipulation: A Survey

...

674

05 Mar 2025

AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons

...

529

05 Mar 2025

RaceVLA: VLA-based Racing Drone Navigation with Human-like Behaviour

Ali Alridha Abdulkarim

Oleg Sautenkov

Dzmitry Tsetserukou

372

04 Mar 2025

ArticuBot: Learning Universal Articulated Object Manipulation Policy via Large Scale Simulation

408

04 Mar 2025

UAV-VLPA*: A Vision-Language-Path-Action System for Optimal Route Generation on a Large Scales

Oleg Sautenkov

Aibek Akhmetkazy

Malaika Zafar

Muhammad Ahsan Mustafa

Grik Tadevosyan

Artem Lykov

Dzmitry Tsetserukou

343

04 Mar 2025

FLAME: A Federated Learning Benchmark for Robotic Manipulation

338

03 Mar 2025

Action Tokenizer Matters in In-Context Imitation Learning

471

03 Mar 2025

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete

Computer Vision and Pattern Recognition (CVPR), 2025

...

496

28 Feb 2025

Unified Video Action Model

691

28 Feb 2025

Physics-Driven Data Generation for Contact-Rich Manipulation via Trajectory Optimization

401

27 Feb 2025

Data-Efficient Multi-Agent Spatial Planning with LLMs

461

26 Feb 2025

Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models

...

494

111

26 Feb 2025

VaViM and VaVAM: Autonomous Driving through Video Generative Modeling

Florent Bartoccioni

Elias Ramzi

Victor Besnier

Shashanka Venkataramanan

...

328

24 Feb 2025

BOSS: Benchmark for Observation Space Shift in Long-Horizon Task

IEEE Robotics and Automation Letters (IEEE RA-L), 2025

260

24 Feb 2025

COMPASS: Cross-embodiment Mobility Policy via Residual RL and Skill Synthesis

295

22 Feb 2025

Towards Fusing Point Cloud and Visual Representations for Imitation Learning

304

20 Feb 2025

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

...

459

18 Feb 2025

Magma: A Foundation Model for Multimodal AI Agents

Computer Vision and Pattern Recognition (CVPR), 2025

...

371

18 Feb 2025

RHINO: Learning Real-Time Humanoid-Human-Object Interaction from Human Demonstrations

406

18 Feb 2025

Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization

520

18 Feb 2025

Efficient Evaluation of Multi-Task Robot Policies With Active Experiment Selection

455

14 Feb 2025

ImitDiff: Transferring Foundation-Model Priors for Distraction Robust Visuomotor Policy

IEEE Robotics and Automation Letters (IEEE RA-L), 2025

...

327

11 Feb 2025

Discovery of skill switching criteria for learning agile quadruped locomotion

246

10 Feb 2025

DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control

505

113

09 Feb 2025

Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following

503

08 Feb 2025

ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy

697

08 Feb 2025

HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation

International Conference on Learning Representations (ICLR), 2025

...

756

08 Feb 2025

Large Language Models for Multi-Robot Systems: A Survey

532

06 Feb 2025

AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

996

04 Feb 2025

...

Phillip J. K. Christoffersen

A. Pinar Ozisik

Rakshit Trivedi

Dylan Hadfield-Menell

Noam Kolt

463

03 Feb 2025

Scalable, Training-Free Visual Language Robotics: A Modular Multi-Model Framework for Consumer-Grade GPUs

IEEE/SICE International Symposium on System Integration (SII), 2025

Marie Samson

Bastien Muraccioli

Fumio Kanehiro

517

03 Feb 2025

Strengthening Generative Robot Policies through Predictive World Modeling

637

02 Feb 2025