ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.07775
  4. Cited By
Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs
  and Topological Graphs

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

10 July 2024
Hao-Tien Lewis Chiang
Zhuo Xu
Zipeng Fu
M. Jacob
Tingnan Zhang
T. Lee
Wenhao Yu
Connor Schenck
David Rendleman
Dhruv Shah
Fei Xia
Jasmine Hsu
Jonathan Hoech
Pete Florence
Sean Kirmani
Sumeet Singh
Vikas Sindhwani
Carolina Parada
Chelsea Finn
Peng Xu
Sergey Levine
Jie Tan
    LM&Ro
ArXivPDFHTML

Papers citing "Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs"

7 / 7 papers shown
Title
Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-Models
Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-Models
Chen Wang
Fei Xia
Wenhao Yu
Tingnan Zhang
Ruohan Zhang
Ce Liu
Li Fei-Fei
Jie Tan
Jacky Liang
31
0
0
17 Apr 2025
MotionGlot: A Multi-Embodied Motion Generation Model
MotionGlot: A Multi-Embodied Motion Generation Model
Sudarshan Harithas
Srinath Sridhar
68
1
0
22 Oct 2024
SPINE: Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments
SPINE: Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments
Zachary Ravichandran
Varun Murali
Mariliza Tzes
George J. Pappas
Vijay Kumar
LRM
51
6
0
03 Oct 2024
KARMA: Augmenting Embodied AI Agents with Long-and-short Term Memory Systems
KARMA: Augmenting Embodied AI Agents with Long-and-short Term Memory Systems
Zixuan Wang
Bo Yu
Junzhe Zhao
Wenhao Sun
Sai Hou
Shuai Liang
Xing Hu
Yinhe Han
Yiming Gan
37
1
0
23 Sep 2024
Open-vocabulary Queryable Scene Representations for Real World Planning
Open-vocabulary Queryable Scene Representations for Real World Planning
Boyuan Chen
F. Xia
Brian Ichter
Kanishka Rao
K. Gopalakrishnan
Michael S. Ryoo
Austin Stone
Daniel Kappler
LM&Ro
144
179
0
20 Sep 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language,
  Vision, and Action
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
136
430
0
10 Jul 2022
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Arjun Majumdar
Gunjan Aggarwal
Bhavika Devnani
Judy Hoffman
Dhruv Batra
LM&Ro
144
148
0
24 Jun 2022
1