$A^2$Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models

15 August 2023
Peihao Chen
Xinyu Sun
Hongyan Zhi
Runhao Zeng
Thomas H. Li
Gaowen Liu
Mingkui Tan
Chuang Gan
    LLMAG
    LM&Ro

Papers citing "$A^2$Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models"

29 / 29 papers shown
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory
Weichen Zhang
Chen Gao
Shiquan Yu
Ruiying Peng
Baining Zhao
Qian Zhang
Jinqiang Cui
Xinlei Chen
Y. Li
LLMAG
LM&Ro
28
0
0
08 May 2025
Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
Junrong Yue
Y. Zhang
Chuan Qin
Bo Li
Xiaomin Lie
Xinlei Yu
Wenxin Zhang
Zhendong Zhao
43
0
0
23 Apr 2025
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Longteng Guo
Zhihua Wei
J. Liu
LM&Ro
69
0
0
18 Mar 2025
Perceiving, Reasoning, Adapting: A Dual-Layer Framework for VLM-Guided Precision Robotic Manipulation
Qingxuan Jia
Guoqin Tang
Zeyuan Huang
Zixuan Hao
Ning Ji
Shihang
Gang Chen
29
0
0
07 Mar 2025
Mobile Robot Navigation Using Hand-Drawn Maps: A Vision Language Model Approach
A. H. Tan
Angus Fung
Haitong Wang
G. Nejat
73
1
0
31 Jan 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Mohit Bansal
Parisa Kordjamshidi
LRM
51
17
0
31 Dec 2024
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences
Hongyan Zhi
Peihao Chen
Junyan Li
Shuailei Ma
Xinyu Sun
Tianhang Xiang
Yinjie Lei
Mingkui Tan
Chuang Gan
67
3
0
02 Dec 2024
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
Yanyuan Qiao
Wenqi Lyu
Hui Wang
Zixu Wang
Zerui Li
Yuan Zhang
Mingkui Tan
Qi Wu
LRM
34
2
0
27 Sep 2024
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language Navigation
Zehao Wang
Minye Wu
Yixin Cao
Yubo Ma
Meiqi Chen
Tinne Tuytelaars
23
1
0
25 Sep 2024
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
Jiaqi Chen
Bingqian Lin
Xinmin Liu
Lin Ma
Xiaodan Liang
Kwan-Yee Kenneth Wong
LM&Ro
44
7
0
08 Jul 2024
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
LM&Ro
19
5
0
14 Jun 2024
CoNav: A Benchmark for Human-Centered Collaborative Navigation
Changhao Li
Xinyu Sun
Peihao Chen
Jugang Fan
Zixu Wang
Yanxia Liu
Jinhui Zhu
Chuang Gan
Mingkui Tan
37
1
0
04 Jun 2024
Memory-Maze: Scenario Driven Benchmark and Visual Language Navigation Model for Guiding Blind People
Masaki Kuribayashi
Kohei Uehara
Allan Wang
Daisuke Sato
Simon Chu
Shigeo Morishima
25
0
0
11 May 2024
CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments
A. Sathyamoorthy
K. Weerakoon
Mohamed Bashir Elnoor
Anuj Zore
Brian Ichter
Fei Xia
Jie Tan
Wenhao Yu
Dinesh Manocha
LM&Ro
36
4
0
22 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
26
23
0
21 Mar 2024
Embedding Pose Graph, Enabling 3D Foundation Model Capabilities with a Compact Representation
Hugues Thomas
Jian Zhang
24
1
0
20 Mar 2024
Prioritized Semantic Learning for Zero-shot Instance Navigation
Xander Sun
Louis Lau
Hoyard Zhi
Ronghe Qiu
Junwei Liang
22
8
0
18 Mar 2024
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation
Dingbang Li
Wenzhou Chen
Xin Lin
LLMAG
LM&Ro
18
4
0
13 Mar 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Jiazhao Zhang
Kunyu Wang
Rongtao Xu
Gengze Zhou
Yicong Hong
Xiaomeng Fang
Qi Wu
Zhizheng Zhang
Wang He
LM&Ro
21
11
0
24 Feb 2024
Contrastive Vision-Language Alignment Makes Efficient Instruction Learner
Lizhao Liu
Xinyu Sun
Tianhang Xiang
Zhuangwei Zhuang
Liuren Yin
Mingkui Tan
VLM
9
2
0
29 Nov 2023
Advances in Embodied Navigation Using Large Language Models: A Survey
Jinzhou Lin
Han Gao
Xuxiang Feng
Rongtao Xu
Changwei Wang
Man Zhang
Li Guo
Shibiao Xu
LM&Ro
LLMAG
44
9
0
01 Nov 2023
Interactive Navigation in Environments with Traversable Obstacles Using Large Language and Vision-Language Models
Zhen Zhang
Anran Lin
Chun Wai Wong
X. Chu
Qi Dou
K. W. S. Au
LM&Ro
14
3
0
13 Oct 2023
FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation
Xinyu Sun
Peihao Chen
Jugang Fan
Thomas H. Li
Jian Chen
Mingkui Tan
19
12
0
11 Oct 2023
SurrealDriver: Designing Generative Driver Agent Simulation Framework in Urban Contexts based on Large Language Model
Ye Jin
Xiaoxi Shen
Huiling Peng
Xiaoan Liu
Jingli Qin
Jiayang Li
Jintao Xie
Peizhong Gao
Guyue Zhou
Jiangtao Gong
LLMAG
23
8
0
22 Sep 2023
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
136
430
0
10 Jul 2022
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Arjun Majumdar
Gunjan Aggarwal
Bhavika Devnani
Judy Hoffman
Dhruv Batra
LM&Ro
141
148
0
24 Jun 2022
Waypoint Models for Instruction-guided Navigation in Continuous Environments
Jacob Krantz
Aaron Gokaslan
Dhruv Batra
Stefan Lee
Oleksandr Maksymets
LM&Ro
123
76
0
05 Oct 2021
Audio-Visual Floorplan Reconstruction
Senthil Purushwalkam
S. V. A. Garí
V. Ithapu
Carl Schissler
Philip Robinson
Abhinav Gupta
Kristen Grauman
VGen
3DV
51
41
0
31 Dec 2020
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
237
444
0
07 Jun 2018