Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing

12 July 2024

Papers citing "Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing"

Title
L3MVN: Leveraging Large Language Models for Visual Target Navigation | Bangguo Yu, H. Kasaei, M. Cao | LM&Ro | 49 83 0 | 11 Apr 2023
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas Müller, Alex Evans, D. Fox, Jan Kautz, Stan Birchfield | 3DH | 73 125 0 | 24 Mar 2023
Visual Language Maps for Robot Navigation | Chen Huang, Oier Mees, Andy Zeng, Wolfram Burgard | LM&Ro | 145 337 0 | 11 Oct 2022
Open-vocabulary Queryable Scene Representations for Real World Planning | Boyuan Chen, F. Xia, Brian Ichter, Kanishka Rao, K. Gopalakrishnan, Michael S. Ryoo, Austin Stone, Daniel Kappler | LM&Ro | 144 179 0 | 20 Sep 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action | Dhruv Shah, B. Osinski, Brian Ichter, Sergey Levine | LM&Ro | 139 430 0 | 10 Jul 2022
ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings | Arjun Majumdar, Gunjan Aggarwal, Bhavika Devnani, Judy Hoffman, Dhruv Batra | LM&Ro | 147 148 0 | 24 Jun 2022
CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models | Yuan Yao, Ao Zhang, Zhengyan Zhang, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun | MLLM, VPVLM, VLM | 194 218 0 | 24 Sep 2021
Open-vocabulary Object Detection via Vision and Language Knowledge Distillation | Xiuye Gu, Tsung-Yi Lin, Weicheng Kuo, Yin Cui | VLM, ObjD | 223 897 0 | 28 Apr 2021