2407.09053
Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing
12 July 2024
Jun Zhu, Zihao Du, Haotian Xu, Fengbo Lan, Zilong Zheng, Bo Ma, Shengjie Wang, Tao Zhang
Papers citing "Navi2Gaze: Leveraging Foundation Models for Navigation and Target Gazing"
8 / 8 papers shown
Title
L3MVN: Leveraging Large Language Models for Visual Target Navigation
Bangguo Yu, H. Kasaei, M. Cao
LM&Ro
49, 83, 0
11 Apr 2023

BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
Bowen Wen, Jonathan Tremblay, Valts Blukis, Stephen Tyree, Thomas Müller, Alex Evans, D. Fox, Jan Kautz, Stan Birchfield
3DH
73, 125, 0
24 Mar 2023

Visual Language Maps for Robot Navigation
Chen Huang, Oier Mees, Andy Zeng, Wolfram Burgard
LM&Ro
145, 337, 0
11 Oct 2022

Open-vocabulary Queryable Scene Representations for Real World Planning
Boyuan Chen, F. Xia, Brian Ichter, Kanishka Rao, K. Gopalakrishnan, Michael S. Ryoo, Austin Stone, Daniel Kappler
LM&Ro
144, 179, 0
20 Sep 2022

LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Dhruv Shah, B. Osinski, Brian Ichter, Sergey Levine
LM&Ro
139, 430, 0
10 Jul 2022

ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Arjun Majumdar, Gunjan Aggarwal, Bhavika Devnani, Judy Hoffman, Dhruv Batra
LM&Ro
147, 148, 0
24 Jun 2022

CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models
Yuan Yao, Ao Zhang, Zhengyan Zhang, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun
MLLM, VPVLM, VLM
194, 218, 0
24 Sep 2021

Open-vocabulary Object Detection via Vision and Language Knowledge Distillation
Xiuye Gu, Tsung-Yi Lin, Weicheng Kuo, Yin Cui
VLM, ObjD
223, 897, 0
28 Apr 2021