Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1901.03035
Cited By
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
10 January 2019
Chih-Yao Ma
Jiasen Lu
Zuxuan Wu
G. Al-Regib
Z. Kira
R. Socher
Caiming Xiong
LM&Ro
Re-assign community
ArXiv (abs)
PDF
HTML
Github (122★)
Papers citing
"Self-Monitoring Navigation Agent via Auxiliary Progress Estimation"
50 / 202 papers shown
MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots
Ting Huang
Dongjian Li
Rui Yang
Zeyu Zhang
Zida Yang
Hao Tang
LRM
125
3
0
22 Nov 2025
TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making
Shanshan Li
Da Huang
Yu He
Yanwei Fu
Yu-Gang Jiang
Xiangyang Xue
238
0
0
21 Nov 2025
STRIDER: Navigation via Instruction-Aligned Structural Decision Space Optimization
Diqi He
Xuehao Gao
Hao Li
Junwei Han
Dingwen Zhang
150
1
0
27 Oct 2025
Embodied Navigation with Auxiliary Task of Action Description Prediction
Haru Kondoh
Asako Kanezaki
148
0
0
21 Oct 2025
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Peiran Xu
Xicheng Gong
Yadong Mu
141
0
0
18 Oct 2025
AdaNav: Adaptive Reasoning with Uncertainty for Vision-Language Navigation
X. Ding
Jianyu Wei
Yifan Yang
Shiqi Jiang
Qianxi Zhang
...
Yuxuan Yan
Weijun Wang
Yunxin Liu
Zhibo Chen
Ting Cao
LRM
168
0
0
29 Sep 2025
DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation
Haoxiang Shi
Xiang Deng
Zaijing Li
Gongwei Chen
Yaowei Wang
Liqiang Nie
83
0
0
13 Aug 2025
Real-Time Progress Prediction in Reasoning Language Models
Hans Peter Lynsgøe Raaschou-jensen
Constanza Fierro
Anders Søgaard
LRM
217
0
0
29 Jun 2025
Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation
Ruoyu Wang
Tong Yu
Junda Wu
Yao Liu
Julian McAuley
Lina Yao
208
2
0
18 Jun 2025
Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations
Yibo Cui
Liang Xie
Yu Zhao
Jiawei Sun
Erwei Yin
175
2
0
10 Jun 2025
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation
P. Zhang
Yifei Su
Pengyuan Wu
Dong An
Li Zhang
Zhigang Wang
Dong Wang
Yan Ding
Jiangwei Zhong
Xuelong Li
LM&Ro
396
4
0
27 May 2025
FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language Models
Hengxing Cai
Jinhan Dong
Jingjun Tan
Jingcheng Deng
Changhao Nai
Zhifeng Gao
Haidong Wang
Zicheng Su
Agachai Sumalee
Renxin Zhong
255
5
0
19 May 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Information Fusion (Inf. Fusion), 2025
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
439
37
0
03 Apr 2025
Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation
Ziming Wei
Bingqian Lin
Yunshuang Nie
Jiaqi Chen
Shikui Ma
Hang Xu
Xiaodan Liang
488
3
0
23 Mar 2025
HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environments with Dynamic Multi-Human Interactions
Yifei Dong
Fengyi Wu
Qi He
Heng Li
Heng Li
...
Yuxuan Zhou
Yuxuan Zhou
Jingdong Sun
Zhi-Qi Cheng
Alexander G. Hauptmann
LM&Ro
295
1
0
18 Mar 2025
PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation
Neural Networks (NN), 2025
Sen Wang
Dongliang Zhou
Liang Xie
Chao Xu
Ye Yan
Erwei Yin
DiffM
330
7
0
13 Mar 2025
Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
IEEE International Conference on Robotics and Automation (ICRA), 2025
Zerui Li
Gengze Zhou
Haodong Hong
Yanyan Shao
Wenqi Lyu
Yanyuan Qiao
Qi Wu
302
5
0
26 Feb 2025
OpenFly: A Comprehensive Platform for Aerial Vision-Language Navigation
Yunpeng Gao
Xuefei Liu
Zhongrui You
Jing Liu
Zhen Li
...
Yan Ding
Dong Wang
Liang Luo
Jiangwei Zhong
Xuelong Li
487
4
0
25 Feb 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
382
65
0
31 Dec 2024
Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments
Sangmim Song
S. Kodagoda
A. Gunatilake
Marc G. Carmichael
Karthick Thiyagarajan
Jodi Martin
LM&Ro
378
4
0
28 Oct 2024
Vision-Language Navigation with Energy-Based Policy
Neural Information Processing Systems (NeurIPS), 2024
Rui Liu
Wenguan Wang
Yue Yang
229
18
0
18 Oct 2024
SYNERGAI: Perception Alignment for Human-Robot Collaboration
IEEE International Conference on Robotics and Automation (ICRA), 2024
Yixin Chen
Guoxi Zhang
Yaowei Zhang
Hongming Xu
Peiyuan Zhi
Qing Li
Siyuan Huang
190
0
0
24 Sep 2024
StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2024
Muraleekrishna Gopinathan
Jumana Abu-Khalaf
David Suter
Martin Masek
223
0
0
09 Sep 2024
UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
LM&Ro
265
2
0
08 Aug 2024
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou
Yicong Hong
Zun Wang
Xin Eric Wang
Qi Wu
LM&Ro
312
74
0
17 Jul 2024
PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation
Renjie Lu
Jingke Meng
Wei-Shi Zheng
222
6
0
16 Jul 2024
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation
Jiaqi Chen
Bingqian Lin
Xinmin Liu
Lin Ma
Xiaodan Liang
Kwan-Yee K. Wong
LM&Ro
380
44
0
08 Jul 2024
Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions
Heng Li
Heng Li
Zhi-Qi Cheng
Yifei Dong
Yuxuan Zhou
Jun-Yan He
Jingdong Sun
Teruko Mitamura
Alexander G. Hauptmann
LM&Ro
268
21
0
27 Jun 2024
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi
Yicong Hong
Yuankai Qi
Qi Wu
Shirui Pan
Javen Qinfeng Shi
220
18
0
03 Jun 2024
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Bingqian Lin
Yunshuang Nie
Ziming Wei
Yi Zhu
Hang Xu
Shikui Ma
Jianzhuang Liu
Xiaodan Liang
LM&Ro
311
20
0
29 May 2024
Vision-and-Language Navigation Generative Pretrained Transformer
Hanlin Wen
LM&Ro
258
0
0
27 May 2024
MC-GPT: Empowering Vision-and-Language Navigation with Memory Map and Reasoning Chains
Zhaohuan Zhan
Lisha Yu
Sijie Yu
Guang Tan
LLMAG
LM&Ro
311
22
0
17 May 2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
GAN
207
5
0
15 Apr 2024
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning
International Conference on Language Resources and Evaluation (LREC), 2024
Mengfei Du
Binhao Wu
Jiwen Zhang
Zhihao Fan
Zejun Li
Ruipu Luo
Xuanjing Huang
Zhongyu Wei
169
5
0
02 Apr 2024
Scaling Vision-and-Language Navigation With Offline RL
Valay Bundele
Mahesh Bhupati
Biplab Banerjee
Aditya Grover
OffRL
183
1
0
27 Mar 2024
Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation
Bowen Huang
Yanwei Zheng
Chuanlin Lan
Xinpeng Zhao
Yifei Zou
Dongxiao Yu
303
0
0
23 Mar 2024
Continual Vision-and-Language Navigation
Seongjun Jeong
Gi-Cheon Kang
Seongho Choi
Joochan Kim
Byoung-Tak Zhang
429
4
0
22 Mar 2024
Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation
IEEE Robotics and Automation Letters (RA-L), 2024
Ming Xu
Zilong Xie
287
3
0
18 Mar 2024
Online Continual Learning For Interactive Instruction Following Agents
International Conference on Learning Representations (ICLR), 2024
Byeonghwi Kim
Minhyuk Seo
Jonghyun Choi
CLL
LM&Ro
318
21
0
12 Mar 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Bingqian Lin
Yunshuang Nie
Ziming Wei
Jiaqi Chen
Shikui Ma
Jianhua Han
Hang Xu
Xiaojun Chang
Xiaodan Liang
LM&Ro
LRM
368
78
0
12 Mar 2024
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Bingqian Lin
Yanxin Long
Yi Zhu
Fengda Zhu
Xiaodan Liang
QiXiang Ye
Liang Lin
234
7
0
09 Mar 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
JIazhao Zhang
Kunyu Wang
Rongtao Xu
Gengze Zhou
Yicong Hong
Xiaomeng Fang
Qi Wu
Dongbin Zhao
Wang He
LM&Ro
657
154
0
24 Feb 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Haiwei Yang
Ruyue Yuan
LM&Ro
357
8
0
22 Feb 2024
NavHint: Vision and Language Navigation Agent with a Hint Generator
Yue Zhang
Quan Guo
Parisa Kordjamshidi
LLMAG
287
12
0
04 Feb 2024
MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Jiaqi Chen
Bingqian Lin
Ran Xu
Zhenhua Chai
Xiaodan Liang
Kwan-Yee K. Wong
LM&Ro
LLMAG
271
73
0
14 Jan 2024
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
International Conference on Machine Learning (ICML), 2023
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
563
15
0
22 Nov 2023
LangNav: Language as a Perceptual Representation for Navigation
Bowen Pan
Yikang Shen
SouYoung Jin
Rogerio Feris
Aude Oliva
Phillip Isola
Yoon Kim
LM&Ro
309
40
0
11 Oct 2023
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Yibo Cui
Liang Xie
Yakun Zhang
Meishan Zhang
Ye Yan
Erwei Yin
LM&Ro
223
27
0
24 Aug 2023
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Yanyuan Qiao
Zheng Yu
Qi Wu
VLM
179
25
0
20 Aug 2023
March in Chat: Interactive Prompting for Remote Embodied Referring Expression
IEEE International Conference on Computer Vision (ICCV), 2023
Yanyuan Qiao
Yuankai Qi
Zheng Yu
Qingbin Liu
Qi Wu
LM&Ro
274
47
0
20 Aug 2023
1
2
3
4
5
Next