Self-Monitoring Navigation Agent via Auxiliary Progress Estimation

10 January 2019
Chih-Yao Ma
Jiasen Lu
Zuxuan Wu
G. Al-Regib
Z. Kira
R. Socher
Caiming Xiong
    LM&Ro
arXiv: 1901.03035 | GitHub (122★)

Papers citing "Self-Monitoring Navigation Agent via Auxiliary Progress Estimation"

50 / 202 papers shown
Multi-Level Compositional Reasoning for Interactive Instruction Following
AAAI Conference on Artificial Intelligence (AAAI), 2023
Suvaansh Bhambri
Byeonghwi Kim
Jonghyun Choi
LM&Ro
268
14
0
18 Aug 2023
$A^2$Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models
Peihao Chen
Xinyu Sun
Hongyan Zhi
Runhao Zeng
Thomas H. Li
Gaowen Liu
Zhuliang Yu
Chuang Gan
LLMAG, LM&Ro
269
59
0
15 Aug 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
201
76
0
14 Aug 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
328
85
0
09 Aug 2023
Scaling Data Generation in Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Zun Wang
Jialu Li
Yicong Hong
Yi Wang
Qi Wu
Joey Tianyi Zhou
Stephen Gould
Hao Tan
Yu Qiao
LM&Ro
320
106
0
28 Jul 2023
Kefa: A Knowledge Enhanced and Fine-grained Aligned Speaker for Navigation Instruction Generation
Haitian Zeng
Xiaohan Wang
Wenguan Wang
Yi Yang
264
10
0
25 Jul 2023
Learning Vision-and-Language Navigation from YouTube Videos
IEEE International Conference on Computer Vision (ICCV), 2023
Kun-Li Channing Lin
Peihao Chen
Di Huang
Thomas H. Li
Zhuliang Yu
Chuang Gan
LM&Ro
213
44
0
22 Jul 2023
Breaking Down the Task: A Unit-Grained Hybrid Training Framework for Vision and Language Decision Making
Ruipu Luo
Jiwen Zhang
Zhongyu Wei
VLM
214
0
0
16 Jul 2023
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Xiwen Liang
Liang Ma
Shanshan Guo
Jianhua Han
Hang Xu
Shikui Ma
Xiaodan Liang
LM&Ro, LLMAG
367
5
0
17 Jun 2023
CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments
AAAI Conference on Artificial Intelligence (AAAI), 2023
Xiulong Liu
Sudipta Paul
Moitreya Chatterjee
A. Cherian
214
12
0
06 Jun 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Neural Information Processing Systems (NeurIPS), 2023
Jialu Li
Joey Tianyi Zhou
DiffM
259
82
0
30 May 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2023
Jingyang Huo
Qiang Sun
Boyan Jiang
Haitao Lin
Yanwei Fu
333
26
0
26 May 2023
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2023
Gengze Zhou
Yicong Hong
Qi Wu
ELM, LM&Ro, LLMAG, LRM
477
270
0
26 May 2023
Masked Path Modeling for Vision-and-Language Navigation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zi-Yi Dou
Feng Gao
Nanyun Peng
LM&Ro
190
5
0
23 May 2023
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation
Engineering Applications of Artificial Intelligence (Eng. Appl. Artif. Intell.), 2023
Liuyi Wang
Chengju Liu
Zongtao He
Shu Li
Qingqing Yan
Huiyi Chen
Qi Chen
195
13
0
19 May 2023
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Dongyan An
Hongru Wang
Wenguan Wang
Zun Wang
Yan Huang
Keji He
Liang Wang
481
141
0
06 Apr 2023
Lana: A Language-Capable Navigator for Instruction Following and Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Xiaohan Wang
Wenguan Wang
Jiayi Shao
Yi Yang
LLMAG, LM&Ro
237
56
0
15 Mar 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Computer Vision and Pattern Recognition (CVPR), 2023
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
236
27
0
07 Mar 2023
MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
Zongtao He
Liuyi Wang
Shu Li
Qingqing Yan
Chengju Liu
Qi Chen
197
12
0
02 Mar 2023
ESceme: Vision-and-Language Navigation with Episodic Scene Memory
International Journal of Computer Vision (IJCV), 2023
Qinjie Zheng
Daqing Liu
Chaoyue Wang
Jing Zhang
Dadong Wang
Dacheng Tao
LM&Ro
211
10
0
02 Mar 2023
VLN-Trans: Translator for the Vision and Language Navigation Agent
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yue Zhang
Parisa Kordjamshidi
244
24
0
18 Feb 2023
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation
AAAI Conference on Artificial Intelligence (AAAI), 2023
Bingqian Lin
Yi Zhu
Xiaodan Liang
Liang Lin
Jian-zhuo Liu
CoGe, LM&Ro
299
5
0
13 Feb 2023
RREx-BoT: Remote Referring Expressions with a Bag of Tricks
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023
Gunnar Sigurdsson
Jesse Thomason
Gaurav Sukhatme
Robinson Piramuthu
LM&Ro
235
14
0
30 Jan 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Computer Vision and Pattern Recognition (CVPR), 2023
Shaofei Cai
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
243
46
0
21 Jan 2023
Graph based Environment Representation for Vision-and-Language Navigation in Continuous Environments
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Ting Wang
Zongkai Wu
Feiyu Yao
Xuetao Zhang
280
13
0
11 Jan 2023
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
Dongyan An
Yuankai Qi
Yangguang Li
Yan Huang
Liangsheng Wang
Tieniu Tan
Jing Shao
278
106
0
08 Dec 2022
Layout-aware Dreamer for Embodied Referring Expression Grounding
Mingxiao Li
Zehao Wang
Tinne Tuytelaars
Marie-Francine Moens
LM&Ro
158
7
0
30 Nov 2022
Predicting Topological Maps for Visual Navigation in Unexplored Environments
Huangying Zhan
Hamid Rezatofighi
Ian Reid
270
0
0
23 Nov 2022
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Chia-Wen Kuo
Chih-Yao Ma
Judy Hoffman
Z. Kira
204
12
0
20 Nov 2022
Towards Versatile Embodied Navigation
Neural Information Processing Systems (NeurIPS), 2022
Hongru Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
229
31
0
30 Oct 2022
Bridging the visual gap in VLN via semantically richer instructions
European Conference on Computer Vision (ECCV), 2022
Joaquín Ossandón
Benjamín Earle
Alvaro Soto
241
3
0
27 Oct 2022
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ziqiao Ma
B. VanDerPloeg
Cristian-Paul Bara
Yidong Huang
Eui-In Kim
Felix Gervits
M. Marge
J. Chai
295
12
0
22 Oct 2022
ULN: Towards Underspecified Vision-and-Language Navigation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Weixi Feng
Tsu-Jui Fu
Yujie Lu
William Yang Wang
293
5
0
18 Oct 2022
AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments
Neural Information Processing Systems (NeurIPS), 2022
Sudipta Paul
Amit K. Roy-Chowdhury
A. Cherian
188
32
0
14 Oct 2022
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation
Neural Information Processing Systems (NeurIPS), 2022
Peihao Chen
Dongyu Ji
Kun-Li Channing Lin
Runhao Zeng
Thomas H. Li
Zhuliang Yu
Chuang Gan
SSL
211
93
0
14 Oct 2022
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Computer Vision and Pattern Recognition (CVPR), 2022
Aishwarya Kamath
Peter Anderson
Su Wang
Jing Yu Koh
Alexander Ku
Austin Waters
Yinfei Yang
Jason Baldridge
Zarana Parekh
LM&Ro
415
61
0
06 Oct 2022
Iterative Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Jacob Krantz
Shurjo Banerjee
Peng Guo
Jason J. Corso
Peter Anderson
Stefan Lee
Jesse Thomason
LM&Ro
322
33
0
06 Oct 2022
LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation
International Conference on Computational Linguistics (COLING), 2022
Yue Zhang
Parisa Kordjamshidi
190
11
0
26 Sep 2022
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
European Conference on Computer Vision (ECCV), 2022
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
330
58
0
24 Aug 2022
One for All: One-stage Referring Expression Comprehension with Dynamic Reasoning
Neurocomputing, 2022
Zhipeng Zhang
Zhimin Wei
Zhongzhen Huang
Rui Niu
Peng Wang
ObjD, LRM
298
10
0
31 Jul 2022
Target-Driven Structured Transformer Planner for Vision-Language Navigation
ACM Multimedia (ACM MM), 2022
Yusheng Zhao
Jinyu Chen
Chen Gao
Wenguan Wang
Lirong Yang
Haibing Ren
Huaxia Xia
Si Liu
LM&Ro
441
74
0
19 Jul 2022
CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations
Jialu Li
Hao Tan
Joey Tianyi Zhou
LM&Ro
222
12
0
05 Jul 2022
Local Slot Attention for Vision-and-Language Navigation
International Conference on Multimedia Retrieval (ICMR), 2022
Yifeng Zhuang
Qiang Sun
Yanwei Fu
Lifeng Chen
Xiangyang Xue
277
2
0
17 Jun 2022
FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Zi-Yi Dou
Nanyun Peng
293
24
0
09 Jun 2022
ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts
Computer Vision and Pattern Recognition (CVPR), 2022
Bingqian Lin
Yi Zhu
Zicong Chen
Xiwen Liang
Jian-zhuo Liu
Xiaodan Liang
LM&Ro
194
60
0
31 May 2022
Reinforced Structured State-Evolution for Vision-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Jinyu Chen
Chen Gao
Erli Meng
Qiong Zhang
Si Liu
LM&Ro
229
50
0
20 Apr 2022
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Hongru Wang
Wei Liang
Jianbing Shen
Luc Van Gool
Wenguan Wang
216
73
0
30 Mar 2022
EnvEdit: Environment Editing for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Jialu Li
Hao Tan
Joey Tianyi Zhou
337
107
0
29 Mar 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jing Gu
Eliana Stefani
Qi Wu
Jesse Thomason
Xinze Wang
LM&Ro
367
152
0
22 Mar 2022
HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Yanyuan Qiao
Yuankai Qi
Yicong Hong
Zheng Yu
Peifeng Wang
Qi Wu
AI4TS
263
91
0
22 Mar 2022