ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1901.03035
  4. Cited By
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation

10 January 2019
Chih-Yao Ma
Jiasen Lu
Zuxuan Wu
G. Al-Regib
Z. Kira
R. Socher
Caiming Xiong
    LM&Ro
ArXiv (abs)PDFHTMLGithub (122★)

Papers citing "Self-Monitoring Navigation Agent via Auxiliary Progress Estimation"

50 / 202 papers shown
elBERto: Self-supervised Commonsense Learning for Question Answering
elBERto: Self-supervised Commonsense Learning for Question AnsweringKnowledge-Based Systems (KBS), 2022
Xunlin Zhan
Yuan Li
Xiao Dong
Xiaodan Liang
Zhiting Hu
Lawrence Carin
SSLRALMLRM
184
9
0
17 Mar 2022
Cross-modal Map Learning for Vision and Language Navigation
Cross-modal Map Learning for Vision and Language NavigationComputer Vision and Pattern Recognition (CVPR), 2022
G. Georgakis
Karl Schmeckpeper
Karan Wanchoo
Soham Dan
E. Miltsakaki
Dan Roth
Kostas Daniilidis
377
97
0
10 Mar 2022
Visual-Language Navigation Pretraining via Prompt-based Environmental
  Self-exploration
Visual-Language Navigation Pretraining via Prompt-based Environmental Self-explorationAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiwen Liang
Fengda Zhu
Lingling Li
Hang Xu
Xiaodan Liang
LM&RoVLM
128
33
0
08 Mar 2022
Find a Way Forward: a Language-Guided Semantic Map Navigator
Find a Way Forward: a Language-Guided Semantic Map Navigator
Zehao Wang
Mingxiao Li
Minye Wu
Marie-Francine Moens
Tinne Tuytelaars
LM&Ro
151
4
0
07 Mar 2022
Bridging the Gap Between Learning in Discrete and Continuous
  Environments for Vision-and-Language Navigation
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language NavigationComputer Vision and Pattern Recognition (CVPR), 2022
Yicong Hong
Zun Wang
Qi Wu
Stephen Gould
3DV
201
113
0
05 Mar 2022
LISA: Learning Interpretable Skill Abstractions from Language
LISA: Learning Interpretable Skill Abstractions from LanguageNeural Information Processing Systems (NeurIPS), 2022
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&RoOffRL
664
34
0
28 Feb 2022
Think Global, Act Local: Dual-scale Graph Transformer for
  Vision-and-Language Navigation
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language NavigationComputer Vision and Pattern Recognition (CVPR), 2022
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
286
208
0
23 Feb 2022
One Step at a Time: Long-Horizon Vision-and-Language Navigation with
  Milestones
One Step at a Time: Long-Horizon Vision-and-Language Navigation with MilestonesComputer Vision and Pattern Recognition (CVPR), 2022
Chan Hee Song
Jihyung Kil
Tai-Yu Pan
Brian M. Sadler
Wei-Lun Chao
Yu-Chuan Su
LRM
340
39
0
14 Feb 2022
Recursive Decoding: A Situated Cognition Approach to Compositional
  Generation in Grounded Language Understanding
Recursive Decoding: A Situated Cognition Approach to Compositional Generation in Grounded Language Understanding
Matthew Setzler
Scott Howland
Lauren A. Phillips
LRM
191
6
0
27 Jan 2022
Self-supervised 3D Semantic Representation Learning for
  Vision-and-Language Navigation
Self-supervised 3D Semantic Representation Learning for Vision-and-Language Navigation
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
SSL
186
10
0
26 Jan 2022
Contrastive Instruction-Trajectory Learning for Vision-Language
  Navigation
Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
Xiwen Liang
Fengda Zhu
Yi Zhu
Bingqian Lin
Bing Wang
Xiaodan Liang
187
26
0
08 Dec 2021
Explore the Potential Performance of Vision-and-Language Navigation
  Model: a Snapshot Ensemble Method
Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method
Wenda Qin
Teruhisa Misu
Derry Wijaya
UQCVLM&Ro
214
6
0
28 Nov 2021
Curriculum Learning for Vision-and-Language Navigation
Curriculum Learning for Vision-and-Language NavigationNeural Information Processing Systems (NeurIPS), 2021
Jiwen Zhang
Zhongyu Wei
Jianqing Fan
J. Peng
LM&Ro
202
27
0
14 Nov 2021
Multimodal Transformer with Variable-length Memory for
  Vision-and-Language Navigation
Multimodal Transformer with Variable-length Memory for Vision-and-Language NavigationEuropean Conference on Computer Vision (ECCV), 2021
Chuang Lin
Yi Jiang
Jianfei Cai
Zhuang Li
Gholamreza Haffari
Zehuan Yuan
180
37
0
10 Nov 2021
SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language
  Navigation
SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
A. Moudgil
Arjun Majumdar
Harsh Agrawal
Stefan Lee
Dhruv Batra
LM&Ro
186
71
0
27 Oct 2021
History Aware Multimodal Transformer for Vision-and-Language Navigation
History Aware Multimodal Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Cordelia Schmid
Ivan Laptev
LM&Ro
299
310
0
25 Oct 2021
Explore before Moving: A Feasible Path Estimation and Memory Recalling
  Framework for Embodied Navigation
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation
Yang Wu
Shirui Feng
Guanbin Li
Liang Lin
65
0
0
16 Oct 2021
Rethinking the Spatial Route Prior in Vision-and-Language Navigation
Rethinking the Spatial Route Prior in Vision-and-Language Navigation
Xinzhe Zhou
Wei Liu
Yadong Mu
119
7
0
12 Oct 2021
Are you doing what I say? On modalities alignment in ALFRED
Are you doing what I say? On modalities alignment in ALFRED
Ting-Rui Chiang
Yi-Ting Yeh
Ta-Chung Chi
Yau-Shian Wang
203
1
0
12 Oct 2021
Skill Induction and Planning with Latent Language
Skill Induction and Planning with Latent Language
Pratyusha Sharma
Antonio Torralba
Jacob Andreas
LM&Ro
512
123
0
04 Oct 2021
Mapping Language to Programs using Multiple Reward Components with
  Inverse Reinforcement Learning
Mapping Language to Programs using Multiple Reward Components with Inverse Reinforcement Learning
Sayan Ghosh
Shashank Srivastava
245
3
0
02 Oct 2021
Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language
  Navigation in Continuous Environments
Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments
Sonia Raychaudhuri
Saim Wani
Shivansh Patel
Unnat Jain
Angel X. Chang
LM&Ro
177
86
0
30 Sep 2021
Procedures as Programs: Hierarchical Control of Situated Agents through
  Natural Language
Procedures as Programs: Hierarchical Control of Situated Agents through Natural Language
Shuyan Zhou
Pengcheng Yin
Graham Neubig
LM&Ro
231
1
0
16 Sep 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for
  Vision-and-Language Navigation in Continuous Environments
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous EnvironmentsInternational Conference on Pattern Recognition (ICPR), 2021
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
211
69
0
26 Aug 2021
Vision-Language Navigation: A Survey and Taxonomy
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
333
47
0
26 Aug 2021
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Pierre-Louis Guhur
Makarand Tapaswi
Shizhe Chen
Ivan Laptev
Cordelia Schmid
LM&Ro
210
166
0
20 Aug 2021
Adversarial Reinforced Instruction Attacker for Robust Vision-Language
  Navigation
Adversarial Reinforced Instruction Attacker for Robust Vision-Language NavigationIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Bingqian Lin
Yi Zhu
Yanxin Long
Xiaodan Liang
QiXiang Ye
Liang Lin
AAML
204
20
0
23 Jul 2021
Neighbor-view Enhanced Model for Vision and Language Navigation
Neighbor-view Enhanced Model for Vision and Language NavigationACM Multimedia (ACM MM), 2021
Dongyan An
Yuankai Qi
Yan Huang
Qi Wu
Liang Wang
Tieniu Tan
LM&Ro
250
84
0
15 Jul 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Joey Tianyi Zhou
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIPVLMMLLM
505
468
0
13 Jul 2021
A Persistent Spatial Semantic Representation for High-level Natural
  Language Instruction Execution
A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution
Valts Blukis
Chris Paxton
Dieter Fox
Animesh Garg
Yoav Artzi
LM&Ro
527
154
0
12 Jul 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
LanguageRefer: Spatial-Language Model for 3D Visual GroundingConference on Robot Learning (CoRL), 2021
Junha Roh
Karthik Desingh
Ali Farhadi
Dieter Fox
305
112
0
07 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoVLM&Ro
491
0
0
07 Jul 2021
Core Challenges in Embodied Vision-Language Planning
Core Challenges in Embodied Vision-Language PlanningJournal of Artificial Intelligence Research (JAIR), 2021
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
537
58
0
26 Jun 2021
Vision-Language Navigation with Random Environmental Mixup
Vision-Language Navigation with Random Environmental MixupIEEE International Conference on Computer Vision (ICCV), 2021
Chong Liu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
Zongyuan Ge
Yi-Dong Shen
LM&Ro
298
105
0
15 Jun 2021
RobustNav: Towards Benchmarking Robustness in Embodied Navigation
RobustNav: Towards Benchmarking Robustness in Embodied NavigationIEEE International Conference on Computer Vision (ICCV), 2021
Prithvijit Chattopadhyay
Judy Hoffman
Roozbeh Mottaghi
Aniruddha Kembhavi
285
68
0
08 Jun 2021
Hierarchical Task Learning from Language Instructions with Unified
  Transformers and Self-Monitoring
Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-MonitoringFindings (Findings), 2021
Yichi Zhang
J. Chai
235
84
0
07 Jun 2021
Look Wide and Interpret Twice: Improving Performance on Interactive
  Instruction-following Tasks
Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following TasksInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
LM&Ro
231
36
0
01 Jun 2021
Pathdreamer: A World Model for Indoor Navigation
Pathdreamer: A World Model for Indoor Navigation
Jing Yu Koh
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
347
114
0
18 May 2021
Towards Navigation by Reasoning over Spatial Configurations
Towards Navigation by Reasoning over Spatial Configurations
Yue Zhang
Quan Guo
Parisa Kordjamshidi
LLMAG
134
20
0
14 May 2021
Episodic Transformer for Vision-and-Language Navigation
Episodic Transformer for Vision-and-Language NavigationIEEE International Conference on Computer Vision (ICCV), 2021
Alexander Pashevich
Cordelia Schmid
Chen Sun
LM&Ro
346
212
0
13 May 2021
Hierarchical Cross-Modal Agent for Robotics Vision-and-Language
  Navigation
Hierarchical Cross-Modal Agent for Robotics Vision-and-Language NavigationIEEE International Conference on Robotics and Automation (ICRA), 2021
Muhammad Zubair Irshad
Chih-Yao Ma
Z. Kira
LM&Ro
188
61
0
21 Apr 2021
Improving Cross-Modal Alignment in Vision Language Navigation via
  Syntactic Information
Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic InformationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Jialu Li
Hao Hao Tan
Joey Tianyi Zhou
178
36
0
19 Apr 2021
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for
  Indoor Vision-Language Navigation
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language NavigationIEEE International Conference on Computer Vision (ICCV), 2021
Yuankai Qi
Zizheng Pan
Yicong Hong
Ming-Hsuan Yang
Anton Van Den Hengel
Qi Wu
LM&Ro
239
79
0
09 Apr 2021
SOON: Scenario Oriented Object Navigation with Graph-based Exploration
SOON: Scenario Oriented Object Navigation with Graph-based ExplorationComputer Vision and Pattern Recognition (CVPR), 2021
Fengda Zhu
Xiwen Liang
Yi Zhu
Xiaojun Chang
Xiaodan Liang
282
167
0
31 Mar 2021
Diagnosing Vision-and-Language Navigation: What Really Matters
Diagnosing Vision-and-Language Navigation: What Really MattersNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021
Wanrong Zhu
Yuankai Qi
P. Narayana
Kazoo Sone
Sugato Basu
Xinze Wang
Qi Wu
Miguel P. Eckstein
Wenjie Wang
LM&Ro
233
55
0
30 Mar 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Scene-Intuitive Agent for Remote Embodied Visual GroundingComputer Vision and Pattern Recognition (CVPR), 2021
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
184
60
0
24 Mar 2021
Structured Scene Memory for Vision-Language Navigation
Structured Scene Memory for Vision-Language NavigationComputer Vision and Pattern Recognition (CVPR), 2021
Hanqing Wang
Wenguan Wang
Wei Liang
Caiming Xiong
Jianbing Shen
LM&Ro
224
142
0
05 Mar 2021
Are We There Yet? Learning to Localize in Embodied Instruction Following
Are We There Yet? Learning to Localize in Embodied Instruction Following
Shane Storks
Qiaozi Gao
Govind Thattai
Gokhan Tur
LM&Ro
228
11
0
09 Jan 2021
Semantics for Robotic Mapping, Perception and Interaction: A Survey
Semantics for Robotic Mapping, Perception and Interaction: A Survey
Sourav Garg
Niko Sünderhauf
Feras Dayoub
D. Morrison
Akansel Cosgun
...
Tat-Jun Chin
Ian Reid
Stephen Gould
Peter Corke
Michael Milford
341
134
0
02 Jan 2021
Spatial Reasoning from Natural Language Instructions for Robot
  Manipulation
Spatial Reasoning from Natural Language Instructions for Robot ManipulationIEEE International Conference on Robotics and Automation (ICRA), 2020
S. Gubbi
Anirban Biswas
Raviteja Upadrashta
V. Srinivasan
Partha P. Talukdar
B. Amrutur
LM&RoLRM
267
34
0
26 Dec 2020
Previous
12345
Next