1901.03035
Cited By
Self-Monitoring Navigation Agent via Auxiliary Progress Estimation
10 January 2019
Chih-Yao Ma
Jiasen Lu
Zuxuan Wu
G. Al-Regib
Z. Kira
R. Socher
Caiming Xiong
LM&Ro
ArXiv (abs)
PDF
HTML
Github (122★)
Papers citing
"Self-Monitoring Navigation Agent via Auxiliary Progress Estimation"
50 / 202 papers shown
elBERto: Self-supervised Commonsense Learning for Question Answering
Knowledge-Based Systems (KBS), 2022
Xunlin Zhan
Yuan Li
Xiao Dong
Xiaodan Liang
Zhiting Hu
Lawrence Carin
SSL
RALM
LRM
184
9
0
17 Mar 2022
Cross-modal Map Learning for Vision and Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
G. Georgakis
Karl Schmeckpeper
Karan Wanchoo
Soham Dan
E. Miltsakaki
Dan Roth
Kostas Daniilidis
377
97
0
10 Mar 2022
Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiwen Liang
Fengda Zhu
Lingling Li
Hang Xu
Xiaodan Liang
LM&Ro
VLM
128
33
0
08 Mar 2022
Find a Way Forward: a Language-Guided Semantic Map Navigator
Zehao Wang
Mingxiao Li
Minye Wu
Marie-Francine Moens
Tinne Tuytelaars
LM&Ro
151
4
0
07 Mar 2022
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Yicong Hong
Zun Wang
Qi Wu
Stephen Gould
3DV
201
113
0
05 Mar 2022
LISA: Learning Interpretable Skill Abstractions from Language
Neural Information Processing Systems (NeurIPS), 2022
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&Ro
OffRL
664
34
0
28 Feb 2022
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
286
208
0
23 Feb 2022
One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones
Computer Vision and Pattern Recognition (CVPR), 2022
Chan Hee Song
Jihyung Kil
Tai-Yu Pan
Brian M. Sadler
Wei-Lun Chao
Yu-Chuan Su
LRM
340
39
0
14 Feb 2022
Recursive Decoding: A Situated Cognition Approach to Compositional Generation in Grounded Language Understanding
Matthew Setzler
Scott Howland
Lauren A. Phillips
LRM
191
6
0
27 Jan 2022
Self-supervised 3D Semantic Representation Learning for Vision-and-Language Navigation
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
SSL
186
10
0
26 Jan 2022
Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
Xiwen Liang
Fengda Zhu
Yi Zhu
Bingqian Lin
Bing Wang
Xiaodan Liang
187
26
0
08 Dec 2021
Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method
Wenda Qin
Teruhisa Misu
Derry Wijaya
UQCV
LM&Ro
214
6
0
28 Nov 2021
Curriculum Learning for Vision-and-Language Navigation
Neural Information Processing Systems (NeurIPS), 2021
Jiwen Zhang
Zhongyu Wei
Jianqing Fan
J. Peng
LM&Ro
202
27
0
14 Nov 2021
Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation
European Conference on Computer Vision (ECCV), 2021
Chuang Lin
Yi Jiang
Jianfei Cai
Zhuang Li
Gholamreza Haffari
Zehuan Yuan
180
37
0
10 Nov 2021
SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
A. Moudgil
Arjun Majumdar
Harsh Agrawal
Stefan Lee
Dhruv Batra
LM&Ro
186
71
0
27 Oct 2021
History Aware Multimodal Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Cordelia Schmid
Ivan Laptev
LM&Ro
299
310
0
25 Oct 2021
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation
Yang Wu
Shirui Feng
Guanbin Li
Liang Lin
65
0
0
16 Oct 2021
Rethinking the Spatial Route Prior in Vision-and-Language Navigation
Xinzhe Zhou
Wei Liu
Yadong Mu
119
7
0
12 Oct 2021
Are you doing what I say? On modalities alignment in ALFRED
Ting-Rui Chiang
Yi-Ting Yeh
Ta-Chung Chi
Yau-Shian Wang
203
1
0
12 Oct 2021
Skill Induction and Planning with Latent Language
Pratyusha Sharma
Antonio Torralba
Jacob Andreas
LM&Ro
512
123
0
04 Oct 2021
Mapping Language to Programs using Multiple Reward Components with Inverse Reinforcement Learning
Sayan Ghosh
Shashank Srivastava
245
3
0
02 Oct 2021
Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments
Sonia Raychaudhuri
Saim Wani
Shivansh Patel
Unnat Jain
Angel X. Chang
LM&Ro
177
86
0
30 Sep 2021
Procedures as Programs: Hierarchical Control of Situated Agents through Natural Language
Shuyan Zhou
Pengcheng Yin
Graham Neubig
LM&Ro
231
1
0
16 Sep 2021
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
International Conference on Pattern Recognition (ICPR), 2021
Muhammad Zubair Irshad
Niluthpol Chowdhury Mithun
Zachary Seymour
Han-Pang Chiu
S. Samarasekera
Rakesh Kumar
LM&Ro
211
69
0
26 Aug 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
333
47
0
26 Aug 2021
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Pierre-Louis Guhur
Makarand Tapaswi
Shizhe Chen
Ivan Laptev
Cordelia Schmid
LM&Ro
210
166
0
20 Aug 2021
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Bingqian Lin
Yi Zhu
Yanxin Long
Xiaodan Liang
QiXiang Ye
Liang Lin
AAML
204
20
0
23 Jul 2021
Neighbor-view Enhanced Model for Vision and Language Navigation
ACM Multimedia (ACM MM), 2021
Dongyan An
Yuankai Qi
Yan Huang
Qi Wu
Liang Wang
Tieniu Tan
LM&Ro
250
84
0
15 Jul 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Mohit Bansal
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
505
468
0
13 Jul 2021
A Persistent Spatial Semantic Representation for High-level Natural Language Instruction Execution
Valts Blukis
Chris Paxton
Dieter Fox
Animesh Garg
Yoav Artzi
LM&Ro
527
154
0
12 Jul 2021
LanguageRefer: Spatial-Language Model for 3D Visual Grounding
Conference on Robot Learning (CoRL), 2021
Junha Roh
Karthik Desingh
Ali Farhadi
Dieter Fox
305
112
0
07 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoV
LM&Ro
491
0
0
07 Jul 2021
Core Challenges in Embodied Vision-Language Planning
Journal of Artificial Intelligence Research (JAIR), 2021
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
537
58
0
26 Jun 2021
Vision-Language Navigation with Random Environmental Mixup
IEEE International Conference on Computer Vision (ICCV), 2021
Chong Liu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
Zongyuan Ge
Yi-Dong Shen
LM&Ro
298
105
0
15 Jun 2021
RobustNav: Towards Benchmarking Robustness in Embodied Navigation
IEEE International Conference on Computer Vision (ICCV), 2021
Prithvijit Chattopadhyay
Judy Hoffman
Roozbeh Mottaghi
Aniruddha Kembhavi
285
68
0
08 Jun 2021
Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring
Findings of the Association for Computational Linguistics (Findings of ACL), 2021
Yichi Zhang
J. Chai
235
84
0
07 Jun 2021
Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
LM&Ro
231
36
0
01 Jun 2021
Pathdreamer: A World Model for Indoor Navigation
Jing Yu Koh
Honglak Lee
Yinfei Yang
Jason Baldridge
Peter Anderson
347
114
0
18 May 2021
Towards Navigation by Reasoning over Spatial Configurations
Yue Zhang
Quan Guo
Parisa Kordjamshidi
LLMAG
134
20
0
14 May 2021
Episodic Transformer for Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2021
Alexander Pashevich
Cordelia Schmid
Chen Sun
LM&Ro
346
212
0
13 May 2021
Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation
IEEE International Conference on Robotics and Automation (ICRA), 2021
Muhammad Zubair Irshad
Chih-Yao Ma
Z. Kira
LM&Ro
188
61
0
21 Apr 2021
Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Jialu Li
Hao Tan
Mohit Bansal
178
36
0
19 Apr 2021
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2021
Yuankai Qi
Zizheng Pan
Yicong Hong
Ming-Hsuan Yang
Anton Van Den Hengel
Qi Wu
LM&Ro
239
79
0
09 Apr 2021
SOON: Scenario Oriented Object Navigation with Graph-based Exploration
Computer Vision and Pattern Recognition (CVPR), 2021
Fengda Zhu
Xiwen Liang
Yi Zhu
Xiaojun Chang
Xiaodan Liang
282
167
0
31 Mar 2021
Diagnosing Vision-and-Language Navigation: What Really Matters
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Wanrong Zhu
Yuankai Qi
P. Narayana
Kazoo Sone
Sugato Basu
Xinze Wang
Qi Wu
Miguel P. Eckstein
Wenjie Wang
LM&Ro
233
55
0
30 Mar 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2021
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
184
60
0
24 Mar 2021
Structured Scene Memory for Vision-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2021
Hanqing Wang
Wenguan Wang
Wei Liang
Caiming Xiong
Jianbing Shen
LM&Ro
224
142
0
05 Mar 2021
Are We There Yet? Learning to Localize in Embodied Instruction Following
Shane Storks
Qiaozi Gao
Govind Thattai
Gokhan Tur
LM&Ro
228
11
0
09 Jan 2021
Semantics for Robotic Mapping, Perception and Interaction: A Survey
Sourav Garg
Niko Sünderhauf
Feras Dayoub
D. Morrison
Akansel Cosgun
...
Tat-Jun Chin
Ian Reid
Stephen Gould
Peter Corke
Michael Milford
341
134
0
02 Jan 2021
Spatial Reasoning from Natural Language Instructions for Robot Manipulation
IEEE International Conference on Robotics and Automation (ICRA), 2020
S. Gubbi
Anirban Biswas
Raviteja Upadrashta
V. Srinivasan
Partha P. Talukdar
B. Amrutur
LM&Ro
LRM
267
34
0
26 Dec 2020