Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1908.03409
Cited By
v1
v2 (latest)
Transferable Representation Learning in Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2019
9 August 2019
Haoshuo Huang
Vihan Jain
Harsh Mehta
Alexander Ku
Gabriel Ilharco
Jason Baldridge
Eugene Ie
LM&Ro
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Transferable Representation Learning in Vision-and-Language Navigation"
50 / 51 papers shown
Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation
Ruoyu Wang
Tong Yu
Junda Wu
Yao Liu
Julian McAuley
Lina Yao
277
5
0
18 Jun 2025
Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations
Yibo Cui
Liang Xie
Yu Zhao
Jiawei Sun
Erwei Yin
214
2
0
10 Jun 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Information Fusion (Inf. Fusion), 2025
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
520
48
0
03 Apr 2025
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
IEEE International Conference on Robotics and Automation (ICRA), 2024
Yanyuan Qiao
Wenqi Lyu
Hui Wang
Zixu Wang
Zerui Li
Yuan Zhang
Zhuliang Yu
Qi Wu
LRM
368
43
0
27 Sep 2024
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Bingqian Lin
Yunshuang Nie
Ziming Wei
Yi Zhu
Hang Xu
Shikui Ma
Jianzhuang Liu
Xiaodan Liang
LM&Ro
373
22
0
29 May 2024
Closed Loop Interactive Embodied Reasoning for Robot Manipulation
Michal Nazarczuk
Jan Kristof Behrens
Karla Stepanova
Matej Hoffmann
K. Mikolajczyk
LM&Ro
LRM
455
4
0
23 Apr 2024
Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation
IEEE Robotics and Automation Letters (RA-L), 2024
Ming Xu
Zilong Xie
317
4
0
18 Mar 2024
Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation
Francesco Taioli
Stefano Rosa
A. Castellini
Lorenzo Natale
Alessio Del Bue
Alessandro Farinelli
Marco Cristani
Yiming Wang
365
9
0
15 Mar 2024
Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation
Neural Information Processing Systems (NeurIPS), 2023
Hongchen Wang
Andy Guan Hong Chen
Xiaoqi Li
Mingdong Wu
Hao Dong
515
27
0
15 Sep 2023
DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Hanqing Wang
Wei Liang
Luc Van Gool
Wenguan Wang
LM&Ro
288
89
0
14 Aug 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
355
92
0
09 Aug 2023
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation
Engineering applications of artificial intelligence (Eng. Appl. Artif. Intell.), 2023
Liuyi Wang
Chengju Liu
Zongtao He
Shu Li
Qingqing Yan
Huiyi Chen
Qi Chen
229
13
0
19 May 2023
Lana: A Language-Capable Navigator for Instruction Following and Generation
Computer Vision and Pattern Recognition (CVPR), 2023
Xiaohan Wang
Wenguan Wang
Jiayi Shao
Yi Yang
LLMAG
LM&Ro
263
58
0
15 Mar 2023
CLIP-Nav: Using CLIP for Zero-Shot Vision-and-Language Navigation
Vishnu Sashank Dorbala
Gunnar Sigurdsson
Robinson Piramuthu
Jesse Thomason
Gaurav Sukhatme
LM&Ro
236
79
0
30 Nov 2022
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Chia-Wen Kuo
Chih-Yao Ma
Judy Hoffman
Z. Kira
259
12
0
20 Nov 2022
A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Jason Armitage
L. Impett
Rico Sennrich
413
6
0
24 Jul 2022
Target-Driven Structured Transformer Planner for Vision-Language Navigation
ACM Multimedia (ACM MM), 2022
Yusheng Zhao
Jinyu Chen
Chen Gao
Wenguan Wang
Lirong Yang
Haibing Ren
Huaxia Xia
Si Liu
LM&Ro
464
79
0
19 Jul 2022
CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations
Jialu Li
Hao Tan
Joey Tianyi Zhou
LM&Ro
243
12
0
05 Jul 2022
Local Slot Attention for Vision-and-Language Navigation
International Conference on Multimedia Retrieval (ICMR), 2022
Yifeng Zhuang
Qiang Sun
Yanwei Fu
Lifeng Chen
Xiangyang Xue
299
2
0
17 Jun 2022
ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts
Computer Vision and Pattern Recognition (CVPR), 2022
Bingqian Lin
Yi Zhu
Zicong Chen
Xiwen Liang
Jian-zhuo Liu
Xiaodan Liang
LM&Ro
254
65
0
31 May 2022
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Hongru Wang
Wei Liang
Jianbing Shen
Luc Van Gool
Wenguan Wang
245
78
0
30 Mar 2022
EnvEdit: Environment Editing for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Jialu Li
Hao Tan
Joey Tianyi Zhou
471
114
0
29 Mar 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jing Gu
Eliana Stefani
Qi Wu
Jesse Thomason
Xinze Wang
LM&Ro
393
163
0
22 Mar 2022
HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Yanyuan Qiao
Yuankai Qi
Yicong Hong
Zheng Yu
Peifeng Wang
Qi Wu
AI4TS
272
97
0
22 Mar 2022
Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Xiwen Liang
Fengda Zhu
Lingling Li
Hang Xu
Xiaodan Liang
LM&Ro
VLM
160
35
0
08 Mar 2022
Rethinking the Spatial Route Prior in Vision-and-Language Navigation
Xinzhe Zhou
Wei Liu
Yadong Mu
172
7
0
12 Oct 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
416
55
0
26 Aug 2021
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Pierre-Louis Guhur
Makarand Tapaswi
Shizhe Chen
Ivan Laptev
Cordelia Schmid
LM&Ro
293
174
0
20 Aug 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Joey Tianyi Zhou
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
595
493
0
13 Jul 2021
Vision-Language Navigation with Random Environmental Mixup
IEEE International Conference on Computer Vision (ICCV), 2021
Chong Liu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
Zongyuan Ge
Yi-Dong Shen
LM&Ro
309
112
0
15 Jun 2021
Episodic Transformer for Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2021
Alexander Pashevich
Cordelia Schmid
Chen Sun
LM&Ro
463
219
0
13 May 2021
Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Jialu Li
Hao Hao Tan
Joey Tianyi Zhou
245
37
0
19 Apr 2021
SOON: Scenario Oriented Object Navigation with Graph-based Exploration
Computer Vision and Pattern Recognition (CVPR), 2021
Fengda Zhu
Xiwen Liang
Yi Zhu
Xiaojun Chang
Xiaodan Liang
380
170
0
31 Mar 2021
Diagnosing Vision-and-Language Navigation: What Really Matters
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Wanrong Zhu
Yuankai Qi
P. Narayana
Kazoo Sone
Sugato Basu
Xinze Wang
Qi Wu
Miguel P. Eckstein
Wenjie Wang
LM&Ro
263
56
0
30 Mar 2021
Structured Scene Memory for Vision-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2021
Hanqing Wang
Wenguan Wang
Wei Liang
Caiming Xiong
Jianbing Shen
LM&Ro
298
148
0
05 Mar 2021
On the Evaluation of Vision-and-Language Navigation Instructions
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2021
Mingde Zhao
Peter Anderson
Vihan Jain
Su Wang
Alexander Ku
Jason Baldridge
Eugene Ie
577
59
0
26 Jan 2021
Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning
Weixia Zhang
Chao Ma
Qi Wu
Xiaokang Yang
248
57
0
22 Nov 2020
Grounding Implicit Goal Description for Robot Indoor Navigation Via Recursive Belief Update
Rui Chen
Jinxin Zhao
Liangjun Zhang
67
0
0
10 Nov 2020
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
335
6
0
19 Oct 2020
Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding
Alexander Ku
Peter Anderson
Roma Patel
Eugene Ie
Jason Baldridge
345
464
0
15 Oct 2020
Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule
International Conference on Learning Representations (ICLR), 2020
Shuhei Kurita
Dong Wang
LM&Ro
281
29
0
16 Sep 2020
Object-and-Action Aware Model for Visual Language Navigation
European Conference on Computer Vision (ECCV), 2020
Yuankai Qi
Zizheng Pan
Shengping Zhang
Anton Van Den Hengel
Qi Wu
LM&Ro
267
132
0
29 Jul 2020
Active Visual Information Gathering for Vision-Language Navigation
European Conference on Computer Vision (ECCV), 2020
Hanqing Wang
Wenguan Wang
Tianmin Shu
Wei Liang
Jianbing Shen
370
83
0
15 Jul 2020
BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps
Peng Guo
Hexiang Hu
Jiacheng Chen
Zhiwei Deng
Vihan Jain
Eugene Ie
Fei Sha
LM&Ro
240
80
0
10 May 2020
Diagnosing the Environment Bias in Vision-and-Language Navigation
International Joint Conference on Artificial Intelligence (IJCAI), 2020
Yubo Zhang
Hao Tan
Joey Tianyi Zhou
234
62
0
06 May 2020
Sub-Instruction Aware Vision-and-Language Navigation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Yicong Hong
Cristian Rodriguez-Opazo
Qi Wu
Stephen Gould
354
86
0
06 Apr 2020
Environment-agnostic Multitask Learning for Natural Language Grounded Navigation
European Conference on Computer Vision (ECCV), 2020
Xinze Wang
Vihan Jain
Eugene Ie
William Yang Wang
Zornitsa Kozareva
Sujith Ravi
LM&Ro
366
71
0
01 Mar 2020
From Seeing to Moving: A Survey on Learning for Visual Indoor Navigation (VIN)
Xin Ye
Yezhou Yang
SSL
411
17
0
26 Feb 2020
VALAN: Vision and Language Agent Navigation
L. Lansing
Vihan Jain
Harsh Mehta
Haoshuo Huang
Eugene Ie
LM&Ro
AI4TS
160
8
0
06 Dec 2019
Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks
Computer Vision and Pattern Recognition (CVPR), 2019
Fengda Zhu
Yi Zhu
Xiaojun Chang
Xiaodan Liang
LRM
529
274
0
18 Nov 2019
1
2
Next
Page 1 of 2