2210.03112
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Computer Vision and Pattern Recognition (CVPR), 2022
6 October 2022
Aishwarya Kamath
Peter Anderson
Su Wang
Jing Yu Koh
Alexander Ku
Austin Waters
Yinfei Yang
Jason Baldridge
Zarana Parekh
LM&Ro
Papers citing
"A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning"
43 / 43 papers shown
Title
Embodied Navigation with Auxiliary Task of Action Description Prediction
Haru Kondoh
Asako Kanezaki
132
0
0
21 Oct 2025
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Peiran Xu
Xicheng Gong
Yadong Mu
126
0
0
18 Oct 2025
Teaching RL Agents to Act Better: VLM as Action Advisor for Online Reinforcement Learning
Xiefeng Wu
Jing Zhao
Shu Zhang
Mingyu Hu
OffRL
60
1
0
25 Sep 2025
Harnessing Input-Adaptive Inference for Efficient VLN
Dongwoo Kang
Akhil Perincherry
Zachary Coalson
Aiden Gabriel
Stefan Lee
Sanghyun Hong
LM&Ro
102
0
0
12 Aug 2025
NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments
Xuan Yao
Junyu Gao
Changsheng Xu
LM&Ro
165
11
0
30 Jun 2025
Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations
Yibo Cui
Liang Xie
Yu Zhao
Jiawei Sun
Erwei Yin
155
2
0
10 Jun 2025
NavBench: Probing Multimodal Large Language Models for Embodied Navigation
Yanyuan Qiao
Haodong Hong
Wenqi Lyu
Dong An
Siqi Zhang
Yutong Xie
Xinyu Wang
Qi Wu
LM&Ro
215
3
0
01 Jun 2025
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation
P. Zhang
Yifei Su
Pengyuan Wu
Dong An
Li Zhang
Zhigang Wang
Dong Wang
Yan Ding
Jiangwei Zhong
Xuelong Li
LM&Ro
360
2
0
27 May 2025
CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Weichen Zhang
Chen Gao
Shiquan Yu
Ruiying Peng
Baining Zhao
Qian Zhang
Jinqiang Cui
Xinlei Chen
Yongqian Li
LLMAG
LM&Ro
480
4
0
08 May 2025
Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
Computer Vision and Pattern Recognition (CVPR), 2025
Akhil Perincherry
Jacob Krantz
Stefan Lee
LM&Ro
240
7
0
20 Mar 2025
PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation
Neural Networks (NN), 2025
Sen Wang
Dongliang Zhou
Liang Xie
Chao Xu
Ye Yan
Erwei Yin
DiffM
298
7
0
13 Mar 2025
TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation
Navid Rajabi
Jana Kosecka
LM&Ro
3DV
393
0
0
11 Feb 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
367
60
0
31 Dec 2024
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou
Yicong Hong
Zun Wang
Xin Eric Wang
Qi Wu
LM&Ro
254
69
0
17 Jul 2024
Controllable Navigation Instruction Generation with Chain of Thought Prompting
Xianghao Kong
Jinyu Chen
Wenguan Wang
Hang Su
Xiaolin Hu
Yi Yang
Si Liu
LRM
221
15
0
10 Jul 2024
Into the Unknown: Generating Geospatial Descriptions for New Environments
Tzuf Paz-Argaman
John Palowitch
Sayali Kulkarni
Reut Tsarfaty
Jason Baldridge
256
1
0
28 Jun 2024
LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning
Utsav Singh
Pramit Bhattacharyya
Vinay P. Namboodiri
LM&Ro
496
3
0
09 Jun 2024
LEGENT: Open Platform for Embodied Agents
Zhili Cheng
Zhitong Wang
Jinyi Hu
Shengding Hu
An Liu
Yuge Tu
Pengkai Li
Lei Shi
Zhiyuan Liu
Maosong Sun
VLM
176
12
0
28 Apr 2024
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
243
39
0
16 Apr 2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
GAN
188
5
0
15 Apr 2024
Semantic Map-based Generation of Navigation Instructions
Chengzu Li
Chao Zhang
Simone Teufel
R. Doddipatla
Svetlana Stoyanchev
179
4
0
28 Mar 2024
OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation
Ganlong Zhao
Guanbin Li
Weikai Chen
Yizhou Yu
240
13
0
26 Mar 2024
Continual Vision-and-Language Navigation
Seongjun Jeong
Gi-Cheon Kang
Seongho Choi
Joochan Kim
Byoung-Tak Zhang
317
4
0
22 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
214
55
0
21 Mar 2024
Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Vishnu Sashank Dorbala
Sanjoy Chowdhury
Dinesh Manocha
LM&Ro
340
7
0
18 Mar 2024
Recurrent Aligned Network for Generalized Pedestrian Trajectory Prediction
Yonghao Dong
Le Wang
Sanpin Zhou
Gang Hua
Changyin Sun
334
16
0
09 Mar 2024
Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Ronghao Dang
Huiyi Chen
Chengju Liu
Qi Chen
183
3
0
06 Mar 2024
Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections
Lingjun Zhao
Khanh Nguyen
Hal Daumé
195
4
0
26 Feb 2024
Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation
X. Lei
Min Wang
Wen-gang Zhou
Li Li
Houqiang Li
243
16
0
25 Feb 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Jiazhao Zhang
Kunyu Wang
Rongtao Xu
Gengze Zhou
Yicong Hong
Xiaomeng Fang
Qi Wu
Dongbin Zhao
He Wang
LM&Ro
567
142
0
24 Feb 2024
DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving
AAAI Conference on Artificial Intelligence (AAAI), 2024
Wencheng Han
Dongqian Guo
Cheng-Zhong Xu
Jianbing Shen
244
53
0
08 Jan 2024
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
International Conference on Machine Learning (ICML), 2023
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
480
14
0
22 Nov 2023
A Survey on Multimodal Large Language Models for Autonomous Driving
Can Cui
Yunsheng Ma
Xu Cao
Wenqian Ye
Yang Zhou
...
Xinrui Yan
Shuqi Mei
Jianguo Cao
Ziran Wang
Chao Zheng
304
422
0
21 Nov 2023
Hallucination Detection for Grounded Instruction Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Lingjun Zhao
Khanh Nguyen
Hal Daumé
HILM
201
8
0
23 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
577
216
0
04 Oct 2023
Scaling Data Generation in Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Zun Wang
Jialu Li
Yicong Hong
Yi Wang
Qi Wu
Joey Tianyi Zhou
Stephen Gould
Hao Tan
Yu Qiao
LM&Ro
289
100
0
28 Jul 2023
GridMM: Grid Memory Map for Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
355
99
0
24 Jul 2023
Learning Vision-and-Language Navigation from YouTube Videos
IEEE International Conference on Computer Vision (ICCV), 2023
Kun-Li Channing Lin
Peihao Chen
Di Huang
Thomas H. Li
Zhuliang Yu
Chuang Gan
LM&Ro
193
43
0
22 Jul 2023
Behavioral Analysis of Vision-and-Language Navigation Agents
Computer Vision and Pattern Recognition (CVPR), 2023
Zijiao Yang
Arjun Majumdar
Stefan Lee
LM&Ro
LLMAG
133
10
0
20 Jul 2023
Language to Rewards for Robotic Skill Synthesis
Conference on Robot Learning (CoRL), 2023
Wenhao Yu
Nimrod Gileadi
Chuyuan Fu
Sean Kirmani
Kuang-Huei Lee
...
N. Heess
Dorsa Sadigh
Jie Tan
Yuval Tassa
F. Xia
LM&Ro
224
350
0
14 Jun 2023
Masked Path Modeling for Vision-and-Language Navigation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Zi-Yi Dou
Feng Gao
Nanyun Peng
LM&Ro
170
4
0
23 May 2023
ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023
Dongyan An
Hongru Wang
Wenguan Wang
Zun Wang
Yan Huang
Keji He
Liang Wang
455
133
0
06 Apr 2023
Define, Evaluate, and Improve Task-Oriented Cognitive Capabilities for Instruction Generation Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Lingjun Zhao
Khanh Nguyen
Hal Daumé
ELM
229
7
0
21 Dec 2022