Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1903.00401
Cited By
v1
v2 (latest)
Learning To Follow Directions in Street View
AAAI Conference on Artificial Intelligence (AAAI), 2019
1 March 2019
Karl Moritz Hermann
Mateusz Malinowski
Piotr Wojciech Mirowski
Andras Banki-Horvath
Keith Anderson
R. Hadsell
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning To Follow Directions in Street View"
42 / 42 papers shown
Title
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
335
57
0
31 Dec 2024
CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos
Computer Vision and Pattern Recognition (CVPR), 2024
Xinhao Liu
Jiajian Li
Yichen Jiang
Niranjan Sujay
Zhiyong Yang
Juexiao Zhang
John Abanes
Jing Zhang
Chen Feng
457
22
0
26 Nov 2024
Perceive, Reflect, and Plan: Designing LLM Agent for Goal-Directed City Navigation without Instructions
Qingbin Zeng
Qinglong Yang
Shunan Dong
Heming Du
Liang Zheng
Fengli Xu
Yong Li
LLMAG
LM&Ro
264
20
0
08 Aug 2024
Can ChatGPT assist visually impaired people with micro-navigation?
Junxian He
Shrinivas J. Pundlik
Gang Luo
105
0
0
31 Jul 2024
CityNav: A Large-Scale Dataset for Real-World Aerial Navigation
Jungdae Lee
Taiki Miyanishi
Shuhei Kurita
Koya Sakamoto
Daichi Azuma
Yutaka Matsuo
Nakamasa Inoue
249
23
0
20 Jun 2024
Semantic Map-based Generation of Navigation Instructions
Chengzu Li
Chao Zhang
Simone Teufel
R. Doddipatla
Svetlana Stoyanchev
171
3
0
28 Mar 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Haiwei Yang
Ruyue Yuan
LM&Ro
284
7
0
22 Feb 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jialu Li
Aishwarya Padmakumar
Gaurav Sukhatme
Mohit Bansal
253
9
0
05 Feb 2024
Multi-model fusion for Aerial Vision and Dialog Navigation based on human attention aids
Xinyi Wang
Xuan Cui
Danxu Li
Fang Liu
Licheng Jiao
82
0
0
27 Aug 2023
Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Yibo Cui
Liang Xie
Yakun Zhang
Meishan Zhang
Ye Yan
Erwei Yin
LM&Ro
167
26
0
24 Aug 2023
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
AAAI Conference on Artificial Intelligence (AAAI), 2023
Raphael Schumann
Wanrong Zhu
Weixi Feng
Tsu-Jui Fu
Stefan Riezler
William Yang Wang
LM&Ro
207
99
0
12 Jul 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Computer Vision and Pattern Recognition (CVPR), 2023
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
212
26
0
07 Mar 2023
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Chia-Wen Kuo
Chih-Yao Ma
Judy Hoffman
Z. Kira
192
12
0
20 Nov 2022
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ziqiao Ma
B. VanDerPloeg
Cristian-Paul Bara
Yidong Huang
Eui-In Kim
Felix Gervits
M. Marge
J. Chai
265
8
0
22 Oct 2022
ViLPAct: A Benchmark for Compositional Generalization on Multimodal Human Activities
Findings (Findings), 2022
Terry Yue Zhuo
Yaqing Liao
Yuecheng Lei
Zhuang Li
Gerard de Melo
Xiaojun Chang
Yazhou Ren
Zenglin Xu
204
2
0
11 Oct 2022
A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Jason Armitage
L. Impett
Rico Sennrich
340
6
0
24 Jul 2022
LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action
Conference on Robot Learning (CoRL), 2022
Dhruv Shah
B. Osinski
Brian Ichter
Sergey Levine
LM&Ro
462
593
0
10 Jul 2022
Learning Local Implicit Fourier Representation for Image Warping
European Conference on Computer Vision (ECCV), 2022
Jae-Won Lee
K. Choi
Kyong Hwan Jin
101
19
0
05 Jul 2022
EnvEdit: Environment Editing for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Jialu Li
Hao Tan
Joey Tianyi Zhou
248
104
0
29 Mar 2022
Analyzing Generalization of Vision and Language Navigation to Unseen Outdoor Areas
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Raphael Schumann
Stefan Riezler
129
33
0
25 Mar 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jing Gu
Eliana Stefani
Qi Wu
Jesse Thomason
Xinze Wang
LM&Ro
307
145
0
22 Mar 2022
Are you doing what I say? On modalities alignment in ALFRED
Ting-Rui Chiang
Yi-Ting Yeh
Ta-Chung Chi
Yau-Shian Wang
159
1
0
12 Oct 2021
Waypoint Models for Instruction-guided Navigation in Continuous Environments
Jacob Krantz
Aaron Gokaslan
Dhruv Batra
Stefan Lee
Oleksandr Maksymets
LM&Ro
271
117
0
05 Oct 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
246
41
0
26 Aug 2021
Core Challenges in Embodied Vision-Language Planning
Journal of Artificial Intelligence Research (JAIR), 2021
Jonathan M Francis
Nariaki Kitamura
Felix Labelle
Xiaopeng Lu
Ingrid Navarro
Jean Oh
LM&Ro
420
57
0
26 Jun 2021
Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Van-Quang Nguyen
Masanori Suganuma
Takayuki Okatani
LM&Ro
171
35
0
01 Jun 2021
Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
Raphael Schumann
Stefan Riezler
198
30
0
30 Dec 2020
Visually Grounding Language Instruction for History-Dependent Manipulation
IEEE International Conference on Robotics and Automation (ICRA), 2020
Hyemin Ahn
Obin Kwon
Kyungdo Kim
Jaeyeon Jeong
Howoong Jun
Hongjung Lee
Dongheui Lee
Songhwai Oh
LM&Ro
199
7
0
16 Dec 2020
ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments
Findings (Findings), 2020
Hyounghun Kim
Abhaysinh Zala
Graham Burri
Hao Tan
Joey Tianyi Zhou
LM&Ro
145
17
0
15 Nov 2020
Safe Reinforcement Learning with Natural Language Constraints
Neural Information Processing Systems (NeurIPS), 2020
Tsung-Yen Yang
Michael Y. Hu
Yinlam Chow
Peter J. Ramadge
Karthik Narasimhan
228
33
0
11 Oct 2020
Zero-Shot Compositional Policy Learning via Language Grounding
Tianshi Cao
Jingkang Wang
Yining Zhang
S. Manivasagam
LM&Ro
167
3
0
15 Apr 2020
Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
European Conference on Computer Vision (ECCV), 2020
Jacob Krantz
Erik Wijmans
Arjun Majumdar
Dhruv Batra
Stefan Lee
179
378
0
06 Apr 2020
Visual Grounding in Video for Unsupervised Word Translation
Computer Vision and Pattern Recognition (CVPR), 2020
Gunnar Sigurdsson
Jean-Baptiste Alayrac
Aida Nematzadeh
Lucas Smaira
Mateusz Malinowski
João Carreira
Phil Blunsom
Andrew Zisserman
VGen
218
51
0
11 Mar 2020
MVP: Unified Motion and Visual Self-Supervised Learning for Large-Scale Robotic Navigation
Marvin Chancán
Michael Milford
SSL
117
8
0
02 Mar 2020
Retouchdown: Adding Touchdown to StreetLearn as a Shareable Resource for Language Grounding Tasks in Street View
Harsh Mehta
Yoav Artzi
Jason Baldridge
Eugene Ie
Piotr Wojciech Mirowski
160
27
0
10 Jan 2020
Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Conference on Robot Learning (CoRL), 2019
Martin Weiss
Simon Chamorro
Roger Girgis
Margaux Luck
Samira Ebrahimi Kahou
Joseph Paul Cohen
Derek Nowrouzezahrai
Doina Precup
Florian Golemo
C. Pal
217
13
0
29 Oct 2019
HIGhER : Improving instruction following with Hindsight Generation for Experience Replay
Geoffrey Cideron
Mathieu Seurin
Florian Strub
Olivier Pietquin
148
37
0
21 Oct 2019
CityLearn: Diverse Real-World Environments for Sample-Efficient Navigation Policy Learning
IEEE International Conference on Robotics and Automation (ICRA), 2019
Marvin Chancán
Michael Milford
SSL
128
5
0
10 Oct 2019
Talk2Nav: Long-Range Vision-and-Language Navigation with Dual Attention and Spatial Memory
A. Vasudevan
Ahmed K. Farahat
Chetan Gupta
LM&Ro
187
3
0
04 Oct 2019
Transferable Representation Learning in Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2019
Haoshuo Huang
Vihan Jain
Harsh Mehta
Alexander Ku
Gabriel Ilharco
Jason Baldridge
Eugene Ie
LM&Ro
173
92
0
09 Aug 2019
Cross-View Policy Learning for Street Navigation
IEEE International Conference on Computer Vision (ICCV), 2019
Ang Li
Huiyi Hu
Piotr Wojciech Mirowski
Mehrdad Farajtabar
159
32
0
13 Jun 2019
Multi-modal Discriminative Model for Vision-and-Language Navigation
Haoshuo Huang
Vihan Jain
Harsh Mehta
Jason Baldridge
Eugene Ie
LM&Ro
151
27
0
31 May 2019
1