Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2103.12944
Cited By
Scene-Intuitive Agent for Remote Embodied Visual Grounding
Computer Vision and Pattern Recognition (CVPR), 2021
24 March 2021
Xiangru Lin
Guanbin Li
Yizhou Yu
LM&Ro
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Scene-Intuitive Agent for Remote Embodied Visual Grounding"
39 / 39 papers shown
TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making
Shanshan Li
Da Huang
Yu He
Yanwei Fu
Yu-Gang Jiang
Xiangyang Xue
316
0
0
21 Nov 2025
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Peiran Xu
Xicheng Gong
Yadong Mu
196
5
0
18 Oct 2025
Landmark-Guided Knowledge for Vision-and-Language Navigation
International Conference on Intelligent Computing (ICIC), 2025
Dongsheng Yang
Meiling Zhu
Yinfeng Yu
LM&Ro
201
0
0
30 Sep 2025
Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation
Ruoyu Wang
Tong Yu
Junda Wu
Yao Liu
Julian McAuley
Lina Yao
288
5
0
18 Jun 2025
DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language Navigation
International Conference on Multimedia Retrieval (ICMR), 2025
Yinfeng Yu
Dongsheng Yang
442
6
0
30 Apr 2025
Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation
Ziming Wei
Bingqian Lin
Yunshuang Nie
Jiaqi Chen
Shikui Ma
Hang Xu
Xiaodan Liang
564
5
0
23 Mar 2025
PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation
Neural Networks (NN), 2025
Sen Wang
Dongliang Zhou
Liang Xie
Chao Xu
Ye Yan
Erwei Yin
DiffM
368
12
0
13 Mar 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
442
78
0
31 Dec 2024
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts
Gengze Zhou
Yicong Hong
Zun Wang
Chongyang Zhao
Joey Tianyi Zhou
Qi Wu
219
11
0
07 Dec 2024
Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language Navigation
AAAI Conference on Artificial Intelligence (AAAI), 2024
Yiyuan Pan
Yunzhe Xu
Yanfeng Guo
Hesheng Wang
LM&Ro
544
6
0
30 Nov 2024
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi
Yicong Hong
Yuankai Qi
Qi Wu
Shirui Pan
Javen Qinfeng Shi
253
21
0
03 Jun 2024
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Bingqian Lin
Yunshuang Nie
Ziming Wei
Yi Zhu
Hang Xu
Shikui Ma
Jianzhuang Liu
Xiaodan Liang
LM&Ro
392
24
0
29 May 2024
Vision-and-Language Navigation Generative Pretrained Transformer
Hanlin Wen
LM&Ro
317
0
0
27 May 2024
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
334
56
0
16 Apr 2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
GAN
286
5
0
15 Apr 2024
Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation
Bowen Huang
Yanwei Zheng
Chuanlin Lan
Xinpeng Zhao
Yifei Zou
Dongxiao Yu
373
1
0
23 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
322
77
0
21 Mar 2024
Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation
IEEE Robotics and Automation Letters (RA-L), 2024
Ming Xu
Zilong Xie
323
4
0
18 Mar 2024
Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Ronghao Dang
Huiyi Chen
Chengju Liu
Qi Chen
336
3
0
06 Mar 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Haiwei Yang
Ruyue Yuan
LM&Ro
498
8
0
22 Feb 2024
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
International Conference on Machine Learning (ICML), 2023
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
646
22
0
22 Nov 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
392
100
0
09 Aug 2023
Scaling Data Generation in Vision-and-Language Navigation
IEEE International Conference on Computer Vision (ICCV), 2023
Zun Wang
Jialu Li
Yicong Hong
Yi Wang
Qi Wu
Joey Tianyi Zhou
Stephen Gould
Hao Tan
Yu Qiao
LM&Ro
426
134
0
28 Jul 2023
Learning Vision-and-Language Navigation from YouTube Videos
IEEE International Conference on Computer Vision (ICCV), 2023
Kun-Li Channing Lin
Peihao Chen
Di Huang
Thomas H. Li
Zhuliang Yu
Chuang Gan
LM&Ro
288
55
0
22 Jul 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2023
Jingyang Huo
Qiang Sun
Boyan Jiang
Haitao Lin
Yanwei Fu
455
28
0
26 May 2023
A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation
International Joint Conference on Artificial Intelligence (IJCAI), 2023
Liuyi Wang
Zongtao He
Jiagui Tang
Ronghao Dang
Naijia Wang
Chengju Liu
Qi Chen
355
29
0
05 May 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding
Computer Vision and Pattern Recognition (CVPR), 2023
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
291
29
0
07 Mar 2023
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation
AAAI Conference on Artificial Intelligence (AAAI), 2023
Bingqian Lin
Yi Zhu
Xiaodan Liang
Liang Lin
Jian-zhuo Liu
CoGe
LM&Ro
377
7
0
13 Feb 2023
Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation
International Conference on Machine Learning (ICML), 2023
Ronghao Dang
Lu Chen
Liuyi Wang
Zongtao He
Chengju Liu
Qi Chen
LRM
193
16
0
03 Feb 2023
RREx-BoT: Remote Referring Expressions with a Bag of Tricks
IEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Gunnar Sigurdsson
Jesse Thomason
Gaurav Sukhatme
Robinson Piramuthu
LM&Ro
280
14
0
30 Jan 2023
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
Dongyan An
Yuankai Qi
Yangguang Li
Yan Huang
Liangsheng Wang
Tieniu Tan
Jing Shao
338
131
0
08 Dec 2022
Layout-aware Dreamer for Embodied Referring Expression Grounding
Mingxiao Li
Zehao Wang
Tinne Tuytelaars
Marie-Francine Moens
LM&Ro
197
7
0
30 Nov 2022
Embodied Referring Expression for Manipulation Question Answering in Interactive Environment
IEEE International Conference on Robotics and Automation (ICRA), 2022
Qie Sima
Sinan Tan
Huaping Liu
LM&Ro
221
8
0
06 Oct 2022
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
European Conference on Computer Vision (ECCV), 2022
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
387
61
0
24 Aug 2022
Target-Driven Structured Transformer Planner for Vision-Language Navigation
ACM Multimedia (ACM MM), 2022
Yusheng Zhao
Jinyu Chen
Chen Gao
Wenguan Wang
Lirong Yang
Haibing Ren
Huaxia Xia
Si Liu
LM&Ro
482
82
0
19 Jul 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Jing Gu
Eliana Stefani
Qi Wu
Jesse Thomason
Xinze Wang
LM&Ro
437
170
0
22 Mar 2022
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Computer Vision and Pattern Recognition (CVPR), 2022
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
389
238
0
23 Feb 2022
History Aware Multimodal Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Cordelia Schmid
Ivan Laptev
LM&Ro
378
345
0
25 Oct 2021
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
440
60
0
26 Aug 2021
1
Page 1 of 1