ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.12944
  4. Cited By
Scene-Intuitive Agent for Remote Embodied Visual Grounding

Scene-Intuitive Agent for Remote Embodied Visual Grounding

Computer Vision and Pattern Recognition (CVPR), 2021
24 March 2021
Xiangru Lin
Guanbin Li
Yizhou Yu
    LM&Ro
ArXiv (abs)PDFHTML

Papers citing "Scene-Intuitive Agent for Remote Embodied Visual Grounding"

39 / 39 papers shown
TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making
TP-MDDN: Task-Preferenced Multi-Demand-Driven Navigation with Autonomous Decision-Making
Shanshan Li
Da Huang
Yu He
Yanwei Fu
Yu-Gang Jiang
Xiangyang Xue
316
0
0
21 Nov 2025
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Peiran Xu
Xicheng Gong
Yadong Mu
196
5
0
18 Oct 2025
Landmark-Guided Knowledge for Vision-and-Language Navigation
Landmark-Guided Knowledge for Vision-and-Language NavigationInternational Conference on Intelligent Computing (ICIC), 2025
Dongsheng Yang
Meiling Zhu
Yinfeng Yu
LM&Ro
201
0
0
30 Sep 2025
Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation
Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation
Ruoyu Wang
Tong Yu
Junda Wu
Yao Liu
Julian McAuley
Lina Yao
288
5
0
18 Jun 2025
DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language Navigation
DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language NavigationInternational Conference on Multimedia Retrieval (ICMR), 2025
Yinfeng Yu
Dongsheng Yang
442
6
0
30 Apr 2025
Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation
Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation
Ziming Wei
Bingqian Lin
Yunshuang Nie
Jiaqi Chen
Shikui Ma
Hang Xu
Xiaodan Liang
564
5
0
23 Mar 2025
PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language Navigation
PanoGen++: Domain-Adapted Text-Guided Panoramic Environment Generation for Vision-and-Language NavigationNeural Networks (NN), 2025
Sen Wang
Dongliang Zhou
Liang Xie
Chao Xu
Ye Yan
Erwei Yin
DiffM
368
12
0
13 Mar 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
442
78
0
31 Dec 2024
SAME: Learning Generic Language-Guided Visual Navigation with
  State-Adaptive Mixture of Experts
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts
Gengze Zhou
Yicong Hong
Zun Wang
Chongyang Zhao
Joey Tianyi Zhou
Qi Wu
219
11
0
07 Dec 2024
Planning from Imagination: Episodic Simulation and Episodic Memory for
  Vision-and-Language Navigation
Planning from Imagination: Episodic Simulation and Episodic Memory for Vision-and-Language NavigationAAAI Conference on Artificial Intelligence (AAAI), 2024
Yiyuan Pan
Yunzhe Xu
Yanfeng Guo
Hesheng Wang
LM&Ro
544
6
0
30 Nov 2024
Augmented Commonsense Knowledge for Remote Object Grounding
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi
Yicong Hong
Yuankai Qi
Qi Wu
Shirui Pan
Javen Qinfeng Shi
253
21
0
03 Jun 2024
Correctable Landmark Discovery via Large Models for Vision-Language
  Navigation
Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Bingqian Lin
Yunshuang Nie
Ziming Wei
Yi Zhu
Hang Xu
Shikui Ma
Jianzhuang Liu
Xiaodan Liang
LM&Ro
392
24
0
29 May 2024
Vision-and-Language Navigation Generative Pretrained Transformer
Vision-and-Language Navigation Generative Pretrained Transformer
Hanlin Wen
LM&Ro
317
0
0
27 May 2024
Vision-and-Language Navigation via Causal Learning
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
334
56
0
16 Apr 2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
AIGeN: An Adversarial Approach for Instruction Generation in VLN
Niyati Rawal
Roberto Bigazzi
Lorenzo Baraldi
Rita Cucchiara
GAN
286
5
0
15 Apr 2024
Temporal-Spatial Object Relations Modeling for Vision-and-Language
  Navigation
Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation
Bowen Huang
Yanwei Zheng
Chuanlin Lan
Xinpeng Zhao
Yifei Zou
Dongxiao Yu
373
1
0
23 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
322
77
0
21 Mar 2024
Hierarchical Spatial Proximity Reasoning for Vision-and-Language
  Navigation
Hierarchical Spatial Proximity Reasoning for Vision-and-Language NavigationIEEE Robotics and Automation Letters (RA-L), 2024
Ming Xu
Zilong Xie
323
4
0
18 Mar 2024
Causality-based Cross-Modal Representation Learning for
  Vision-and-Language Navigation
Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Ronghao Dang
Huiyi Chen
Chengju Liu
Qi Chen
336
3
0
06 Mar 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Haiwei Yang
Ruyue Yuan
LM&Ro
498
8
0
22 Feb 2024
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Fast-Slow Test-Time Adaptation for Online Vision-and-Language NavigationInternational Conference on Machine Learning (ICML), 2023
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
646
22
0
22 Nov 2023
Bird's-Eye-View Scene Graph for Vision-Language Navigation
Bird's-Eye-View Scene Graph for Vision-Language NavigationIEEE International Conference on Computer Vision (ICCV), 2023
Ruitao Liu
Xiaohan Wang
Wenguan Wang
Yi Yang
392
100
0
09 Aug 2023
Scaling Data Generation in Vision-and-Language Navigation
Scaling Data Generation in Vision-and-Language NavigationIEEE International Conference on Computer Vision (ICCV), 2023
Zun Wang
Jialu Li
Yicong Hong
Yi Wang
Qi Wu
Joey Tianyi Zhou
Stephen Gould
Hao Tan
Yu Qiao
LM&Ro
426
134
0
28 Jul 2023
Learning Vision-and-Language Navigation from YouTube Videos
Learning Vision-and-Language Navigation from YouTube VideosIEEE International Conference on Computer Vision (ICCV), 2023
Kun-Li Channing Lin
Peihao Chen
Di Huang
Thomas H. Li
Zhuliang Yu
Chuang Gan
LM&Ro
288
55
0
22 Jul 2023
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot
  Attention for Vision-and-Language Navigation
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language NavigationComputer Vision and Pattern Recognition (CVPR), 2023
Jingyang Huo
Qiang Sun
Boyan Jiang
Haitao Lin
Yanwei Fu
455
28
0
26 May 2023
A Dual Semantic-Aware Recurrent Global-Adaptive Network For
  Vision-and-Language Navigation
A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language NavigationInternational Joint Conference on Artificial Intelligence (IJCAI), 2023
Liuyi Wang
Zongtao He
Jiagui Tang
Ronghao Dang
Naijia Wang
Chengju Liu
Qi Chen
355
29
0
05 May 2023
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation
  Using Scene Object Spectrum Grounding
Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum GroundingComputer Vision and Pattern Recognition (CVPR), 2023
Minyoung Hwang
Jaeyeon Jeong
Minsoo Kim
Yoonseon Oh
Songhwai Oh
291
29
0
07 Mar 2023
Actional Atomic-Concept Learning for Demystifying Vision-Language
  Navigation
Actional Atomic-Concept Learning for Demystifying Vision-Language NavigationAAAI Conference on Artificial Intelligence (AAAI), 2023
Bingqian Lin
Yi Zhu
Xiaodan Liang
Liang Lin
Jian-zhuo Liu
CoGeLM&Ro
377
7
0
13 Feb 2023
Multiple Thinking Achieving Meta-Ability Decoupling for Object
  Navigation
Multiple Thinking Achieving Meta-Ability Decoupling for Object NavigationInternational Conference on Machine Learning (ICML), 2023
Ronghao Dang
Lu Chen
Liuyi Wang
Zongtao He
Chengju Liu
Qi Chen
LRM
193
16
0
03 Feb 2023
RREx-BoT: Remote Referring Expressions with a Bag of Tricks
RREx-BoT: Remote Referring Expressions with a Bag of TricksIEEE/RJS International Conference on Intelligent RObots and Systems (IROS), 2023
Gunnar Sigurdsson
Jesse Thomason
Gaurav Sukhatme
Robinson Piramuthu
LM&Ro
280
14
0
30 Jan 2023
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
BEVBert: Multimodal Map Pre-training for Language-guided Navigation
Dongyan An
Yuankai Qi
Yangguang Li
Yan Huang
Liangsheng Wang
Tieniu Tan
Jing Shao
338
131
0
08 Dec 2022
Layout-aware Dreamer for Embodied Referring Expression Grounding
Layout-aware Dreamer for Embodied Referring Expression Grounding
Mingxiao Li
Zehao Wang
Tinne Tuytelaars
Marie-Francine Moens
LM&Ro
197
7
0
30 Nov 2022
Embodied Referring Expression for Manipulation Question Answering in
  Interactive Environment
Embodied Referring Expression for Manipulation Question Answering in Interactive EnvironmentIEEE International Conference on Robotics and Automation (ICRA), 2022
Qie Sima
Sinan Tan
Huaping Liu
LM&Ro
221
8
0
06 Oct 2022
Learning from Unlabeled 3D Environments for Vision-and-Language
  Navigation
Learning from Unlabeled 3D Environments for Vision-and-Language NavigationEuropean Conference on Computer Vision (ECCV), 2022
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
387
61
0
24 Aug 2022
Target-Driven Structured Transformer Planner for Vision-Language
  Navigation
Target-Driven Structured Transformer Planner for Vision-Language NavigationACM Multimedia (ACM MM), 2022
Yusheng Zhao
Jinyu Chen
Chen Gao
Wenguan Wang
Lirong Yang
Haibing Ren
Huaxia Xia
Si Liu
LM&Ro
482
82
0
19 Jul 2022
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future
  Directions
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future DirectionsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022
Jing Gu
Eliana Stefani
Qi Wu
Jesse Thomason
Xinze Wang
LM&Ro
437
170
0
22 Mar 2022
Think Global, Act Local: Dual-scale Graph Transformer for
  Vision-and-Language Navigation
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language NavigationComputer Vision and Pattern Recognition (CVPR), 2022
Shizhe Chen
Pierre-Louis Guhur
Makarand Tapaswi
Cordelia Schmid
Ivan Laptev
LM&Ro
389
238
0
23 Feb 2022
History Aware Multimodal Transformer for Vision-and-Language Navigation
History Aware Multimodal Transformer for Vision-and-Language Navigation
Shizhe Chen
Pierre-Louis Guhur
Cordelia Schmid
Ivan Laptev
LM&Ro
378
345
0
25 Oct 2021
Vision-Language Navigation: A Survey and Taxonomy
Vision-Language Navigation: A Survey and Taxonomy
Wansen Wu
Tao Chang
Xinmeng Li
LM&Ro
440
60
0
26 Aug 2021
1
Page 1 of 1