ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.17138
  4. Cited By
SOON: Scenario Oriented Object Navigation with Graph-based Exploration
v1v2 (latest)

SOON: Scenario Oriented Object Navigation with Graph-based Exploration

Computer Vision and Pattern Recognition (CVPR), 2021
31 March 2021
Fengda Zhu
Xiwen Liang
Yi Zhu
Xiaojun Chang
Xiaodan Liang
ArXiv (abs)PDFHTML

Papers citing "SOON: Scenario Oriented Object Navigation with Graph-based Exploration"

50 / 91 papers shown
Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy
Large Language Models and 3D Vision for Intelligent Robotic Perception and AutonomyItalian National Conference on Sensors (INS), 2025
Vinit Mehta
Charu Sharma
Karthick Thiyagarajan
LM&Ro
431
5
0
14 Nov 2025
OpenVLN: Open-world Aerial Vision-Language Navigation
OpenVLN: Open-world Aerial Vision-Language Navigation
Peican Lin
Gan Sun
Chenxi Liu
Fazeng Li
Weihong Ren
Yang Cong
155
5
0
09 Nov 2025
NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation
NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation
Mingyu Jeong
Eunsung Kim
Sehun Park
Andrew Jaeyong Choi
156
0
0
28 Oct 2025
Embodied Navigation with Auxiliary Task of Action Description Prediction
Embodied Navigation with Auxiliary Task of Action Description Prediction
Haru Kondoh
Asako Kanezaki
188
2
0
21 Oct 2025
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
Peiran Xu
Xicheng Gong
Yadong Mu
196
5
0
18 Oct 2025
Walk and Read Less: Improving the Efficiency of Vision-and-Language Navigation via Tuning-Free Multimodal Token Pruning
Walk and Read Less: Improving the Efficiency of Vision-and-Language Navigation via Tuning-Free Multimodal Token Pruning
Wenda Qin
Andrea Burns
Bryan A. Plummer
Margrit Betke
AAML
314
0
0
18 Sep 2025
DialNav: Multi-turn Dialog Navigation with a Remote Guide
DialNav: Multi-turn Dialog Navigation with a Remote Guide
Leekyeung Han
Hyunji Min
Gyeom Hwangbo
Jonghyun Choi
Paul Hongsuck Seo
229
1
0
16 Sep 2025
GENNAV: Polygon Mask Generation for Generalized Referring Navigable Regions
GENNAV: Polygon Mask Generation for Generalized Referring Navigable Regions
Kei Katsumata
Yui Iioka
Naoki Hosomi
Teruhisa Misu
Kentaro Yamada
K. Sugiura
178
1
0
28 Aug 2025
Harnessing Input-Adaptive Inference for Efficient VLN
Harnessing Input-Adaptive Inference for Efficient VLN
Dongwoo Kang
Akhil Perincherry
Zachary Coalson
Aiden Gabriel
Stefan Lee
Sanghyun Hong
LM&Ro
225
0
0
12 Aug 2025
SkeNa: Learning to Navigate Unseen Environments Based on Abstract Hand-Drawn Maps
SkeNa: Learning to Navigate Unseen Environments Based on Abstract Hand-Drawn Maps
Haojun Xu
Jiaqi Xiang
Wu Wei
Jinyu Chen
Linqing Zhong
Linjiang Huang
Hongyu Yang
Si Liu
201
0
0
05 Aug 2025
Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation
Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation
Ruoyu Wang
Tong Yu
Junda Wu
Yao Liu
Julian McAuley
Lina Yao
288
5
0
18 Jun 2025
LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs
LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs
Xinyuan Zhang
Yonglin Tian
Fei Lin
Yue Liu
Jing Ma
Kornélia Sára Szatmáry
Fei Wang
508
11
0
06 May 2025
DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language Navigation
DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language NavigationInternational Conference on Multimedia Retrieval (ICMR), 2025
Yinfeng Yu
Dongsheng Yang
445
6
0
30 Apr 2025
Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation
Junrong Yue
Yanzhe Zhang
Chuan Qin
Jing Chen
Xiaomin Lie
Xinlei Yu
Wenxin Zhang
Zhendong Zhao
571
5
0
23 Apr 2025
Multimodal Perception for Goal-oriented Navigation: A Survey
Multimodal Perception for Goal-oriented Navigation: A Survey
I-Tak Ieong
Hao Tang
LM&RoLRM
434
1
0
22 Apr 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Multimodal Fusion and Vision-Language Models: A Survey for Robot VisionInformation Fusion (Inf. Fusion), 2025
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
595
58
0
03 Apr 2025
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Zike Yan
Qi Wu
Zhihua Wei
Qingbin Liu
636
6
0
31 Mar 2025
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation TasksIEEE transactions on multimedia (TMM), 2025
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Longteng Guo
Zhihua Wei
Qingbin Liu
LM&Ro
431
15
0
18 Mar 2025
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments
Dongping Li
Tielong Cai
Tianci Tang
Wenhao Chai
Katherine Rose Driggs-Campbell
Gaoang Wang
LM&Ro
674
2
0
11 Mar 2025
A Survey of Graph Transformers: Architectures, Theories and Applications
A Survey of Graph Transformers: Architectures, Theories and Applications
Chaohao Yuan
Kangfei Zhao
Ercan Engin Kuruoglu
Shu Wu
Qifeng Bai
Wenbing Huang
Deli Zhao
Hong Cheng
Yu Rong
552
17
0
23 Feb 2025
OpenBench: A New Benchmark and Baseline for Semantic Navigation in Smart Logistics
OpenBench: A New Benchmark and Baseline for Semantic Navigation in Smart LogisticsIEEE International Conference on Robotics and Automation (ICRA), 2025
Junhui Wang
Dongjie Huo
Zehui Xu
Yongliang Shi
Yimin Yan
Yuanxin Wang
Chao Gao
Yan Qiao
Guyue Zhou
513
5
0
13 Feb 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Joey Tianyi Zhou
Parisa Kordjamshidi
LRM
442
78
0
31 Dec 2024
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and MethodComputer Vision and Pattern Recognition (CVPR), 2024
Xinshuai Song
Weixing Chen
Wenshu Fan
Weikai Chen
Guanbin Li
Guanbin Li
683
58
0
12 Dec 2024
SAME: Learning Generic Language-Guided Visual Navigation with
  State-Adaptive Mixture of Experts
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts
Gengze Zhou
Yicong Hong
Zun Wang
Chongyang Zhao
Joey Tianyi Zhou
Qi Wu
220
11
0
07 Dec 2024
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans
AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans
Dillon Loh
Tomasz Bednarz
Xinxing Xia
Frank Guan
502
1
0
27 Nov 2024
The Wallpaper is Ugly: Indoor Localization using Vision and Language
The Wallpaper is Ugly: Indoor Localization using Vision and LanguageIEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023
Seth Pate
Lawson L. S. Wong
300
4
0
04 Oct 2024
MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for
  Multi-object Demand-driven Navigation
MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven NavigationNeural Information Processing Systems (NeurIPS), 2024
Hongcheng Wang
Peiqi Liu
Wenzhe Cai
Mingdong Wu
Zhengyu Qian
Hao Dong
395
7
0
04 Oct 2024
MiniVLN: Efficient Vision-and-Language Navigation by Progressive
  Knowledge Distillation
MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge DistillationIEEE International Conference on Robotics and Automation (ICRA), 2024
Junyou Zhu
Yanyuan Qiao
Siqi Zhang
Xingjian He
Qi Wu
Jing Liu
VLM
487
5
0
27 Sep 2024
Vision-Language Navigation with Continual Learning
Vision-Language Navigation with Continual Learning
Zhiyuan Li
Yanfeng Lv
Ziqin Tu
Di Shang
Hong Qiao
327
4
0
04 Sep 2024
Narrowing the Gap between Vision and Action in Navigation
Narrowing the Gap between Vision and Action in NavigationACM Multimedia (MM), 2024
Yue Zhang
Parisa Kordjamshidi
468
4
0
19 Aug 2024
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles
  Based on Open-Vocabulary Instructions
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions
Ryosuke Korekata
Kanta Kaneda
Shunya Nagashima
Yuto Imai
Komei Sugiura
ObjDLM&Ro
381
5
0
15 Aug 2024
Can ChatGPT assist visually impaired people with micro-navigation?
Can ChatGPT assist visually impaired people with micro-navigation?
Junxian He
Shrinivas J. Pundlik
Gang Luo
227
0
0
31 Jul 2024
Navigating Beyond Instructions: Vision-and-Language Navigation in
  Obstructed Environments
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
379
10
0
31 Jul 2024
Controllable Navigation Instruction Generation with Chain of Thought
  Prompting
Controllable Navigation Instruction Generation with Chain of Thought Prompting
Xianghao Kong
Jinyu Chen
Wenguan Wang
Hang Su
Xiaolin Hu
Yi Yang
Si Liu
LRM
365
21
0
10 Jul 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on
  Embodied AI
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Zehua Wang
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&RoSyDaAI4CE
810
257
0
09 Jul 2024
Human-centered In-building Embodied Delivery Benchmark
Human-centered In-building Embodied Delivery Benchmark
Zhuoqun Xu
Yang Liu
Xiaoqi Li
Jiyao Zhang
Hao Dong
411
2
0
25 Jun 2024
Why Only Text: Empowering Vision-and-Language Navigation with
  Multi-modal Prompts
Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Haodong Hong
Sen Wang
Zi Huang
Qi Wu
Jiajun Liu
270
10
0
04 Jun 2024
Augmented Commonsense Knowledge for Remote Object Grounding
Augmented Commonsense Knowledge for Remote Object Grounding
Bahram Mohammadi
Yicong Hong
Yuankai Qi
Qi Wu
Shirui Pan
Javen Qinfeng Shi
254
21
0
03 Jun 2024
Vision-and-Language Navigation via Causal Learning
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
336
56
0
16 Apr 2024
Lookahead Exploration with Neural Radiance Representation for Continuous
  Vision-Language Navigation
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language NavigationComputer Vision and Pattern Recognition (CVPR), 2024
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Junjie Hu
Ming Jiang
Shuqiang Jiang
278
66
0
02 Apr 2024
SG-PGM: Partial Graph Matching Network with Semantic Geometric Fusion
  for 3D Scene Graph Alignment and Its Downstream Tasks
SG-PGM: Partial Graph Matching Network with Semantic Geometric Fusion for 3D Scene Graph Alignment and Its Downstream Tasks
Yaxu Xie
A. Pagani
Didier Stricker
391
7
0
28 Mar 2024
IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot
  Navigation
IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot Navigation
Jiacui Huang
Hongtao Zhang
Mingbo Zhao
Zhou Wu
LM&Ro
293
7
0
28 Mar 2024
Temporal-Spatial Object Relations Modeling for Vision-and-Language
  Navigation
Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation
Bowen Huang
Yanwei Zheng
Chuanlin Lan
Xinpeng Zhao
Yifei Zou
Dongxiao Yu
374
1
0
23 Mar 2024
Prioritized Semantic Learning for Zero-shot Instance Navigation
Prioritized Semantic Learning for Zero-shot Instance NavigationEuropean Conference on Computer Vision (ECCV), 2024
Xander Sun
Louis Lau
Hoyard Zhi
Ronghe Qiu
Junwei Liang
304
40
0
18 Mar 2024
Hierarchical Spatial Proximity Reasoning for Vision-and-Language
  Navigation
Hierarchical Spatial Proximity Reasoning for Vision-and-Language NavigationIEEE Robotics and Automation Letters (RA-L), 2024
Ming Xu
Zilong Xie
325
4
0
18 Mar 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Haiwei Yang
Ruyue Yuan
LM&Ro
500
8
0
22 Feb 2024
NavHint: Vision and Language Navigation Agent with a Hint Generator
NavHint: Vision and Language Navigation Agent with a Hint Generator
Yue Zhang
Quan Guo
Parisa Kordjamshidi
LLMAG
347
13
0
04 Feb 2024
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human
  Preferences
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human PreferencesComputer Vision and Pattern Recognition (CVPR), 2023
Minyoung Hwang
Luca Weihs
Chanwoo Park
Kimin Lee
Aniruddha Kembhavi
Kiana Ehsani
283
28
0
14 Dec 2023
Towards Learning a Generalist Model for Embodied Navigation
Towards Learning a Generalist Model for Embodied NavigationComputer Vision and Pattern Recognition (CVPR), 2023
Duo Zheng
Shijia Huang
Lin Zhao
Yiwu Zhong
Liwei Wang
LM&Ro
791
145
0
04 Dec 2023
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Fast-Slow Test-Time Adaptation for Online Vision-and-Language NavigationInternational Conference on Machine Learning (ICML), 2023
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
651
22
0
22 Nov 2023
12
Next
Page 1 of 2