SOON: Scenario Oriented Object Navigation with Graph-based Exploration

Computer Vision and Pattern Recognition (CVPR), 2021

31 March 2021

Fengda Zhu

Xiwen Liang

Yi Zhu

Xiaojun Chang

Xiaodan Liang

Papers citing "SOON: Scenario Oriented Object Navigation with Graph-based Exploration"

50 / 91 papers shown

Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy

Italian National Conference on Sensors (INS), 2025

Vinit Mehta

Charu Sharma

Karthick Thiyagarajan

LM&Ro

431

14 Nov 2025

OpenVLN: Open-world Aerial Vision-Language Navigation

155

09 Nov 2025

NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation

156

28 Oct 2025

Embodied Navigation with Auxiliary Task of Action Description Prediction

Haru Kondoh

Asako Kanezaki

188

21 Oct 2025

NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation

Peiran Xu

Xicheng Gong

Yadong Mu

196

18 Oct 2025

Walk and Read Less: Improving the Efficiency of Vision-and-Language Navigation via Tuning-Free Multimodal Token Pruning

314

18 Sep 2025

DialNav: Multi-turn Dialog Navigation with a Remote Guide

229

16 Sep 2025

GENNAV: Polygon Mask Generation for Generalized Referring Navigable Regions

178

28 Aug 2025

Harnessing Input-Adaptive Inference for Efficient VLN

225

12 Aug 2025

SkeNa: Learning to Navigate Unseen Environments Based on Abstract Hand-Drawn Maps

201

05 Aug 2025

Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation

288

18 Jun 2025

LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs

Kornélia Sára Szatmáry

Fei Wang

508

06 May 2025

DOPE: Dual Object Perception-Enhancement Network for Vision-and-Language Navigation

International Conference on Multimedia Retrieval (ICMR), 2025

Yinfeng Yu

Dongsheng Yang

445

30 Apr 2025

Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation

571

23 Apr 2025

Multimodal Perception for Goal-oriented Navigation: A Survey

I-Tak Ieong

Hao Tang

LM&Ro LRM

434

22 Apr 2025

Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision

Information Fusion (Inf. Fusion), 2025

...

595

03 Apr 2025

COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation

636

31 Mar 2025

FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks

IEEE Transactions on Multimedia (TMM), 2025

431

18 Mar 2025

EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments

Katherine Rose Driggs-Campbell

Gaoang Wang

LM&Ro

674

11 Mar 2025

A Survey of Graph Transformers: Architectures, Theories and Applications

552

23 Feb 2025

OpenBench: A New Benchmark and Baseline for Semantic Navigation in Smart Logistics

IEEE International Conference on Robotics and Automation (ICRA), 2025

513

13 Feb 2025

Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models

442

31 Dec 2024

Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method

Computer Vision and Pattern Recognition (CVPR), 2024

683

12 Dec 2024

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

220

07 Dec 2024

AdaVLN: Towards Visual Language Navigation in Continuous Indoor Environments with Moving Humans

502

27 Nov 2024

The Wallpaper is Ugly: Indoor Localization using Vision and Language

IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), 2023

Seth Pate

Lawson L. S. Wong

300

04 Oct 2024

MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-object Demand-driven Navigation

Neural Information Processing Systems (NeurIPS), 2024

Hao Dong

395

04 Oct 2024

MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation

IEEE International Conference on Robotics and Automation (ICRA), 2024

Junyou Zhu

Yanyuan Qiao

Siqi Zhang

Xingjian He

Qi Wu

Jing Liu

VLM

487

27 Sep 2024

Vision-Language Navigation with Continual Learning

Zhiyuan Li

Yanfeng Lv

Ziqin Tu

Di Shang

Hong Qiao

327

04 Sep 2024

Narrowing the Gap between Vision and Action in Navigation

ACM Multimedia (MM), 2024

Yue Zhang

Parisa Kordjamshidi

468

19 Aug 2024

DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions

Komei Sugiura

381

15 Aug 2024

Can ChatGPT assist visually impaired people with micro-navigation?

Junxian He

Shrinivas J. Pundlik

Gang Luo

227

31 Jul 2024

Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments

379

31 Jul 2024

Controllable Navigation Instruction Generation with Chain of Thought Prompting

Xianghao Kong

Yi Yang

365

10 Jul 2024

Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI

Xiaodan Liang

Liang Lin

810

257

09 Jul 2024

Human-centered In-building Embodied Delivery Benchmark

Zhuoqun Xu

Yang Liu

Xiaoqi Li

Jiyao Zhang

Hao Dong

411

25 Jun 2024

Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts

Haodong Hong

Sen Wang

Zi Huang

Qi Wu

Jiajun Liu

270

04 Jun 2024

Augmented Commonsense Knowledge for Remote Object Grounding

Qi Wu

254

03 Jun 2024

Vision-and-Language Navigation via Causal Learning

336

16 Apr 2024

Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation

Computer Vision and Pattern Recognition (CVPR), 2024

278

02 Apr 2024

SG-PGM: Partial Graph Matching Network with Semantic Geometric Fusion for 3D Scene Graph Alignment and Its Downstream Tasks

Yaxu Xie

A. Pagani

Didier Stricker

391

28 Mar 2024

IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot Navigation

293

28 Mar 2024

Temporal-Spatial Object Relations Modeling for Vision-and-Language Navigation

374

23 Mar 2024

Prioritized Semantic Learning for Zero-shot Instance Navigation

European Conference on Computer Vision (ECCV), 2024

304

18 Mar 2024

Hierarchical Spatial Proximity Reasoning for Vision-and-Language Navigation

IEEE Robotics and Automation Letters (RA-L), 2024

Ming Xu

Zilong Xie

325

18 Mar 2024

Vision-Language Navigation with Embodied Intelligence: A Survey

500

22 Feb 2024

NavHint: Vision and Language Navigation Agent with a Hint Generator

Yue Zhang

Quan Guo

Parisa Kordjamshidi

LLMAG

347

04 Feb 2024

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Computer Vision and Pattern Recognition (CVPR), 2023

283

14 Dec 2023

Towards Learning a Generalist Model for Embodied Navigation

Computer Vision and Pattern Recognition (CVPR), 2023

791

145

04 Dec 2023

Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation

International Conference on Machine Learning (ICML), 2023

651

22 Nov 2023