Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2305.17102
Cited By
GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation
26 May 2023
Jingyang Huo
Qiang Sun
Boyan Jiang
Haitao Lin
Yanwei Fu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation"
18 / 18 papers shown
Title
ST-Booster: An Iterative SpatioTemporal Perception Booster for Vision-and-Language Navigation in Continuous Environments
Lu Yue
Dongliang Zhou
Liang Xie
Erwei Yin
Feitian Zhang
34
0
0
14 Apr 2025
Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation
Yongchao Feng
Yajie Liu
Shuai Yang
Wenrui Cai
J. Zhang
...
Jiahui Lv
Z. Liu
Tengyuan Shi
Qingjie Liu
Y. Wang
MLLM
VLM
55
1
0
13 Apr 2025
COSMO: Combination of Selective Memorization for Low-cost Vision-and-Language Navigation
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Zike Yan
Qi Wu
Zhihua Wei
J. Liu
48
0
0
31 Mar 2025
Object-centric proto-symbolic behavioural reasoning from pixels
R. S. V. Bergen
Justus F. Hübotter
Pablo Lanillos
LM&Ro
OCL
70
0
0
26 Nov 2024
NaVIP: An Image-Centric Indoor Navigation Solution for Visually Impaired People
Jun Yu
Yifan Zhang
Badrinadh Aila
V. Namboodiri
28
1
0
08 Oct 2024
MiniVLN: Efficient Vision-and-Language Navigation by Progressive Knowledge Distillation
Junyou Zhu
Yanyuan Qiao
Siqi Zhang
Xingjian He
Qi Wu
Jing Liu
VLM
19
1
0
27 Sep 2024
Polaris: Open-ended Interactive Robotic Manipulation via Syn2Real Visual Grounding and Large Language Models
Tianyu Wang
Haitao Lin
Junqiu Yu
Yanwei Fu
LM&Ro
20
3
0
15 Aug 2024
PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation
Renjie Lu
Jingke Meng
Wei-Shi Zheng
23
3
0
16 Jul 2024
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Mengjiao Shen
Jingwei Yang
Chengju Liu
Qijun Chen
VLM
21
2
0
25 Jun 2024
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
39
13
0
16 Apr 2024
OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation
Ganlong Zhao
Guanbin Li
Weikai Chen
Yizhou Yu
27
4
0
26 Mar 2024
Volumetric Environment Representation for Vision-Language Navigation
Rui Liu
Wenguan Wang
Yi Yang
32
23
0
21 Mar 2024
Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation
X. Lei
Min Wang
Wen-gang Zhou
Li Li
Houqiang Li
30
5
0
25 Feb 2024
CLIP feature-based randomized control using images and text for multiple tasks and robots
Kazuki Shibata
Hideki Deguchi
Shun Taguchi
21
1
0
18 Jan 2024
Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation
Junyu Gao
Xuan Yao
Changsheng Xu
TTA
20
3
0
22 Nov 2023
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Mohit Bansal
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
185
403
0
13 Jul 2021
Language and Visual Entity Relationship Graph for Agent Navigation
Yicong Hong
Cristian Rodriguez-Opazo
Yuankai Qi
Qi Wu
Stephen Gould
LM&Ro
169
131
0
19 Oct 2020
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
246
495
0
07 Jun 2018
1