ResearchTrend.AI
FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation

9 June 2022
Zi-Yi Dou
Nanyun Peng
arXiv: 2206.04294

Papers citing "FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation"

23 / 23 papers shown
Title
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks
Siqi Zhang
Yanyuan Qiao
Qunbo Wang
Longteng Guo
Zhihua Wei
J. Liu
LM&Ro
74
0
0
18 Mar 2025
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
Yue Zhang
Ziqiao Ma
Jialu Li
Yanyuan Qiao
Zun Wang
J. Chai
Qi Wu
Mohit Bansal
Parisa Kordjamshidi
LRM
51
18
0
31 Dec 2024
NaVIP: An Image-Centric Indoor Navigation Solution for Visually Impaired People
Jun Yu
Yifan Zhang
Badrinadh Aila
V. Namboodiri
28
1
0
08 Oct 2024
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Gengze Zhou
Yicong Hong
Zun Wang
Xin Eric Wang
Qi Wu
LM&Ro
42
18
0
17 Jul 2024
Controllable Navigation Instruction Generation with Chain of Thought Prompting
Xianghao Kong
Jinyu Chen
Wenguan Wang
Hang Su
Xiaolin Hu
Yi Yang
Si Liu
LRM
29
4
0
10 Jul 2024
MAGIC: Meta-Ability Guided Interactive Chain-of-Distillation for Effective-and-Efficient Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Mengjiao Shen
Jingwei Yang
Chengju Liu
Qijun Chen
VLM
21
2
0
25 Jun 2024
Vision-and-Language Navigation via Causal Learning
Liuyi Wang
Zongtao He
Ronghao Dang
Mengjiao Shen
Chengju Liu
Qijun Chen
CML
44
13
0
16 Apr 2024
Causality-based Cross-Modal Representation Learning for Vision-and-Language Navigation
Liuyi Wang
Zongtao He
Ronghao Dang
Huiyi Chen
Chengju Liu
Qi Chen
28
1
0
06 Mar 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li
Aishwarya Padmakumar
Gaurav Sukhatme
Mohit Bansal
24
6
0
05 Feb 2024
Scaling Data Generation in Vision-and-Language Navigation
Zun Wang
Jialu Li
Yicong Hong
Yi Wang
Qi Wu
Mohit Bansal
Stephen Gould
Hao Tan
Yu Qiao
LM&Ro
16
54
0
28 Jul 2023
Kefa: A Knowledge Enhanced and Fine-grained Aligned Speaker for Navigation Instruction Generation
Haitian Zeng
Xiaohan Wang
Wenguan Wang
Yi Yang
8
7
0
25 Jul 2023
GridMM: Grid Memory Map for Vision-and-Language Navigation
Zihan Wang
Xiangyang Li
Jiahao Yang
Yeqi Liu
Shuqiang Jiang
21
50
0
24 Jul 2023
Masked Path Modeling for Vision-and-Language Navigation
Zi-Yi Dou
Feng Gao
Nanyun Peng
LM&Ro
19
3
0
23 May 2023
PASTS: Progress-Aware Spatio-Temporal Transformer Speaker For Vision-and-Language Navigation
Liuyi Wang
Chengju Liu
Zongtao He
Shu Li
Qingqing Yan
Huiyi Chen
Qi Chen
19
9
0
19 May 2023
Lana: A Language-Capable Navigator for Instruction Following and Generation
Xiaohan Wang
Wenguan Wang
Jiayi Shao
Yi Yang
LLMAG
LM&Ro
36
37
0
15 Mar 2023
VLN-Trans: Translator for the Vision and Language Navigation Agent
Yue Zhang
Parisa Kordjamshidi
25
16
0
18 Feb 2023
TEACh: Task-driven Embodied Agents that Chat
Aishwarya Padmakumar
Jesse Thomason
Ayush Shrivastava
P. Lange
Anjali Narayan-Chen
Spandana Gella
Robinson Piramuthu
Gökhan Tür
Dilek Z. Hakkani-Tür
LM&Ro
152
180
0
01 Oct 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Mohit Bansal
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
185
403
0
13 Jul 2021
On the Evaluation of Vision-and-Language Navigation Instructions
Mingde Zhao
Peter Anderson
Vihan Jain
Su Wang
Alexander Ku
Jason Baldridge
Eugene Ie
231
50
0
26 Jan 2021
Language and Visual Entity Relationship Graph for Agent Navigation
Yicong Hong
Cristian Rodriguez-Opazo
Yuankai Qi
Qi Wu
Stephen Gould
LM&Ro
169
131
0
19 Oct 2020
Meta Pseudo Labels
Hieu H. Pham
Zihang Dai
Qizhe Xie
Minh-Thang Luong
Quoc V. Le
VLM
245
655
0
23 Mar 2020
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
246
495
0
07 Jun 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
243
11,659
0
09 Mar 2017