2405.16994
Cited By
Vision-and-Language Navigation Generative Pretrained Transformer
27 May 2024
Hanlin Wen
LM&Ro
Papers citing "Vision-and-Language Navigation Generative Pretrained Transformer"
3 / 3 papers shown
Title
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
265
4,229
0
30 Jan 2023
Language and Visual Entity Relationship Graph for Agent Navigation
Yicong Hong
Cristian Rodriguez-Opazo
Yuankai Qi
Qi Wu
Stephen Gould
LM&Ro
171
132
0
19 Oct 2020
Speaker-Follower Models for Vision-and-Language Navigation
Daniel Fried
Ronghang Hu
Volkan Cirik
Anna Rohrbach
Jacob Andreas
Louis-Philippe Morency
Taylor Berg-Kirkpatrick
Kate Saenko
Dan Klein
Trevor Darrell
LM&Ro
LRM
246
496
0
07 Jun 2018