Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.10447
Cited By
Explore and Tell: Embodied Visual Captioning in 3D Environments
21 August 2023
Anwen Hu
Shizhe Chen
Liang Zhang
Qin Jin
LM&Ro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Explore and Tell: Embodied Visual Captioning in 3D Environments"
4 / 4 papers shown
Title
Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions
Tommaso Galliena
Tommaso Apicella
Stefano Rosa
Pietro Morerio
Alessio Del Bue
Lorenzo Natale
32
0
0
11 Apr 2025
Self-Explainable Affordance Learning with Embodied Caption
Zhipeng Zhang
Zhimin Wei
Guolei Sun
Peng Wang
Luc Van Gool
40
2
0
08 Apr 2024
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,010
0
28 Jan 2022
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
250
922
0
24 Sep 2019
1