Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.00283
Cited By
Learning to Generate Grounded Visual Captions without Localization Supervision
1 June 2019
Chih-Yao Ma
Yannis Kalantidis
Ghassan AlRegib
Peter Vajda
Marcus Rohrbach
Z. Kira
SSL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning to Generate Grounded Visual Captions without Localization Supervision"
3 / 3 papers shown
Title
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Wenliang Dai
Zihan Liu
Ziwei Ji
Dan Su
Pascale Fung
MLLM
VLM
29
62
0
14 Oct 2022
Consensus Graph Representation Learning for Better Grounded Image Captioning
Wenqiao Zhang
Haochen Shi
Siliang Tang
Jun Xiao
Qiang Yu
Yueting Zhuang
15
53
0
02 Dec 2021
Neural Baby Talk
Jiasen Lu
Jianwei Yang
Dhruv Batra
Devi Parikh
VLM
191
434
0
27 Mar 2018
1