Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2303.05068
Cited By
Toward Unsupervised Realistic Visual Question Answering
9 March 2023
Yuwei Zhang
Chih-Hui Ho
Nuno Vasconcelos
CoGe
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Toward Unsupervised Realistic Visual Question Answering"
6 / 6 papers shown
Title
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
382
4,010
0
28 Jan 2022
Survey: Transformer based Video-Language Pre-training
Ludan Ruan
Qin Jin
VLM
ViT
61
44
0
21 Sep 2021
Counterfactual Zero-Shot and Open-Set Visual Recognition
Zhongqi Yue
Tan Wang
Hanwang Zhang
Qianru Sun
Xiansheng Hua
BDL
142
189
0
01 Mar 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD
VLM
252
157
0
02 Jan 2021
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
250
922
0
24 Sep 2019
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Balaji Lakshminarayanan
Alexander Pritzel
Charles Blundell
UQCV
BDL
268
5,635
0
05 Dec 2016
1