Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.16713
Cited By
Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering
29 June 2023
A. S. Penamakuri
Manish Gupta
Mithun Das Gupta
Anand Mishra
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Answer Mining from a Pool of Images: Towards Retrieval-Based Visual Question Answering"
3 / 3 papers shown
Title
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Mohit Bansal
MLLM
249
525
0
04 Feb 2021
Unified Vision-Language Pre-Training for Image Captioning and VQA
Luowei Zhou
Hamid Palangi
Lei Zhang
Houdong Hu
Jason J. Corso
Jianfeng Gao
MLLM
VLM
250
927
0
24 Sep 2019
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
144
1,464
0
06 Jun 2016
1