Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.15487
Cited By
Tackling VQA with Pretrained Foundation Models without Further Training
27 September 2023
Alvin De Jun Tan
Bingquan Shen
MLLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Tackling VQA with Pretrained Foundation Models without Further Training"
3 / 3 papers shown
Title
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
244
4,186
0
30 Jan 2023
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA
Zhengyuan Yang
Zhe Gan
Jianfeng Wang
Xiaowei Hu
Yumao Lu
Zicheng Liu
Lijuan Wang
169
401
0
10 Sep 2021
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Mohit Bansal
MLLM
249
518
0
04 Feb 2021
1