arXiv:2103.05568
Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering
9 March 2021
Aman Jain
Mayank Kothyari
Vishwajeet Kumar
P. Jyothi
Ganesh Ramakrishnan
Soumen Chakrabarti
Papers citing
"Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering"
5 / 5 papers shown
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
Yangning Li
Yinghui Li
Xinyu Wang
Yong-feng Jiang
Zhen Zhang
...
Hui Wang
Hai-Tao Zheng
Pengjun Xie
Philip S. Yu
Fei Huang
62
15
0
05 Nov 2024
COCO is "ALL" You Need for Visual Instruction Fine-tuning
Xiaotian Han
Yiqi Wang
Bohan Zhai
Quanzeng You
Hongxia Yang
VLM
MLLM
28
2
0
17 Jan 2024
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen
Hexiang Hu
Yi Luan
Haitian Sun
Soravit Changpinyo
Alan Ritter
Ming-Wei Chang
37
80
0
23 Feb 2023
Can Open Domain Question Answering Systems Answer Visual Knowledge Questions?
Jiawen Zhang
Abhijit Mishra
Avinesh P.V.S
Siddharth Patwardhan
Sachin Agarwal
24
0
0
09 Feb 2022
MoCA: Incorporating Multi-stage Domain Pretraining and Cross-guided Multimodal Attention for Textbook Question Answering
Fangzhi Xu
Qika Lin
J. Liu
Lingling Zhang
Tianzhe Zhao
Qianyi Chai
Yudai Pan
9
2
0
06 Dec 2021