Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1606.05433
Cited By
v1
v2
v3
v4 (latest)
FVQA: Fact-based Visual Question Answering
17 June 2016
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FVQA: Fact-based Visual Question Answering"
50 / 241 papers shown
Discriminative Triad Matching and Reconstruction for Weakly Referring Expression Grounding
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Mingjie Sun
Jimin Xiao
Eng Gee Lim
Si Liu
John Y. Goulermas
ObjD
163
168
0
08 Jun 2021
Recent Advances and Trends in Multimodal Deep Learning: A Review
Jabeen Summaira
Xi Li
Amin Muhammad Shoib
Songyuan Li
Abdul Jabbar
HAI
340
71
0
24 May 2021
AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss
International Joint Conference on Artificial Intelligence (IJCAI), 2021
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Feng Ji
Ji Zhang
Marco Bertini
144
41
0
05 May 2021
A survey on VQA_Datasets and Approaches
Yeyun Zou
Qiyu Xie
277
21
0
02 May 2021
Cross-Modal Retrieval Augmentation for Multi-Modal Classification
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Shir Gur
Natalia Neverova
C. Stauffer
Ser-Nam Lim
Douwe Kiela
A. Reiter
217
36
0
16 Apr 2021
Towards General Purpose Vision Systems
Computer Vision and Pattern Recognition (CVPR), 2021
Tanmay Gupta
Amita Kamath
Aniruddha Kembhavi
Derek Hoiem
275
55
0
01 Apr 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Computer Vision and Pattern Recognition (CVPR), 2021
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
300
25
0
29 Mar 2021
Multi-Modal Answer Validation for Knowledge-Based VQA
AAAI Conference on Artificial Intelligence (AAAI), 2021
Jialin Wu
Jiasen Lu
Ashish Sabharwal
Roozbeh Mottaghi
377
167
0
23 Mar 2021
SLAKE: A Semantically-Labeled Knowledge-Enhanced Dataset for Medical Visual Question Answering
IEEE International Symposium on Biomedical Imaging (ISBI), 2021
Bo Liu
Li-Ming Zhan
Li Xu
Lin Ma
Y. Yang
Xiao-Ming Wu
255
438
0
18 Feb 2021
Reasoning over Vision and Language: Exploring the Benefits of Supplemental Knowledge
Violetta Shevchenko
Damien Teney
A. Dick
Anton Van Den Hengel
213
31
0
15 Jan 2021
Seeing is Knowing! Fact-based Visual Question Answering using Knowledge Graph Embeddings
Kiran Ramnath
M. Hasegawa-Johnson
214
11
0
31 Dec 2020
KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Computer Vision and Pattern Recognition (CVPR), 2020
Kenneth Marino
Xinlei Chen
Devi Parikh
Abhinav Gupta
Marcus Rohrbach
272
226
0
20 Dec 2020
Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Qingxing Cao
Bailin Li
Xiaodan Liang
Keze Wang
Liang Lin
222
48
0
14 Dec 2020
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
AAAI Conference on Artificial Intelligence (AAAI), 2020
Qi Zhu
Chenyu Gao
Peng Wang
Qi Wu
202
58
0
09 Dec 2020
Transformation Driven Visual Reasoning
Computer Vision and Pattern Recognition (CVPR), 2020
Xin Hong
Yanyan Lan
Liang Pang
Jiafeng Guo
Xueqi Cheng
LRM
181
25
0
26 Nov 2020
XTQA: Span-Level Explanations of the Textbook Question Answering
IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Jie Ma
Q. Zheng
Jun Liu
Qingyu Yin
Jianlong Zhou
Y. Huang
209
17
0
25 Nov 2020
Generating Natural Questions from Images for Multimodal Assistants
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Alkesh Patel
Sudarshan Ramanujam
Hadas Kotek
Christopher Klein
Jason D. Williams
VGen
184
10
0
17 Nov 2020
Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a Class-imbalance View
IEEE Transactions on Image Processing (TIP), 2020
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Q. Tian
Min Zhang
362
79
0
30 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Neurocomputing (Neurocomputing), 2020
Wei Chen
Weiping Wang
Tianpeng Liu
M. Lew
VLM
329
36
0
16 Oct 2020
That looks interesting! Personalizing Communication and Segmentation with Random Forest Node Embeddings
Weiwei Wang
Wiebke Eberhardt
Stefano Bromuri
165
1
0
13 Sep 2020
Cross-modal Knowledge Reasoning for Knowledge-based Visual Question Answering
Pattern Recognition (Pattern Recognit.), 2020
Jiahao Yu
Zihao Zhu
Yujing Wang
Weifeng Zhang
Yue Hu
Jianlong Tan
198
113
0
31 Aug 2020
A Dataset and Baselines for Visual Question Answering on Art
Noa Garcia
Chentao Ye
Zihua Liu
Qingtao Hu
Mayu Otani
Chenhui Chu
Yuta Nakashima
Teruko Mitamura
CoGe
157
65
0
28 Aug 2020
Knowledge Graph Extraction from Videos
Louis Mahon
Eleonora Giunchiglia
Bowen Li
Thomas Lukasiewicz
102
21
0
20 Jul 2020
Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions
European Conference on Computer Vision (ECCV), 2020
Noa Garcia
Yuta Nakashima
250
35
0
17 Jul 2020
Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
Zihao Zhu
Jiahao Yu
Yujing Wang
Yajing Sun
Yue Hu
Qi Wu
250
149
0
16 Jun 2020
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge
ACM Multimedia (ACM MM), 2020
Peng Wang
Dongyang Liu
Hui Li
Qi Wu
ObjD
217
22
0
02 Jun 2020
Structured Multimodal Attentions for TextVQA
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Chenyu Gao
Qi Zhu
Peng Wang
Hui Li
Yuliang Liu
Anton Van Den Hengel
Qi Wu
272
66
0
01 Jun 2020
Visuo-Linguistic Question Answering (VLQA) Challenge
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
CoGe
138
1
0
01 May 2020
Knowledge-Based Visual Question Answering in Videos
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
59
0
0
17 Apr 2020
An Entropy Clustering Approach for Assessing Visual Question Difficulty
IEEE Access (IEEE Access), 2020
K. Terao
Toru Tamaki
B. Raytchev
K. Kaneda
Shuníchi Satoh
OOD
AAML
304
1
0
12 Apr 2020
Understanding Knowledge Gaps in Visual Question Answering: Implications for Gap Identification and Testing
Goonmeet Bajaj
Bortik Bandyopadhyay
Daniela Schmidt
Pranav Maneriker
Christopher Myers
Srinivasan Parthasarathy
173
2
0
08 Apr 2020
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
Computer Vision and Pattern Recognition (CVPR), 2020
Difei Gao
Ke Li
Ruiping Wang
Shiguang Shan
Xilin Chen
212
126
0
31 Mar 2020
Linguistically Driven Graph Capsule Network for Visual Question Reasoning
Qingxing Cao
Xiaodan Liang
Keze Wang
Liang Lin
GNN
281
3
0
23 Mar 2020
Multilayer Dense Connections for Hierarchical Concept Classification
T. Parag
Hongcheng Wang
136
1
0
19 Mar 2020
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2020
Xinyu Wang
Yuliang Liu
Chunhua Shen
Chun Chet Ng
Canjie Luo
Lianwen Jin
C. Chan
Anton Van Den Hengel
Liangwei Wang
207
116
0
24 Feb 2020
Augmenting Visual Question Answering with Semantic Frame Information in a Multitask Learning Approach
International Computer Science Conference (ICSC), 2020
Mehrdad Alizadeh
Barbara Di Eugenio
119
3
0
31 Jan 2020
Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models
Pattern Recognition (Pattern Recognit.), 2020
M. Farazi
Salman H. Khan
Nick Barnes
204
18
0
20 Jan 2020
A Review on Intelligent Object Perception Methods Combining Knowledge-based Reasoning and Machine Learning
AAAI Spring Symposium Combining Machine Learning with Knowledge Engineering (CMLKE), 2019
Filippos Gouidis
Alexandros Vassiliades
Theodore Patkos
Antonis Argyros
Nick Bassiliades
Dimitris Plexousakis
OCL
174
12
0
26 Dec 2019
Multimodal Intelligence: Representation Learning, Information Fusion, and Applications
IEEE Journal on Selected Topics in Signal Processing (JSTSP), 2019
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAI
AI4TS
325
402
0
10 Nov 2019
KnowIT VQA: Answering Knowledge-Based Questions about Videos
AAAI Conference on Artificial Intelligence (AAAI), 2019
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
344
90
0
23 Oct 2019
Relational Graph Representation Learning for Open-Domain Question Answering
Sal Vivona
Kaveh Hassani
GNN
NAI
113
10
0
18 Oct 2019
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Eric Wang
Hongzhi Li
219
49
0
11 Oct 2019
Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
142
14
0
23 Sep 2019
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Difei Gao
Ruiping Wang
Shiguang Shan
Xilin Chen
CoGe
LRM
307
37
0
08 Aug 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question Answering
British Machine Vision Conference (BMVC), 2019
Cheng Zhang
Wei-Lun Chao
D. Xuan
182
57
0
28 Jul 2019
Bilinear Graph Networks for Visual Question Answering
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019
Dalu Guo
Chang Xu
Dacheng Tao
GNN
199
68
0
23 Jul 2019
Integrating Knowledge and Reasoning in Image Understanding
International Joint Conference on Artificial Intelligence (IJCAI), 2019
Somak Aditya
Yezhou Yang
Chitta Baral
OCL
140
47
0
24 Jun 2019
Adversarial Multimodal Network for Movie Question Answering
Zhaoquan Yuan
Siyuan Sun
Lixin Duan
Xiao Wu
Changsheng Xu
187
3
0
24 Jun 2019
A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions
Sashank Santhanam
Samira Shaikh
3DV
211
57
0
02 Jun 2019
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
Computer Vision and Pattern Recognition (CVPR), 2019
Kenneth Marino
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
673
1,378
0
31 May 2019
Previous
1
2
3
4
5
Next
Page 4 of 5