ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1606.05433
  4. Cited By
FVQA: Fact-based Visual Question Answering
v1v2v3v4 (latest)

FVQA: Fact-based Visual Question Answering

17 June 2016
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
    CoGe
ArXiv (abs)PDFHTML

Papers citing "FVQA: Fact-based Visual Question Answering"

50 / 241 papers shown
Discriminative Triad Matching and Reconstruction for Weakly Referring
  Expression Grounding
Discriminative Triad Matching and Reconstruction for Weakly Referring Expression GroundingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Mingjie Sun
Jimin Xiao
Eng Gee Lim
Si Liu
John Y. Goulermas
ObjD
163
168
0
08 Jun 2021
Recent Advances and Trends in Multimodal Deep Learning: A Review
Recent Advances and Trends in Multimodal Deep Learning: A Review
Jabeen Summaira
Xi Li
Amin Muhammad Shoib
Songyuan Li
Abdul Jabbar
HAI
340
71
0
24 May 2021
AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss
AdaVQA: Overcoming Language Priors with Adapted Margin Cosine LossInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Feng Ji
Ji Zhang
Marco Bertini
144
41
0
05 May 2021
A survey on VQA_Datasets and Approaches
A survey on VQA_Datasets and Approaches
Yeyun Zou
Qiyu Xie
277
21
0
02 May 2021
Cross-Modal Retrieval Augmentation for Multi-Modal Classification
Cross-Modal Retrieval Augmentation for Multi-Modal ClassificationConference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Shir Gur
Natalia Neverova
C. Stauffer
Ser-Nam Lim
Douwe Kiela
A. Reiter
217
36
0
16 Apr 2021
Towards General Purpose Vision Systems
Towards General Purpose Vision SystemsComputer Vision and Pattern Recognition (CVPR), 2021
Tanmay Gupta
Amita Kamath
Aniruddha Kembhavi
Derek Hoiem
275
55
0
01 Apr 2021
Domain-robust VQA with diverse datasets and methods but no target labels
Domain-robust VQA with diverse datasets and methods but no target labelsComputer Vision and Pattern Recognition (CVPR), 2021
Ruotong Wang
Tristan D. Maidment
Ahmad Diab
Adriana Kovashka
R. Hwa
OOD
300
25
0
29 Mar 2021
Multi-Modal Answer Validation for Knowledge-Based VQA
Multi-Modal Answer Validation for Knowledge-Based VQAAAAI Conference on Artificial Intelligence (AAAI), 2021
Jialin Wu
Jiasen Lu
Ashish Sabharwal
Roozbeh Mottaghi
377
167
0
23 Mar 2021
SLAKE: A Semantically-Labeled Knowledge-Enhanced Dataset for Medical
  Visual Question Answering
SLAKE: A Semantically-Labeled Knowledge-Enhanced Dataset for Medical Visual Question AnsweringIEEE International Symposium on Biomedical Imaging (ISBI), 2021
Bo Liu
Li-Ming Zhan
Li Xu
Lin Ma
Y. Yang
Xiao-Ming Wu
255
438
0
18 Feb 2021
Reasoning over Vision and Language: Exploring the Benefits of
  Supplemental Knowledge
Reasoning over Vision and Language: Exploring the Benefits of Supplemental Knowledge
Violetta Shevchenko
Damien Teney
A. Dick
Anton Van Den Hengel
213
31
0
15 Jan 2021
Seeing is Knowing! Fact-based Visual Question Answering using Knowledge
  Graph Embeddings
Seeing is Knowing! Fact-based Visual Question Answering using Knowledge Graph Embeddings
Kiran Ramnath
M. Hasegawa-Johnson
214
11
0
31 Dec 2020
KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain
  Knowledge-Based VQA
KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQAComputer Vision and Pattern Recognition (CVPR), 2020
Kenneth Marino
Xinlei Chen
Devi Parikh
Abhinav Gupta
Marcus Rohrbach
272
226
0
20 Dec 2020
Knowledge-Routed Visual Question Reasoning: Challenges for Deep
  Representation Embedding
Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation EmbeddingIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Qingxing Cao
Bailin Li
Xiaodan Liang
Keze Wang
Liang Lin
222
48
0
14 Dec 2020
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCapsAAAI Conference on Artificial Intelligence (AAAI), 2020
Qi Zhu
Chenyu Gao
Peng Wang
Qi Wu
202
58
0
09 Dec 2020
Transformation Driven Visual Reasoning
Transformation Driven Visual ReasoningComputer Vision and Pattern Recognition (CVPR), 2020
Xin Hong
Yanyan Lan
Liang Pang
Jiafeng Guo
Xueqi Cheng
LRM
181
25
0
26 Nov 2020
XTQA: Span-Level Explanations of the Textbook Question Answering
XTQA: Span-Level Explanations of the Textbook Question AnsweringIEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS), 2020
Jie Ma
Q. Zheng
Jun Liu
Qingyu Yin
Jianlong Zhou
Y. Huang
209
17
0
25 Nov 2020
Generating Natural Questions from Images for Multimodal Assistants
Generating Natural Questions from Images for Multimodal AssistantsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
Alkesh Patel
Sudarshan Ramanujam
Hadas Kotek
Christopher Klein
Jason D. Williams
VGen
184
10
0
17 Nov 2020
Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a
  Class-imbalance View
Loss re-scaling VQA: Revisiting the LanguagePrior Problem from a Class-imbalance ViewIEEE Transactions on Image Processing (TIP), 2020
Yangyang Guo
Liqiang Nie
Zhiyong Cheng
Q. Tian
Min Zhang
362
79
0
30 Oct 2020
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
New Ideas and Trends in Deep Multimodal Content Understanding: A ReviewNeurocomputing (Neurocomputing), 2020
Wei Chen
Weiping Wang
Tianpeng Liu
M. Lew
VLM
329
36
0
16 Oct 2020
That looks interesting! Personalizing Communication and Segmentation
  with Random Forest Node Embeddings
That looks interesting! Personalizing Communication and Segmentation with Random Forest Node Embeddings
Weiwei Wang
Wiebke Eberhardt
Stefano Bromuri
165
1
0
13 Sep 2020
Cross-modal Knowledge Reasoning for Knowledge-based Visual Question
  Answering
Cross-modal Knowledge Reasoning for Knowledge-based Visual Question AnsweringPattern Recognition (Pattern Recognit.), 2020
Jiahao Yu
Zihao Zhu
Yujing Wang
Weifeng Zhang
Yue Hu
Jianlong Tan
198
113
0
31 Aug 2020
A Dataset and Baselines for Visual Question Answering on Art
A Dataset and Baselines for Visual Question Answering on Art
Noa Garcia
Chentao Ye
Zihua Liu
Qingtao Hu
Mayu Otani
Chenhui Chu
Yuta Nakashima
Teruko Mitamura
CoGe
157
65
0
28 Aug 2020
Knowledge Graph Extraction from Videos
Knowledge Graph Extraction from Videos
Louis Mahon
Eleonora Giunchiglia
Bowen Li
Thomas Lukasiewicz
102
21
0
20 Jul 2020
Knowledge-Based Video Question Answering with Unsupervised Scene
  Descriptions
Knowledge-Based Video Question Answering with Unsupervised Scene DescriptionsEuropean Conference on Computer Vision (ECCV), 2020
Noa Garcia
Yuta Nakashima
250
35
0
17 Jul 2020
Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual
  Question Answering
Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
Zihao Zhu
Jiahao Yu
Yujing Wang
Yajing Sun
Yue Hu
Qi Wu
250
149
0
16 Jun 2020
Give Me Something to Eat: Referring Expression Comprehension with
  Commonsense Knowledge
Give Me Something to Eat: Referring Expression Comprehension with Commonsense KnowledgeACM Multimedia (ACM MM), 2020
Peng Wang
Dongyang Liu
Hui Li
Qi Wu
ObjD
217
22
0
02 Jun 2020
Structured Multimodal Attentions for TextVQA
Structured Multimodal Attentions for TextVQAIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Chenyu Gao
Qi Zhu
Peng Wang
Hui Li
Yuliang Liu
Anton Van Den Hengel
Qi Wu
272
66
0
01 Jun 2020
Visuo-Linguistic Question Answering (VLQA) Challenge
Visuo-Linguistic Question Answering (VLQA) Challenge
Shailaja Keyur Sampat
Yezhou Yang
Chitta Baral
CoGe
138
1
0
01 May 2020
Knowledge-Based Visual Question Answering in Videos
Knowledge-Based Visual Question Answering in Videos
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
59
0
0
17 Apr 2020
An Entropy Clustering Approach for Assessing Visual Question Difficulty
An Entropy Clustering Approach for Assessing Visual Question DifficultyIEEE Access (IEEE Access), 2020
K. Terao
Toru Tamaki
B. Raytchev
K. Kaneda
Shuníchi Satoh
OODAAML
304
1
0
12 Apr 2020
Understanding Knowledge Gaps in Visual Question Answering: Implications
  for Gap Identification and Testing
Understanding Knowledge Gaps in Visual Question Answering: Implications for Gap Identification and Testing
Goonmeet Bajaj
Bortik Bandyopadhyay
Daniela Schmidt
Pranav Maneriker
Christopher Myers
Srinivasan Parthasarathy
173
2
0
08 Apr 2020
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene
  Text
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene TextComputer Vision and Pattern Recognition (CVPR), 2020
Difei Gao
Ke Li
Ruiping Wang
Shiguang Shan
Xilin Chen
212
126
0
31 Mar 2020
Linguistically Driven Graph Capsule Network for Visual Question
  Reasoning
Linguistically Driven Graph Capsule Network for Visual Question Reasoning
Qingxing Cao
Xiaodan Liang
Keze Wang
Liang Lin
GNN
281
3
0
23 Mar 2020
Multilayer Dense Connections for Hierarchical Concept Classification
Multilayer Dense Connections for Hierarchical Concept Classification
T. Parag
Hongcheng Wang
136
1
0
19 Mar 2020
On the General Value of Evidence, and Bilingual Scene-Text Visual
  Question Answering
On the General Value of Evidence, and Bilingual Scene-Text Visual Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2020
Xinyu Wang
Yuliang Liu
Chunhua Shen
Chun Chet Ng
Canjie Luo
Lianwen Jin
C. Chan
Anton Van Den Hengel
Liangwei Wang
207
116
0
24 Feb 2020
Augmenting Visual Question Answering with Semantic Frame Information in
  a Multitask Learning Approach
Augmenting Visual Question Answering with Semantic Frame Information in a Multitask Learning ApproachInternational Computer Science Conference (ICSC), 2020
Mehrdad Alizadeh
Barbara Di Eugenio
119
3
0
31 Jan 2020
Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models
Accuracy vs. Complexity: A Trade-off in Visual Question Answering ModelsPattern Recognition (Pattern Recognit.), 2020
M. Farazi
Salman H. Khan
Nick Barnes
204
18
0
20 Jan 2020
A Review on Intelligent Object Perception Methods Combining
  Knowledge-based Reasoning and Machine Learning
A Review on Intelligent Object Perception Methods Combining Knowledge-based Reasoning and Machine LearningAAAI Spring Symposium Combining Machine Learning with Knowledge Engineering (CMLKE), 2019
Filippos Gouidis
Alexandros Vassiliades
Theodore Patkos
Antonis Argyros
Nick Bassiliades
Dimitris Plexousakis
OCL
174
12
0
26 Dec 2019
Multimodal Intelligence: Representation Learning, Information Fusion,
  and Applications
Multimodal Intelligence: Representation Learning, Information Fusion, and ApplicationsIEEE Journal on Selected Topics in Signal Processing (JSTSP), 2019
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
HAIAI4TS
325
402
0
10 Nov 2019
KnowIT VQA: Answering Knowledge-Based Questions about Videos
KnowIT VQA: Answering Knowledge-Based Questions about VideosAAAI Conference on Artificial Intelligence (AAAI), 2019
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
344
90
0
23 Oct 2019
Relational Graph Representation Learning for Open-Domain Question
  Answering
Relational Graph Representation Learning for Open-Domain Question Answering
Sal Vivona
Kaveh Hassani
GNNNAI
113
10
0
18 Oct 2019
Multi-modal Deep Analysis for Multimedia
Multi-modal Deep Analysis for Multimedia
Wenwu Zhu
Xin Eric Wang
Hongzhi Li
219
49
0
11 Oct 2019
Explainable High-order Visual Question Reasoning: A New Benchmark and
  Knowledge-routed Network
Explainable High-order Visual Question Reasoning: A New Benchmark and Knowledge-routed Network
Qingxing Cao
Bailin Li
Xiaodan Liang
Liang Lin
142
14
0
23 Sep 2019
CRIC: A VQA Dataset for Compositional Reasoning on Vision and
  Commonsense
CRIC: A VQA Dataset for Compositional Reasoning on Vision and CommonsenseIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019
Difei Gao
Ruiping Wang
Shiguang Shan
Xilin Chen
CoGeLRM
307
37
0
08 Aug 2019
An Empirical Study on Leveraging Scene Graphs for Visual Question
  Answering
An Empirical Study on Leveraging Scene Graphs for Visual Question AnsweringBritish Machine Vision Conference (BMVC), 2019
Cheng Zhang
Wei-Lun Chao
D. Xuan
182
57
0
28 Jul 2019
Bilinear Graph Networks for Visual Question Answering
Bilinear Graph Networks for Visual Question AnsweringIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2019
Dalu Guo
Chang Xu
Dacheng Tao
GNN
199
68
0
23 Jul 2019
Integrating Knowledge and Reasoning in Image Understanding
Integrating Knowledge and Reasoning in Image UnderstandingInternational Joint Conference on Artificial Intelligence (IJCAI), 2019
Somak Aditya
Yezhou Yang
Chitta Baral
OCL
140
47
0
24 Jun 2019
Adversarial Multimodal Network for Movie Question Answering
Zhaoquan Yuan
Siyuan Sun
Lixin Duan
Xiao Wu
Changsheng Xu
187
3
0
24 Jun 2019
A Survey of Natural Language Generation Techniques with a Focus on
  Dialogue Systems - Past, Present and Future Directions
A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions
Sashank Santhanam
Samira Shaikh
3DV
211
57
0
02 Jun 2019
OK-VQA: A Visual Question Answering Benchmark Requiring External
  Knowledge
OK-VQA: A Visual Question Answering Benchmark Requiring External KnowledgeComputer Vision and Pattern Recognition (CVPR), 2019
Kenneth Marino
Mohammad Rastegari
Ali Farhadi
Roozbeh Mottaghi
673
1,378
0
31 May 2019
Previous
12345
Next
Page 4 of 5