Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1606.05433
Cited By
v1
v2
v3
v4 (latest)
FVQA: Fact-based Visual Question Answering
17 June 2016
Peng Wang
Qi Wu
Chunhua Shen
Anton van den Hengel
A. Dick
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FVQA: Fact-based Visual Question Answering"
50 / 241 papers shown
Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jialin Wu
Raymond J. Mooney
RALM
244
13
0
18 Oct 2022
COFAR: Commonsense and Factual Reasoning in Image Search
Prajwal Gatti
A. S. Penamakuri
Revant Teotia
Anand Mishra
Shubhashis Sengupta
Roshni Ramnani
ReLM
LRM
147
4
0
16 Oct 2022
TransAlign: Fully Automatic and Effective Entity Alignment for Knowledge Graphs
Rui Zhang
Xiaoyan Zhao
Bayu Distiawan Trisedya
Min Yang
Hong Cheng
Jianzhong Qi
102
0
0
16 Oct 2022
Learning by Asking Questions for Knowledge-based Novel Object Recognition
International Journal of Computer Vision (IJCV), 2022
Kohei Uehara
Tatsuya Harada
194
2
0
12 Oct 2022
Retrieval Augmented Visual Question Answering with Outside Knowledge
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Weizhe Lin
Bill Byrne
RALM
236
111
0
07 Oct 2022
A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Chaoqi Chen
Yushuang Wu
Qiyuan Dai
Hong-Yu Zhou
Mutian Xu
Sibei Yang
Xiaoguang Han
Yizhou Yu
ViT
MedIm
AI4CE
379
133
0
27 Sep 2022
CLEVR-Math: A Dataset for Compositional Language, Visual and Mathematical Reasoning
International Workshop on Neural-Symbolic Learning and Reasoning (NeSy), 2022
Adam Dahlgren Lindström
Savitha Sam Abraham
120
91
0
10 Aug 2022
LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection
Zhuo Chen
Yufen Huang
Jiaoyan Chen
Yuxia Geng
Yin Fang
Jeff Z. Pan
Ningyu Zhang
Wen Zhang
235
50
0
26 Jul 2022
Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem
Yudong Han
Liqiang Nie
Jianhua Yin
Yue Yu
Yan Yan
240
23
0
24 Jul 2022
Semantic-aware Modular Capsule Routing for Visual Question Answering
IEEE Transactions on Image Processing (IEEE TIP), 2022
Yudong Han
Jianhua Yin
Yue Yu
Yin-wei Wei
Liqiang Nie
191
11
0
21 Jul 2022
A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA
ACM Multimedia (ACM MM), 2022
Yangyang Guo
Liqiang Nie
Yongkang Wong
Zichen Liu
Zhiyong Cheng
Mohan S. Kankanhalli
198
51
0
30 Jun 2022
cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
International Conference on Pattern Recognition (ICPR), 2022
Kshitij Gupta
Devansh Gautam
R. Mamidi
VLM
303
4
0
07 Jun 2022
A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge
European Conference on Computer Vision (ECCV), 2022
Dustin Schwenk
Apoorv Khandelwal
Christopher Clark
Kenneth Marino
Roozbeh Mottaghi
384
764
0
03 Jun 2022
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
Neural Information Processing Systems (NeurIPS), 2022
Yuanze Lin
Yujia Xie
Dongdong Chen
Yichong Xu
Chenguang Zhu
Lu Yuan
319
98
0
02 Jun 2022
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Zihan Zhao
Lu Chen
Ruisheng Cao
Hongshen Xu
Xingyu Chen
Kai Yu
200
9
0
13 May 2022
Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Y. Heo
Eun-Sol Kim
Woo Suk Choi
Byoung-Tak Zhang
142
35
0
22 Apr 2022
Attention Mechanism based Cognition-level Scene Understanding
Xuejiao Tang
Tai Le Quy
LRM
339
0
0
17 Apr 2022
Reasoning with Multi-Structure Commonsense Knowledge in Visual Dialog
Shunyu Zhang
X. Jiang
Zequn Yang
T. Wan
Zengchang Qin
164
14
0
10 Apr 2022
Learning Commonsense-aware Moment-Text Alignment for Fast Video Temporal Grounding
Ziyue Wu
Junyu Gao
Shucheng Huang
Changsheng Xu
239
6
0
04 Apr 2022
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
Computer Vision and Pattern Recognition (CVPR), 2022
Manuel Kolmet
Qunjie Zhou
Aljosa Osep
Laura Leal-Taixe
293
40
0
28 Mar 2022
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering
Computer Vision and Pattern Recognition (CVPR), 2022
Yang Ding
Jing Yu
Bangchang Liu
Yue Hu
Mingxin Cui
Qi Wu
171
81
0
17 Mar 2022
K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Kohei Uehara
Tatsuya Harada
153
14
0
15 Mar 2022
Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering
AAAI Conference on Artificial Intelligence (AAAI), 2022
Mingxiao Li
Marie-Francine Moens
216
19
0
06 Mar 2022
Joint Answering and Explanation for Visual Commonsense Reasoning
IEEE Transactions on Image Processing (IEEE TIP), 2022
Zhenyang Li
Yangyang Guo
Ke-Jyun Wang
Yin-wei Wei
Liqiang Nie
Mohan S. Kankanhalli
255
26
0
25 Feb 2022
A Review on Methods and Applications in Multimodal Deep Learning
Summaira Jabeen
Xi Li
Muhammad Shoib Amin
Abdul Jabbar
VLM
HAI
213
149
0
18 Feb 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2022
Xiangru Zhu
Zhixu Li
Xiaodan Wang
Xueyao Jiang
Yixiang Chen
Xuwu Wang
Yanghua Xiao
N. Yuan
210
237
0
11 Feb 2022
The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning
European Conference on Computer Vision (ECCV), 2022
Jack Hessel
Jena D. Hwang
Jinho Park
Rowan Zellers
Chandra Bhagavatula
Anna Rohrbach
Kate Saenko
Yejin Choi
ReLM
497
61
0
10 Feb 2022
NEWSKVQA: Knowledge-Aware News Video Question Answering
Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2022
Pranay Gupta
Manish Gupta
243
9
0
08 Feb 2022
A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering
Feng Gao
Q. Ping
Govind Thattai
Aishwarya N. Reganti
Yingting Wu
Premkumar Natarajan
150
18
0
14 Jan 2022
Self-Training Vision Language BERTs with a Unified Conditional Model
Xiaofeng Yang
Fengmao Lv
Fayao Liu
Guosheng Lin
SSL
VLM
313
18
0
06 Jan 2022
Zero-shot and Few-shot Learning with Knowledge Graphs: A Comprehensive Survey
Proceedings of the IEEE (Proc. IEEE), 2021
Jiaoyan Chen
Yuxia Geng
Zhuo Chen
Jeff Z. Pan
Yuan He
Wen Zhang
Ian Horrocks
Hua-zeng Chen
641
67
0
18 Dec 2021
KAT: A Knowledge Augmented Transformer for Vision-and-Language
Liangke Gui
Borui Wang
Qiuyuan Huang
Alexander G. Hauptmann
Yonatan Bisk
Jianfeng Gao
245
196
0
16 Dec 2021
3D Question Answering
Shuquan Ye
Dongdong Chen
Songfang Han
Jing Liao
ViT
261
60
0
15 Dec 2021
Improving and Diagnosing Knowledge-Based Visual Question Answering via Entity Enhanced Knowledge Injection
Diego Garcia-Olano
Yasumasa Onoe
Joydeep Ghosh
168
22
0
13 Dec 2021
Two-stage Rule-induction Visual Reasoning on RPMs with an Application to Video Prediction
Wentao He
Jianfeng Ren
Ruibin Bai
Xudong Jiang
LRM
302
8
0
24 Nov 2021
Medical Visual Question Answering: A Survey
Zhihong Lin
Donghao Zhang
Qingyi Tao
Danli Shi
Gholamreza Haffari
Qi Wu
M. He
Z. Ge
320
178
0
19 Nov 2021
Transferring Domain-Agnostic Knowledge in Video Question Answering
Tianran Wu
Noa Garcia
Mayu Otani
Chenhui Chu
Yuta Nakashima
Haruo Takemura
137
10
0
26 Oct 2021
Coarse-to-Fine Reasoning for Visual Question Answering
Binh X. Nguyen
Tuong Khanh Long Do
Huy Tran
Erman Tjiputra
Quang-Dieu Tran
A. Nguyen
NAI
304
44
0
06 Oct 2021
A Survey of Knowledge Enhanced Pre-trained Models
Jian Yang
Xinyu Hu
Gang Xiao
Yulong Shen
KELM
440
8
0
01 Oct 2021
Knowledge-based Embodied Question Answering
Sinan Tan
Mengmeng Ge
Di Guo
Huaping Liu
F. Sun
266
39
0
16 Sep 2021
Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering
Ander Salaberria
Gorka Azkune
Oier López de Lacalle
Aitor Soroa Etxabe
Eneko Agirre
300
69
0
15 Sep 2021
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA
AAAI Conference on Artificial Intelligence (AAAI), 2021
Zhengyuan Yang
Zhe Gan
Jianfeng Wang
Xiaowei Hu
Yumao Lu
Zicheng Liu
Lijuan Wang
590
489
0
10 Sep 2021
Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
Min Peng
Chongyang Wang
Yuan Gao
Yu Shi
Xiangdong Zhou
184
4
0
10 Sep 2021
Weakly-Supervised Visual-Retriever-Reader for Knowledge-based Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021
Man Luo
Yankai Zeng
Pratyay Banerjee
Chitta Baral
RALM
250
88
0
09 Sep 2021
Weakly Supervised Relative Spatial Reasoning for Visual Question Answering
Pratyay Banerjee
Tejas Gokhale
Yezhou Yang
Chitta Baral
LRM
163
19
0
04 Sep 2021
EKTVQA: Generalized use of External Knowledge to empower Scene Text in Text-VQA
IEEE Access (IEEE Access), 2021
Arka Ujjal Dey
Ernest Valveny
Gaurav Harit
353
3
0
22 Aug 2021
Interpretable Visual Understanding with Cognitive Attention Network
International Conference on Artificial Neural Networks (ICANN), 2021
Xuejiao Tang
Wenbin Zhang
Yi Yu
Kea Turner
Hanyu Wang
Mengyu Wang
Eirini Ntoutsi
275
19
0
06 Aug 2021
Zero-shot Visual Question Answering using Knowledge Graph
Zhuo Chen
Jiaoyan Chen
Yuxia Geng
Jeff Z. Pan
Zonggang Yuan
Huajun Chen
314
85
0
12 Jul 2021
Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory
Xuejiao Tang
Xin Huang
Wenbin Zhang
T. Child
Qiong Hu
Zhen Liu
Ji Zhang
LRM
195
20
0
04 Jul 2021
NAAQA: A Neural Architecture for Acoustic Question Answering
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Jerome Abdelnour
Jean Rouat
G. Salvi
291
5
0
11 Jun 2021
Previous
1
2
3
4
5
Next
Page 3 of 5