Combo of Thinking and Observing for Outside-Knowledge VQAAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 |
A Region-Prompted Adapter Tuning for Visual Abductive ReasoningACM Multimedia (ACM MM), 2023 |
VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks
for Visual Question AnsweringIEEE International Conference on Computer Vision (ICCV), 2022 |
MuKEA: Multimodal Knowledge Extraction and Accumulation for
Knowledge-based Visual Question AnsweringComputer Vision and Pattern Recognition (CVPR), 2022 |
Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for
Knowledge-based Visual Question AnsweringAAAI Conference on Artificial Intelligence (AAAI), 2022 |
CRIC: A VQA Dataset for Compositional Reasoning on Vision and
CommonsenseIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019 |