ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.11501
  4. Cited By
VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks
  for Visual Question Answering
v1v2 (latest)

VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering

IEEE International Conference on Computer Vision (ICCV), 2022
23 May 2022
Yanan Wang
Michihiro Yasunaga
Hongyu Ren
Shinya Wada
J. Leskovec
ArXiv (abs)PDFHTML

Papers citing "VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering"

15 / 15 papers shown
SLIP: Structural-aware Language-Image Pretraining for Vision-Language Alignment
SLIP: Structural-aware Language-Image Pretraining for Vision-Language Alignment
Wenbo Lu
CLIPVLM
201
0
0
04 Nov 2025
Causal Debiasing for Visual Commonsense Reasoning
Causal Debiasing for Visual Commonsense ReasoningIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Jiayi Zou
Gengyun Jia
Bing-Kun Bao
CML
135
2
0
23 Oct 2025
Query-Specific GNN: A Comprehensive Graph Representation Learning Method for Retrieval Augmented Generation
Query-Specific GNN: A Comprehensive Graph Representation Learning Method for Retrieval Augmented Generation
Yuchen Yan
Zhihua Liu
Hao Wang
Weiming Li
Xiaoshuai Hao
108
0
0
13 Oct 2025
EyePCR: A Comprehensive Benchmark for Fine-Grained Perception, Knowledge Comprehension and Clinical Reasoning in Ophthalmic Surgery
EyePCR: A Comprehensive Benchmark for Fine-Grained Perception, Knowledge Comprehension and Clinical Reasoning in Ophthalmic Surgery
Gui Wang
Yang Wennuo
Xusen Ma
Zehao Zhong
Zhuoru Wu
Ende Wu
Rong Qu
W. Cheah
Jianfeng Ren
Linlin Shen
178
0
0
19 Sep 2025
FlexMUSE: Multimodal Unification and Semantics Enhancement Framework with Flexible interaction for Creative Writing
FlexMUSE: Multimodal Unification and Semantics Enhancement Framework with Flexible interaction for Creative Writing
Jiahao Chen
Zhiyong Ma
Wenbiao Du
Qingyuan Chuai
87
1
0
22 Aug 2025
MissionHD: Hyperdimensional Refinement of Distribution-Deficient Reasoning Graphs for Video Anomaly Detection
MissionHD: Hyperdimensional Refinement of Distribution-Deficient Reasoning Graphs for Video Anomaly Detection
Sanggeon Yun
Raheeb Hassan
Ryozo Masukawa
Nathaniel D. Bastian
Mohsen Imani
200
0
0
20 Aug 2025
Augmented Vision-Language Models: A Systematic Review
Augmented Vision-Language Models: A Systematic Review
Anthony C Davis
Burhan Sadiq
Tianmin Shu
Chien-Ming Huang
VLMLRM
196
0
0
24 Jul 2025
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis
Efthymios Georgiou
Vassilis Katsouros
Yannis Avrithis
Alexandros Potamianos
394
1
0
15 Apr 2025
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
Multimodal Fusion and Vision-Language Models: A Survey for Robot VisionInformation Fusion (Inf. Fusion), 2025
Xiaofeng Han
Shunpeng Chen
Zenghuang Fu
Zhe Feng
Lue Fan
...
Li Guo
Weiliang Meng
Xiaopeng Zhang
Rongtao Xu
Shibiao Xu
445
37
0
03 Apr 2025
Predicate Hierarchies Improve Few-Shot State Classification
Predicate Hierarchies Improve Few-Shot State ClassificationInternational Conference on Learning Representations (ICLR), 2025
Emily Jin
Joy Hsu
Jiajun Wu
OffRL
437
1
0
18 Feb 2025
PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection
  and Natural Language Interpretation
PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language InterpretationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Ryozo Masukawa
Sanggeon Yun
Yoshiki Yamaguchi
Mohsen Imani
156
4
0
30 Oct 2024
MMCert: Provable Defense against Adversarial Attacks to Multi-modal
  Models
MMCert: Provable Defense against Adversarial Attacks to Multi-modal Models
Yanting Wang
Hongye Fu
Wei Zou
Jinyuan Jia
AAML
381
5
0
28 Mar 2024
VCD: A Dataset for Visual Commonsense Discovery in Images
VCD: A Dataset for Visual Commonsense Discovery in Images
Xiangqing Shen
Yurun Song
Siwei Wu
Rui Xia
275
6
0
27 Feb 2024
ViCLEVR: A Visual Reasoning Dataset and Hybrid Multimodal Fusion Model
  for Visual Question Answering in Vietnamese
ViCLEVR: A Visual Reasoning Dataset and Hybrid Multimodal Fusion Model for Visual Question Answering in Vietnamese
Khiem Vinh Tran
Hao Phu Phan
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
155
15
0
27 Oct 2023
Graph Neural Networks in Vision-Language Image Understanding: A Survey
Graph Neural Networks in Vision-Language Image Understanding: A SurveyThe Visual Computer (TVC), 2023
Henry Senior
Greg Slabaugh
Shanxin Yuan
Luca Rossi
GNN
322
32
0
07 Mar 2023
1