ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2107.05348
  4. Cited By
Zero-shot Visual Question Answering using Knowledge Graph
v1v2v3v4 (latest)

Zero-shot Visual Question Answering using Knowledge Graph

12 July 2021
Zhuo Chen
Jiaoyan Chen
Yuxia Geng
Jeff Z. Pan
Zonggang Yuan
Huajun Chen
ArXiv (abs)PDFHTML

Papers citing "Zero-shot Visual Question Answering using Knowledge Graph"

37 / 37 papers shown
Seeing and Knowing in the Wild: Open-domain Visual Entity Recognition with Large-scale Knowledge Graphs via Contrastive Learning
Seeing and Knowing in the Wild: Open-domain Visual Entity Recognition with Large-scale Knowledge Graphs via Contrastive Learning
Hongkuan Zhou
Lavdim Halilaj
Sebastian Monka
Stefan Schmid
Yuqicheng Zhu
Jingcheng Wu
Nadeem Nazer
Steffen Staab
VLM
136
0
0
15 Oct 2025
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
Jiangnan Xie
Xiaolong Zheng
Liang Zheng
ObjD
170
0
0
08 Sep 2025
NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks
NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks
Aritra Dutta
Swapnanil Mukherjee
Deepanway Ghosal
Somak Aditya
VLM
94
0
0
27 Aug 2025
ViFP: A Framework for Visual False Positive Detection to Enhance Reasoning Reliability in VLMs
ViFP: A Framework for Visual False Positive Detection to Enhance Reasoning Reliability in VLMs
Ben Zhang
LuLu Yu
Lei Gao
QuanJiang Guo
QuanJiang Guo
Hui Gao
LRM
164
0
0
06 Aug 2025
Augmented Vision-Language Models: A Systematic Review
Augmented Vision-Language Models: A Systematic Review
Anthony C Davis
Burhan Sadiq
Tianmin Shu
Chien-Ming Huang
VLMLRM
196
0
0
24 Jul 2025
An Enhanced Large Language Model For Cross Modal Query Understanding System Using DL-KeyBERT Based CAZSSCL-MPGPT
An Enhanced Large Language Model For Cross Modal Query Understanding System Using DL-KeyBERT Based CAZSSCL-MPGPT
Shreya Singh
289
0
0
24 Feb 2025
Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering
Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering
Qian Tao
Xiaoyang Fan
Yong Xu
Xingquan Zhu
Yufei Tang
226
0
0
22 Jan 2025
Graph-guided Cross-composition Feature Disentanglement for Compositional Zero-shot Learning
Graph-guided Cross-composition Feature Disentanglement for Compositional Zero-shot LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024
Yuxia Geng
Runkai Zhu
Jiaoyan Chen
Jintai Chen
Zhuo Chen
Z. Chen
Can Xu
Yuxiang Wang
Xiaoliang Xu
Sheng-Jun Huang
CoGe
222
0
0
19 Aug 2024
Precision Empowers, Excess Distracts: Visual Question Answering With
  Dynamically Infused Knowledge In Language Models
Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language ModelsICON (ICON), 2024
Manas Jhalani
Annervaz K M
Pushpak Bhattacharyya
98
3
0
14 Jun 2024
Perception of Knowledge Boundary for Large Language Models through
  Semi-open-ended Question Answering
Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question AnsweringNeural Information Processing Systems (NeurIPS), 2024
Zhihua Wen
Zhiliang Tian
Z. Jian
Zhen Huang
Pei Ke
Yifu Gao
Shiyu Huang
Dongsheng Li
268
25
0
23 May 2024
Self-Bootstrapped Visual-Language Model for Knowledge Selection and
  Question Answering
Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering
Dongze Hao
Qunbo Wang
Longteng Guo
Jie Jiang
Jing Liu
298
9
0
22 Apr 2024
CREST: Cross-modal Resonance through Evidential Deep Learning for
  Enhanced Zero-Shot Learning
CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning
Haojian Huang
Xiaozhen Qiao
Zhuo Chen
Haodong Chen
Bingyu Li
Zhe Sun
Mulin. Chen
Xuelong Li
395
18
0
15 Apr 2024
Evaluating the Factuality of Large Language Models using Large-Scale
  Knowledge Graphs
Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs
Xiaoze Liu
Feijie Wu
Tianyang Xu
Zhuo Chen
Yichi Zhang
Xiaoqian Wang
Jing Gao
HILM
319
15
0
01 Apr 2024
Intrinsic Subgraph Generation for Interpretable Graph based Visual
  Question Answering
Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering
Pascal Tilli
Ngoc Thang Vu
282
1
0
26 Mar 2024
DRAK: Unlocking Molecular Insights with Domain-Specific
  Retrieval-Augmented Knowledge in LLMs
DRAK: Unlocking Molecular Insights with Domain-Specific Retrieval-Augmented Knowledge in LLMs
Jinzhe Liu
Xiangsheng Huang
Zhuo Chen
Yin Fang
287
6
0
04 Mar 2024
Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense
  and Hypothetical Reasoning
Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning
Danna Zheng
Mirella Lapata
Jeff Z. Pan
RALM
292
14
0
19 Feb 2024
Context Disentangling and Prototype Inheriting for Robust Visual
  Grounding
Context Disentangling and Prototype Inheriting for Robust Visual Grounding
Wei Tang
Liang Li
Xuejing Liu
Lu Jin
Jinhui Tang
Zechao Li
271
41
0
19 Dec 2023
Knowledgeable Preference Alignment for LLMs in Domain-specific Question
  Answering
Knowledgeable Preference Alignment for LLMs in Domain-specific Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yichi Zhang
Zhuo Chen
Yin Fang
Yanxi Lu
Fangming Li
Wen Zhang
Hua-zeng Chen
326
50
0
11 Nov 2023
From Image to Language: A Critical Analysis of Visual Question Answering
  (VQA) Approaches, Challenges, and Opportunities
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and OpportunitiesInformation Fusion (Inf. Fusion), 2023
Md Farhan Ishmam
Md Sakib Hossain Shovon
M. F. Mridha
Nilanjan Dey
399
71
0
01 Nov 2023
Rethinking Uncertainly Missing and Ambiguous Visual Modality in
  Multi-Modal Entity Alignment
Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity AlignmentInternational Workshop on the Semantic Web (SW), 2023
Zhuo Chen
Lingbing Guo
Yin Fang
Yichi Zhang
Jiaoyan Chen
Jeff Z. Pan
Yongqian Li
Hua-zeng Chen
Wen Zhang
356
45
0
30 Jul 2023
Visual Question Answering: A Survey on Techniques and Common Trends in
  Recent Literature
Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature
Ana Claudia Akemi Matsuki de Faria
Felype de Castro Bastos
Jose Victor Nogueira Alves da Silva
Vitor Lopes Fabris
Valeska Uchôa
Décio Gonccalves de Aguiar Neto
C. F. G. Santos
263
27
0
18 May 2023
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal
  Structured Representations
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured RepresentationsAAAI Conference on Artificial Intelligence (AAAI), 2023
Yufen Huang
Jiji Tang
Zhuo Chen
Rongsheng Zhang
Xinfeng Zhang
...
Zeng Zhao
Zhou Zhao
Tangjie Lv
Zhipeng Hu
Wen Zhang
VLM
308
49
0
06 May 2023
NeuralKG-ind: A Python Library for Inductive Knowledge Graph
  Representation Learning
NeuralKG-ind: A Python Library for Inductive Knowledge Graph Representation LearningAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023
Wen Zhang
Zhen Yao
Yin Hua
Zhiwei Huang
Hua-zeng Chen
AI4CE
196
2
0
28 Apr 2023
FVQA 2.0: Introducing Adversarial Samples into Fact-based Visual
  Question Answering
FVQA 2.0: Introducing Adversarial Samples into Fact-based Visual Question AnsweringFindings (Findings), 2023
Weizhe Lin
Zhilin Wang
Bill Byrne
AAML
178
6
0
19 Mar 2023
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on
  Tasks and Challenges
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges
Maria Lymperaiou
Giorgos Stamou
VLM
235
5
0
04 Mar 2023
Open-domain Visual Entity Recognition: Towards Recognizing Millions of
  Wikipedia Entities
Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia EntitiesIEEE International Conference on Computer Vision (ICCV), 2023
Hexiang Hu
Yi Luan
Yang Chen
Urvashi Khandelwal
Mandar Joshi
Kenton Lee
Kristina Toutanova
Ming-Wei Chang
VLM
371
91
0
22 Feb 2023
Entity-Agnostic Representation Learning for Parameter-Efficient
  Knowledge Graph Embedding
Entity-Agnostic Representation Learning for Parameter-Efficient Knowledge Graph EmbeddingAAAI Conference on Artificial Intelligence (AAAI), 2023
Yin Hua
Wen Zhang
Zhen Yao
Yushan Zhu
Yang Gao
Jeff Z. Pan
Hua-zeng Chen
159
14
0
03 Feb 2023
MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality
  Hybrid
MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality HybridACM Multimedia (ACM MM), 2022
Zhuo Chen
Jiaoyan Chen
Wen Zhang
Lingbing Guo
Yin Fang
...
Yichi Zhang
Yuxia Geng
Jeff Z. Pan
Wenting Song
Hua-zeng Chen
514
69
0
29 Dec 2022
A survey on knowledge-enhanced multimodal learning
A survey on knowledge-enhanced multimodal learningArtificial Intelligence Review (Artif Intell Rev), 2022
Maria Lymperaiou
Giorgos Stamou
461
22
0
19 Nov 2022
Exemplar Guided Deep Neural Network for Spatial Transcriptomics Analysis
  of Gene Expression Prediction
Exemplar Guided Deep Neural Network for Spatial Transcriptomics Analysis of Gene Expression PredictionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Yan Yang
Md Zakir Hossain
Eric A. Stone
Shafin Rahman
AI4TS
182
35
0
30 Oct 2022
Target-oriented Sentiment Classification with Sequential Cross-modal
  Semantic Graph
Target-oriented Sentiment Classification with Sequential Cross-modal Semantic GraphInternational Conference on Artificial Neural Networks (ICANN), 2022
Yufen Huang
Zhuo Chen
Jiaoyan Chen
Jeff Z. Pan
Zhen Yao
Wen Zhang
153
11
0
19 Aug 2022
LaKo: Knowledge-driven Visual Question Answering via Late
  Knowledge-to-Text Injection
LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection
Zhuo Chen
Yufen Huang
Jiaoyan Chen
Yuxia Geng
Yin Fang
Jeff Z. Pan
Ningyu Zhang
Wen Zhang
235
48
0
26 Jul 2022
DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning
DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot LearningAAAI Conference on Artificial Intelligence (AAAI), 2022
Zhuo Chen
Yufen Huang
Jiaoyan Chen
Yuxia Geng
Wen Zhang
Yin Fang
Jeff Z. Pan
Huajun Chen
VLM
423
81
0
04 Jul 2022
Disentangled Ontology Embedding for Zero-shot Learning
Disentangled Ontology Embedding for Zero-shot LearningKnowledge Discovery and Data Mining (KDD), 2022
Yuxia Geng
Jiaoyan Chen
Wen Zhang
Yajing Xu
Zhuo Chen
Jeff Z. Pan
Yufen Huang
Feiyu Xiong
Hua-zeng Chen
222
27
0
08 Jun 2022
Zero-shot and Few-shot Learning with Knowledge Graphs: A Comprehensive
  Survey
Zero-shot and Few-shot Learning with Knowledge Graphs: A Comprehensive SurveyProceedings of the IEEE (Proc. IEEE), 2021
Jiaoyan Chen
Yuxia Geng
Zhuo Chen
Jeff Z. Pan
Yuan He
Wen Zhang
Ian Horrocks
Hua-zeng Chen
641
66
0
18 Dec 2021
Benchmarking Knowledge-driven Zero-shot Learning
Benchmarking Knowledge-driven Zero-shot LearningJournal of Web Semantics (Web Semantics), 2021
Yuxia Geng
Jiaoyan Chen
Zhuang Xiang
Zhuo Chen
Jeff Z. Pan
Juan Li
Zonggang Yuan
Huajun Chen
VLM
251
21
0
29 Jun 2021
Distributed Representations of Entities in Open-World Knowledge Graphs
Distributed Representations of Entities in Open-World Knowledge Graphs
Lingbing Guo
Zhuo Chen
Jiaoyan Chen
Weiqing Wang
Zequn Sun
Zhongpo Bo
Yin Fang
Chenghao Liu
Huajun Chen
Wei Hu
GNN
181
16
0
16 Oct 2020
1