v1v2v3v4 (latest)

Zero-shot Visual Question Answering using Knowledge Graph

12 July 2021

Huajun Chen

Papers citing "Zero-shot Visual Question Answering using Knowledge Graph"

37 / 37 papers shown

Seeing and Knowing in the Wild: Open-domain Visual Entity Recognition with Large-scale Knowledge Graphs via Contrastive Learning

136

15 Oct 2025

Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding

170

08 Sep 2025

NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks

27 Aug 2025

ViFP: A Framework for Visual False Positive Detection to Enhance Reasoning Reliability in VLMs

164

06 Aug 2025

Augmented Vision-Language Models: A Systematic Review

196

24 Jul 2025

An Enhanced Large Language Model For Cross Modal Query Understanding System Using DL-KeyBERT Based CAZSSCL-MPGPT

Shreya Singh

289

24 Feb 2025

Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering

226

22 Jan 2025

Graph-guided Cross-composition Feature Disentanglement for Compositional Zero-shot LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

222

19 Aug 2024

Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language ModelsICON (ICON), 2024

Manas Jhalani

Annervaz K M

Pushpak Bhattacharyya

14 Jun 2024

Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question AnsweringNeural Information Processing Systems (NeurIPS), 2024

Dongsheng Li

268

23 May 2024

Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering

Jing Liu

298

22 Apr 2024

CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning

Haojian Huang

395

15 Apr 2024

Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs

Jing Gao

319

01 Apr 2024

Intrinsic Subgraph Generation for Interpretable Graph based Visual Question Answering

Pascal Tilli

Ngoc Thang Vu

282

26 Mar 2024

DRAK: Unlocking Molecular Insights with Domain-Specific Retrieval-Augmented Knowledge in LLMs

287

04 Mar 2024

Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning

292

19 Feb 2024

Context Disentangling and Prototype Inheriting for Robust Visual Grounding

Wei Tang

271

19 Dec 2023

Knowledgeable Preference Alignment for LLMs in Domain-specific Question AnsweringAnnual Meeting of the Association for Computational Linguistics (ACL), 2023

326

11 Nov 2023

From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and OpportunitiesInformation Fusion (Inf. Fusion), 2023

Md Farhan Ishmam

Md Sakib Hossain Shovon

M. F. Mridha

Nilanjan Dey

399

01 Nov 2023

Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity AlignmentInternational Workshop on the Semantic Web (SW), 2023

356

30 Jul 2023

Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature

Ana Claudia Akemi Matsuki de Faria

Felype de Castro Bastos

Jose Victor Nogueira Alves da Silva

Vitor Lopes Fabris

Valeska Uchôa

Décio Gonccalves de Aguiar Neto

C. F. G. Santos

263

18 May 2023

Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured RepresentationsAAAI Conference on Artificial Intelligence (AAAI), 2023

...

Zeng Zhao

308

06 May 2023

NeuralKG-ind: A Python Library for Inductive Knowledge Graph Representation LearningAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023

196

28 Apr 2023

FVQA 2.0: Introducing Adversarial Samples into Fact-based Visual Question AnsweringFindings (Findings), 2023

178

19 Mar 2023

The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges

Maria Lymperaiou

Giorgos Stamou

VLM

235

04 Mar 2023

Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia EntitiesIEEE International Conference on Computer Vision (ICCV), 2023

371

22 Feb 2023

Entity-Agnostic Representation Learning for Parameter-Efficient Knowledge Graph EmbeddingAAAI Conference on Artificial Intelligence (AAAI), 2023

159

03 Feb 2023

MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality HybridACM Multimedia (ACM MM), 2022

...

514

29 Dec 2022

A survey on knowledge-enhanced multimodal learningArtificial Intelligence Review (Artif Intell Rev), 2022

Maria Lymperaiou

Giorgos Stamou

461

19 Nov 2022

Exemplar Guided Deep Neural Network for Spatial Transcriptomics Analysis of Gene Expression PredictionIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022

182

30 Oct 2022

Target-oriented Sentiment Classification with Sequential Cross-modal Semantic GraphInternational Conference on Artificial Neural Networks (ICANN), 2022

153

19 Aug 2022

LaKo: Knowledge-driven Visual Question Answering via Late Knowledge-to-Text Injection

Ningyu Zhang

235

26 Jul 2022

DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot LearningAAAI Conference on Artificial Intelligence (AAAI), 2022

Huajun Chen

423

04 Jul 2022

Disentangled Ontology Embedding for Zero-shot LearningKnowledge Discovery and Data Mining (KDD), 2022

222

08 Jun 2022

Zero-shot and Few-shot Learning with Knowledge Graphs: A Comprehensive SurveyProceedings of the IEEE (Proc. IEEE), 2021

641

18 Dec 2021

Benchmarking Knowledge-driven Zero-shot LearningJournal of Web Semantics (Web Semantics), 2021

Huajun Chen

251

29 Jun 2021

Distributed Representations of Entities in Open-World Knowledge Graphs

Huajun Chen

181

16 Oct 2020