186

Predictive Multiplicity of Knowledge Graph Embeddings in Link Prediction

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Main:1 Pages
16 Figures
Bibliography:1 Pages
9 Tables
Appendix:19 Pages
Abstract

Knowledge graph embedding (KGE) models are often used to predict missing links for knowledge graphs (KGs). However, multiple KG embeddings can perform almost equally well for link prediction yet suggest conflicting predictions for certain queries, termed \textit{predictive multiplicity} in literature. This behavior poses substantial risks for KGE-based applications in high-stake domains but has been overlooked in KGE research. In this paper, we define predictive multiplicity in link prediction. We introduce evaluation metrics and measure predictive multiplicity for representative KGE methods on commonly used benchmark datasets. Our empirical study reveals significant predictive multiplicity in link prediction, with 8%8\% to 39%39\% testing queries exhibiting conflicting predictions. To address this issue, we propose leveraging voting methods from social choice theory, significantly mitigating conflicts by 66%66\% to 78%78\% according to our experiments.

View on arXiv
Comments on this paper