30
0

Inference-friendly Graph Compression for Graph Neural Networks

Abstract

Graph Neural Networks (GNNs) have demonstrated promising performance in graph analysis. Nevertheless, the inference process of GNNs remains costly, hindering their applications for large graphs. This paper proposes inference-friendly graph compression (IFGC), a graph compression scheme to accelerate GNNs inference. Given a graph GG and a GNN MM, an IFGC computes a small compressed graph GcG_c, to best preserve the inference results of MM over GG, such that the result can be directly inferred by accessing GcG_c with no or little decompression cost. (1) We characterize IFGC with a class of inference equivalence relation. The relation captures the node pairs in GG that are not distinguishable for GNN inference. (2) We introduce three practical specifications of IFGC for representative GNNs: structural preserving compression (SPGC), which computes GcG_c that can be directly processed by GNN inference without decompression; (α\alpha, rr)-compression, that allows for a configurable trade-off between compression ratio and inference quality, and anchored compression that preserves inference results for specific nodes of interest. For each scheme, we introduce compression and inference algorithms with guarantees of efficiency and quality of the inferred results. We conduct extensive experiments on diverse sets of large-scale graphs, which verifies the effectiveness and efficiency of our graph compression approaches.

View on arXiv
@article{fan2025_2504.13034,
  title={ Inference-friendly Graph Compression for Graph Neural Networks },
  author={ Yangxin Fan and Haolai Che and Yinghui Wu },
  journal={arXiv preprint arXiv:2504.13034},
  year={ 2025 }
}
Comments on this paper