20
0

Dynamic Superblock Pruning for Fast Learned Sparse Retrieval

Abstract

This paper proposes superblock pruning (SP) during top-k online document retrieval for learned sparse representations. SP structures the sparse index as a set of superblocks on a sequence of document blocks and conducts a superblock-level selection to decide if some superblocks can be pruned before visiting their child blocks. SP generalizes the previous flat block or cluster-based pruning, allowing the early detection of groups of documents that cannot or are less likely to appear in the final top-k list. SP can accelerate sparse retrieval in a rank-safe or approximate manner under a high-relevance competitiveness constraint. Our experiments show that the proposed scheme significantly outperforms state-of-the-art baselines on MS MARCO passages on a single-threaded CPU.

View on arXiv
@article{carlson2025_2504.17045,
  title={ Dynamic Superblock Pruning for Fast Learned Sparse Retrieval },
  author={ Parker Carlson and Wentai Xie and Shanxiu He and Tao Yang },
  journal={arXiv preprint arXiv:2504.17045},
  year={ 2025 }
}
Comments on this paper