142

SIEVE: Effective Filtered Vector Search with Collection of Indexes

Proceedings of the VLDB Endowment (PVLDB), 2025
Main:12 Pages
19 Figures
Bibliography:2 Pages
8 Tables
Appendix:2 Pages
Abstract

Many real-world tasks such as recommending videos with the kids tag can be reduced to finding most similar vectors associated with hard predicates. This task, filtered vector search, is challenging as prior state-of-the-art graph-based (unfiltered) similarity search techniques quickly degenerate when hard constraints are considered. That is, effective graph-based filtered similarity search relies on sufficient connectivity for reaching the most similar items within just a few hops. To consider predicates, recent works propose modifying graph traversal to visit only the items that may satisfy predicates. However, they fail to offer the just-a-few-hops property for a wide range of predicates: they must restrict predicates significantly or lose efficiency if only a small fraction of items satisfy predicates.

View on arXiv
Comments on this paper