88

Reveal Hidden Pitfalls and Navigate Next Generation of Vector Similarity Search from Task-Centric Views

Tingyang Chen
Cong Fu
Jiahua Wu
Haotian Wu
Hua Fan
Xiangyu Ke
Yunjun Gao
Yabo Ni
Anxiang Zeng
Main:13 Pages
10 Figures
Bibliography:2 Pages
7 Tables
Abstract

Vector Similarity Search (VSS) in high-dimensional spaces is rapidly emerging as core functionality in next-generation database systems for numerous data-intensive services -- from embedding lookups in large language models (LLMs), to semantic information retrieval and recommendation engines. Current benchmarks, however, evaluate VSS primarily on the recall-latency trade-off against a ground truth defined solely by distance metrics, neglecting how retrieval quality ultimately impacts downstream tasks. This disconnect can mislead both academic research and industrial practice.

View on arXiv
Comments on this paper