Unleashing the Potential of Two-Tower Models: Diffusion-Based Cross-Interaction for Large-Scale Matching | The Web Conference (WWW), 2025 |
Transformer-VQ: Linear-Time Transformers via Vector Quantization | International Conference on Learning Representations (ICLR), 2023 |
Faster Maximum Inner Product Search in High Dimensions | International Conference on Machine Learning (ICML), 2022 |
Billion-scale similarity search with GPUs | IEEE Transactions on Big Data (TBD), 2017 |