Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1702.08734
Cited By
Billion-scale similarity search with GPUs
IEEE Transactions on Big Data (TBD), 2017
28 February 2017
Jeff Johnson
Matthijs Douze
Edouard Grave
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Billion-scale similarity search with GPUs"
50 / 2,110 papers shown
Title
An Efficient Embedding Based Ad Retrieval with GPU-Powered Feature Interaction
Yifan Lei
Jiahua Luo
Tingyu Jiang
Bo Zhang
L. Wang
Dapeng Liu
Zhaoren Wu
Haijie Gu
Huan Yu
Jie Jiang
36
0
0
27 Nov 2025
ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering
Alberto Compagnoni
Marco Morini
Sara Sarto
Federico Cocchi
Davide Caffagni
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
RALM
LRM
143
0
0
27 Nov 2025
Qwen3-VL Technical Report
Shuai Bai
Yuxuan Cai
Ruizhe Chen
Keqin Chen
Xionghui Chen
...
Jingren Zhou
F. I. S. Kevin Zhou
J. Zhou
Yuanzhi Zhu
Ke Zhu
VLM
751
29
0
26 Nov 2025
APT-CGLP: Advanced Persistent Threat Hunting via Contrastive Graph-Language Pre-Training
Xuebo Qiu
Mingqi Lv
Yimei Zhang
Tieming Chen
Tiantian Zhu
Qijie Song
Shouling Ji
161
0
0
25 Nov 2025
A Systematic Analysis of Large Language Models with RAG-enabled Dynamic Prompting for Medical Error Detection and Correction
Farzad Ahmed
Joniel Augustine Jerome
Meliha Yetisgen
Özlem Uzuner
104
0
0
25 Nov 2025
Learning Plug-and-play Memory for Guiding Video Diffusion Models
Selena Song
Ziming Xu
Zijun Zhang
Kun Zhou
Jiaxian Guo
Lianhui Qin
Biwei Huang
VGen
172
0
0
24 Nov 2025
The Catastrophic Paradox of Human Cognitive Frameworks in Large Language Model Evaluation: A Comprehensive Empirical Analysis of the CHC-LLM Incompatibility
Mohan Reddy
ELM
152
0
0
23 Nov 2025
Spectral Super-Resolution Neural Operator with Atmospheric Radiative Transfer Prior
Ziye Zhang
Bin Pan
Zhenwei Shi
44
0
0
22 Nov 2025
ProHD: Projection-Based Hausdorff Distance Approximation
Jiuzhou Fu
Luanzheng Guo
Nathan Tallent
Dongfang Zhao
36
0
0
22 Nov 2025
PhysMorph-GS: Differentiable Shape Morphing via Joint Optimization of Physics and Rendering Objectives
Design Automation Conference (DAC), 2025
Chang-Yong Song
David Hyde
AI4CE
125
0
0
21 Nov 2025
DelTriC: A Novel Clustering Method with Accurate Outlier
Tomáš Javůrek
Michal Gregor
Sebastian Kula
Marian Simko
124
0
0
21 Nov 2025
AutoLink: Autonomous Schema Exploration and Expansion for Scalable Schema Linking in Text-to-SQL at Scale
Z. Wang
Y. Zheng
Zhenbiao Cao
Xiaojin Zhang
Zhongyu Wei
Pei Fu
Zhenbo Luo
Wei Chen
Xiang Bai
139
0
0
21 Nov 2025
Incorporating Token Importance in Multi-Vector Retrieval
Archish S
Ankit Garg
Kirankumar Shiragur
N. Kayal
116
0
0
20 Nov 2025
CroPS: Improving Dense Retrieval with Cross-Perspective Positive Samples in Short-Video Search
Ao Xie
Jiahui Chen
Quanzhi Zhu
Xiaoze Jiang
Zhiheng Qin
Enyun Yu
Han Li
57
0
0
19 Nov 2025
B+ANN: A Fast Billion-Scale Disk-based Nearest-Neighbor Index
Selim Furkan Tekin
Rajesh Bordawekar
144
0
0
19 Nov 2025
SilverTorch: A Unified Model-based System to Democratize Large-Scale Recommendation on GPUs
Bi Xue
H. Wu
L. Chen
Chao Yang
Yiming Ma
...
Pawel Garbacki
Zheng Fang
Yiyi Pan
Min Ni
Yang Liu
145
0
0
18 Nov 2025
RAG-Driven Data Quality Governance for Enterprise ERP Systems
Sedat Bin Vedat
Enes Kutay Yarkan
Meftun Akarsu
Recep Kaan Karaman
Arda Sar
Çağrı Çelikbilek
Savaş Saygılı
68
0
0
18 Nov 2025
Data-driven Acceleration of MPC with Guarantees
Agustin Castellano
Shijie Pan
Enrique Mallada
44
0
0
17 Nov 2025
Dimension vs. Precision: A Comparative Analysis of Autoencoders and Quantization for Efficient Vector Retrieval on BEIR SciFact
Satyanarayan Pati
MQ
127
0
0
17 Nov 2025
Grounded by Experience: Generative Healthcare Prediction Augmented with Hierarchical Agentic Retrieval
Chuang Zhao
Hui Tang
Hongke Zhao
Xiaofang Zhou
Xiaomeng Li
69
0
0
17 Nov 2025
Generative Caching for Structurally Similar Prompts and Responses
Sarthak Chakraborty
Suman Nath
Xuchao Zhang
Chetan Bansal
Indranil Gupta
130
1
0
14 Nov 2025
Prompt Tuning for Natural Language to SQL with Embedding Fine-Tuning and RAG
Jisoo Jang
Tien-Cuong Bui
Yunjun Choi
Wen-Syan Li
84
0
0
11 Nov 2025
When Evidence Contradicts: Toward Safer Retrieval-Augmented Generation in Healthcare
Saeedeh Javadi
Sara Mirabi
Manan Gangar
Bahadorreza Ofoghi
RALM
HILM
211
0
0
10 Nov 2025
3dSAGER: Geospatial Entity Resolution over 3D Objects (Technical Report)
Bar Genossar
Sagi Dalyot
Roee Shraga
A. Gal
93
0
0
09 Nov 2025
MemoriesDB: A Temporal-Semantic-Relational Database for Long-Term Agent Memory / Modeling Experience as a Graph of Temporal-Semantic Surfaces
Joel Ward
VLM
60
0
0
09 Nov 2025
Can a Small Model Learn to Look Before It Leaps? Dynamic Learning and Proactive Correction for Hallucination Detection
Zepeng Bao
Shen Zhou
Qiankun Pi
Jianhao Chen
Mayi Xu
Ming Zhong
Yuanyuan Zhu
T. Qian
56
0
0
08 Nov 2025
Guardian-regularized Safe Offline Reinforcement Learning for Smart Weaning of Mechanical Circulatory Devices
Aysin Tumay
S. Sun
Sonia Fereidooni
Aaron Dumas
Elise Jortberg
Rose Yu
OffRL
124
0
0
08 Nov 2025
Reasoning-Guided Claim Normalization for Noisy Multilingual Social Media Posts
Manan Sharma
Arya Suneesh
Manish Jain
Pawan Kumar Rajpoot
Prasanna Devadiga
Bharatdeep Hazarika
Ashish Shrivastava
Kishan Gurumurthy
Anshuman B Suresh
Aditya U Baliga
100
0
0
07 Nov 2025
Search Is Not Retrieval: Decoupling Semantic Matching from Contextual Assembly in RAG
Harshit Nainwani
Hediyeh Baban
AI4TS
216
0
0
07 Nov 2025
Robust Neural Audio Fingerprinting using Music Foundation Models
Shubhr Singh
Kiran Bhat
Xavier Riley
Benjamin Resnick
John Thickstun
Walter De Brouwer
93
0
0
07 Nov 2025
SiamMM: A Mixture Model Perspective on Deep Unsupervised Learning
Xiaodong Wang
Jing Huang
Kevin J. Liang
SSL
368
0
0
07 Nov 2025
Differentially Private In-Context Learning with Nearest Neighbor Search
A. Koskela
Tejas D. Kulkarni
Laith Zumot
112
0
0
06 Nov 2025
Coordination-Free Lane Partitioning for Convergent ANN Search
Carl Kugblenu
Petri Vuorimaa
84
0
0
06 Nov 2025
Reusing Pre-Training Data at Test Time is a Compute Multiplier
Alex Fang
Thomas Voice
Ruoming Pang
Ludwig Schmidt
Tom Gunter
90
0
0
06 Nov 2025
Plan of Knowledge: Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering
Xinying Qian
Ying Zhang
Yu Zhao
Baohang Zhou
Xuhui Sui
Xiaojie Yuan
RALM
227
0
0
06 Nov 2025
Cache Mechanism for Agent RAG Systems
Shuhang Lin
Zhencan Peng
Lingyao Li
Xiao Lin
Xi Zhu
Yongfeng Zhang
97
0
0
04 Nov 2025
LUMA-RAG: Lifelong Multimodal Agents with Provably Stable Streaming Alignment
Rohan Wandre
Yash Gajewar
Namrata Patel
Vivek Dhalkari
68
0
0
04 Nov 2025
RAGSmith: A Framework for Finding the Optimal Composition of Retrieval-Augmented Generation Methods Across Datasets
Muhammed Yusuf Kartal
Suha Kagan Kose
Korhan Sevinç
Burak Aktas
90
0
0
03 Nov 2025
Scaling Graph Chain-of-Thought Reasoning: A Multi-Agent Framework with Efficient LLM Serving
Chengying Huan
Ziheng Meng
Yongchao Liu
Zhengyi Yang
Yun Zhu
...
Haitao Zhang
Chuntao Hong
Shaonan Ma
Guihai Chen
Chen Tian
LRM
104
0
0
03 Nov 2025
Hybrid Retrieval-Augmented Generation Agent for Trustworthy Legal Question Answering in Judicial Forensics
Yueqing Xi
Yifan Bai
Huasen Luo
Weiliang Wen
Hui Liu
Haoliang Li
AILaw
RALM
265
0
0
03 Nov 2025
Rescuing the Unpoisoned: Efficient Defense against Knowledge Corruption Attacks on RAG Systems
Minseok Kim
Hankook Lee
Hyungjoon Koo
AAML
SILM
157
0
0
03 Nov 2025
Taxonomy-based Negative Sampling In Personalized Semantic Search for E-commerce
Uthman Jinadu
Siawpeng Er
Le Yu
Chen Liang
Bingxin Li
Yi Ding
Aleksandar Velkoski
80
0
0
01 Nov 2025
TRACES: Temporal Recall with Contextual Embeddings for Real-Time Video Anomaly Detection
Yousuf Ahmed Siddiqui
Sufiyaan Usmani
Umer Tariq
Jawwad A. Shamsi
Muhammad Burhan Khan
AI4TS
92
0
0
01 Nov 2025
Generalized Category Discovery under Domain Shift: A Frequency Domain Perspective
Wei Feng
Z. Ge
70
0
0
01 Nov 2025
Similarity-Distance-Magnitude Language Models
Allen Schmaltz
72
0
0
30 Oct 2025
Context Engineering 2.0: The Context of Context Engineering
Qishuo Hua
Lyumanshan Ye
Dayuan Fu
Yang Xiao
Xiaojie Cai
Yunze Wu
Jifan Lin
Junfei Wang
Pengfei Liu
305
1
0
30 Oct 2025
Quantitative Intertextuality from the Digital Humanities Perspective: A Survey
Siyu Duan
AI4CE
92
0
0
30 Oct 2025
Retrieval-Augmented Multimodal Depression Detection
Ruibo Hou
Shiyu Teng
Jiaqing Liu
Shurong Chai
Yinhao Li
Lanfen Lin
Yen-Wei Chen
RALM
114
0
0
29 Oct 2025
Category-Aware Semantic Caching for Heterogeneous LLM Workloads
Chen Wang
Xunzhuo Liu
Yue Zhu
Alaa Youssef
Priya Nagpurkar
Huamin Chen
81
0
0
29 Oct 2025
Instance-Level Composed Image Retrieval
Bill Psomas
George Retsinas
Nikos Efthymiadis
P. Filntisis
Yannis Avrithis
Petros Maragos
Ondřej Chum
Giorgos Tolias
100
1
0
29 Oct 2025
1
2
3
4
...
41
42
43
Next