ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1702.08734
  4. Cited By
Billion-scale similarity search with GPUs

Billion-scale similarity search with GPUs

IEEE Transactions on Big Data (TBD), 2017
28 February 2017
Jeff Johnson
Matthijs Douze
Edouard Grave
ArXiv (abs)PDFHTML

Papers citing "Billion-scale similarity search with GPUs"

50 / 2,117 papers shown
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
Yuan Li
Qi Luo
Xiaonan Li
B. Li
Qinyuan Cheng
Bo Wang
Y. Zheng
Yuxin Wang
Zhangyue Yin
Xipeng Qiu
RALMLRM
367
2
0
26 May 2025
Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval
Optimized Text Embedding Models and Benchmarks for Amharic Passage RetrievalAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Kidist Amde Mekonnen
Yosef Worku Alemneh
Maarten de Rijke
RALM
320
2
0
25 May 2025
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM
Xun Gong
Anqi Lv
Zhiming Wang
Huijia Zhu
Y. Qian
182
6
0
25 May 2025
Enhancing Training Data Attribution with Representational Optimization
Enhancing Training Data Attribution with Representational Optimization
W. Sun
Haokun Liu
Nikhil Kandpal
Colin Raffel
Yiming Yang
TDI
466
0
0
24 May 2025
Improving Ad matching via Cluster-Adaptive Keyword Expansion and Relevance tuning
Improving Ad matching via Cluster-Adaptive Keyword Expansion and Relevance tuning
Dipanwita Saha
Anis Zaman
Hua Zou
Ning Chen
Xinxin Shu
Nadia Vase
Abraham Bagherjeiran
149
1
0
24 May 2025
VIBE: Vector Index Benchmark for Embeddings
Elias Jääsaari
Ville Hyvönen
Matteo Ceccarello
Teemu Roos
Martin Aumüller
VLM
343
2
0
23 May 2025
Less Context, Same Performance: A RAG Framework for Resource-Efficient LLM-Based Clinical NLP
Less Context, Same Performance: A RAG Framework for Resource-Efficient LLM-Based Clinical NLP
Satya Narayana Cheetirala
Ganesh Raut
Dhavalkumar Patel
Fabio Sanatana
Robert Freeman
...
Omar Dawkins
Reba Miller
Randolph M. Steinhagen
Eyal Klang
Prem Timsina
RALM
186
2
0
23 May 2025
Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation
Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation
Li Zhong
Ahmed Ghazal
Jun-Jun Wan
Frederik Zilly
Patrick Mackens
Joachim E. Vollrath
Bogdan Sorin Coseriu
368
1
0
23 May 2025
Neighbour-Driven Gaussian Process Variational Autoencoders for Scalable Structured Latent Modelling
Neighbour-Driven Gaussian Process Variational Autoencoders for Scalable Structured Latent Modelling
Xinxing Shi
Xiaoyu Jiang
Mauricio A. Álvarez
BDL
416
0
0
22 May 2025
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning
ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning
Changtai Zhu
Siyin Wang
Ruijun Feng
Kai Song
Xipeng Qiu
LRM
296
6
0
21 May 2025
HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving
HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving
Zhiwen Chen
Bo Leng
Zhuoren Li
Hanming Deng
Guizhe Jin
Ran Yu
Huanxi Wen
601
2
0
21 May 2025
Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data
Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data
Faeze Ghorbanpour
Daryna Dementieva
Kangyang Luo
345
0
0
20 May 2025
LightRetriever: A LLM-based Text Retrieval Architecture with Extremely Faster Query Inference
LightRetriever: A LLM-based Text Retrieval Architecture with Extremely Faster Query Inference
Guangyuan Ma
Yongliang Ma
Xuanrui Gou
Zhenpeng Su
Ming Zhou
Songlin Hu
RALM
366
1
0
18 May 2025
Telco-oRAG: Optimizing Retrieval-augmented Generation for Telecom Queries via Hybrid Retrieval and Neural Routing
Telco-oRAG: Optimizing Retrieval-augmented Generation for Telecom Queries via Hybrid Retrieval and Neural RoutingIEEE Journal on Selected Areas in Communications (JSAC), 2025
Andrei-Laurentiu Bornea
Fadhel Ayed
Antonio De Domenico
Nicola Piovesan
Tareq Si Salem
Ali Maatouk
246
2
0
17 May 2025
Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models
Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models
Camille Couturier
Spyros Mastorakis
Haiying Shen
Saravan Rajmohan
Victor Rühle
KELM
236
2
0
16 May 2025
Nearest Neighbor Multivariate Time Series Forecasting
Nearest Neighbor Multivariate Time Series ForecastingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024
Huiliang Zhang
Ping Nie
Lijun Sun
Benoit Boulet
AI4TS
273
5
0
16 May 2025
Boosting Text-to-Chart Retrieval through Training with Synthesized Semantic Insights
Boosting Text-to-Chart Retrieval through Training with Synthesized Semantic Insights
Yifan Wu
Lutao Yan
Yizhang Zhu
Yinan Mei
Jiannan Wang
Nan Tang
Yuyu Luo
521
4
0
15 May 2025
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration
Rishabh Agrawal
Himanshu Kumar
356
1
0
13 May 2025
VLM-KG: Multimodal Radiology Knowledge Graph Generation
VLM-KG: Multimodal Radiology Knowledge Graph Generation
Abdullah Abdullah
Seong Tae Kim
199
0
0
13 May 2025
Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency
Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency
Adel Ammar
Anis Koubaa
Omer Nacar
W. Boulila
RALM3DV
271
3
0
13 May 2025
References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation
References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation
Doyoung Kim
Youngjun Lee
Joeun Kim
Jihwan Bang
Hwanjun Song
Susik Yoon
Jae-Gil Lee
561
0
0
10 May 2025
OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval
OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal RetrievalAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Wei Yang
Jingjing Fu
Rongpin Wang
Jinyu Wang
Lei Song
Jiang Bian
349
5
0
10 May 2025
Cost-Effective, Low Latency Vector Search with Azure Cosmos DB
Cost-Effective, Low Latency Vector Search with Azure Cosmos DBProceedings of the VLDB Endowment (PVLDB), 2025
Nitish Upreti
H. Simhadri
Hari Sudan Sundar
Samer Boshra
Balachandar Perumalswamy
...
Kevin Pilch
Simon Moreno
Aayush Kataria
Vipul Vishal
H. Simhadri
215
2
0
09 May 2025
Neural Catalog: Scaling Species Recognition with Catalog of Life-Augmented Generation
Neural Catalog: Scaling Species Recognition with Catalog of Life-Augmented Generation
Fahad Shahbaz Khan
Jun Chen
Youssef Mohamed
Chun-Mei Feng
Mohamed Elhoseiny
VLM
398
1
0
08 May 2025
RAN Cortex: Memory-Augmented Intelligence for Context-Aware Decision-Making in AI-Native Networks
RAN Cortex: Memory-Augmented Intelligence for Context-Aware Decision-Making in AI-Native Networks
Sebastian Barros
AI4TS
210
0
0
06 May 2025
Polar Coordinate-Based 2D Pose Prior with Neural Distance Field
Polar Coordinate-Based 2D Pose Prior with Neural Distance Field
Qi Gan
Sao Mai Nguyen
Eric Fenaux
Stephan Clémençon
Mounîm El Yacoubi
3DH
242
1
0
06 May 2025
Leveraging LLMs to Create Content Corpora for Niche Domains
Leveraging LLMs to Create Content Corpora for Niche Domains
Franklin Zhang
Sonya Zhang
Alon Halevy
CLL
162
0
0
02 May 2025
Efficient Recommendation with Millions of Items by Dynamic Pruning of Sub-Item Embeddings
Efficient Recommendation with Millions of Items by Dynamic Pruning of Sub-Item EmbeddingsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Aleksandr V. Petrov
Craig MacDonald
Nicola Tonellotto
239
2
0
01 May 2025
Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity
Clustering Internet Memes Through Template Matching and Multi-Dimensional SimilarityInternational Conference on Web and Social Media (ICWSM), 2025
Tygo Bloem
Filip Ilievski
317
1
0
30 Apr 2025
Efficient Conversational Search via Topical Locality in Dense Retrieval
Efficient Conversational Search via Topical Locality in Dense RetrievalAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Cristina Ioana Muntean
F. M. Nardini
R. Perego
Guido Rocchietti
Cosimo Rulli
171
0
0
30 Apr 2025
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
Linjuan Wu
Haoran Wei
Huan Lin
Tianhao Li
Baosong Yang
Fei Huang
Weiming Lu
292
2
0
29 Apr 2025
Building Scalable AI-Powered Applications with Cloud Databases: Architectures, Best Practices and Performance Considerations
Building Scalable AI-Powered Applications with Cloud Databases: Architectures, Best Practices and Performance Considerations
Santosh Bhupathi
AI4TSGNN
112
0
0
26 Apr 2025
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation
Yangxinyu Xie
Bowen Jiang
Tanwi Mallick
Joshua Bergerson
John K Hutchison
...
Robert B. Ross
Yan Feng
L. Levy
Weijie J. Su
Camillo J Taylor
238
8
0
24 Apr 2025
DataS^3: Dataset Subset Selection for Specialization
DataS^3: Dataset Subset Selection for Specialization
Neha Hulkund
Alaa Maalouf
Levi Cai
Daniel Yang
Tsun-Hsuan Wang
...
Ken Goldberg
Hannah Kerner
Irene Chen
Yogesh A. Girdhar
Sara Beery
260
2
0
22 Apr 2025
Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation
Intent-aware Diffusion with Contrastive Learning for Sequential RecommendationAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Yuanpeng Qu
Hajime Nobuhara
DiffMAI4TS
264
8
0
22 Apr 2025
From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
Yaxiong Wu
Sheng Liang
Chen Zhang
Yucheng Wang
Yanzhe Zhang
Huifeng Guo
Ruiming Tang
Yong Liu
KELM
380
40
0
22 Apr 2025
ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring
ColBERT-serve: Efficient Multi-Stage Memory-Mapped ScoringEuropean Conference on Information Retrieval (ECIR), 2025
Kaili Huang
Thejas Venkatesh
Uma Dingankar
Antonio Mallia
Daniel Campos
...
Matei A. Zaharia
Kwabena Boahen
Omar Khattab
Saarthak Sarup
Keshav Santhanam
381
2
0
21 Apr 2025
Event2Vec: Processing Neuromorphic Events directly by Representations in Vector Space
Event2Vec: Processing Neuromorphic Events directly by Representations in Vector Space
Wei Fang
Priyadarshini Panda
AI4TS
300
0
0
21 Apr 2025
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering
Xinyu Wang
Jijun Chi
Zhenghan Tai
Tung Sum Thomas Kwok
Muzhi Li
...
Jerry Huang
Jingrui Tian
Fengran Mo
Yufei Cui
Ling Zhou
514
13
0
20 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
Jiliang Ni
Jiachen Pu
Zhongyi Yang
Kun Zhou
Hui Wang
Xiaoliang Xiao
Dakui Wang
Xin Li
Jingfeng Luo
Conggang Hu
495
1
0
18 Apr 2025
Towards Lossless Token Pruning in Late-Interaction Retrieval Models
Towards Lossless Token Pruning in Late-Interaction Retrieval ModelsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Yuxuan Zong
Benjamin Piwowarski
305
0
0
17 Apr 2025
Nemotron-CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Nemotron-CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao
Yu Yang
Y. Fu
Xin Dong
Jane Polak Scowcroft
...
Hongxu Yin
M. Patwary
Yingyan
Jan Kautz
Pavlo Molchanov
308
13
0
17 Apr 2025
CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval
CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based RetrievalAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025
Hao Deng
Haibo Xing
Kanefumi Matsuyama
Moyu Zhang
Jinxin Hu
Hong Wen
Yu Zhang
Xiaoyi Zeng
Jing-Xuan Zhang
288
2
0
17 Apr 2025
Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs
Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs
Hyungwoo Lee
Kihyun Kim
Jinwoo Kim
Jungmin So
Myung-Hoon Cha
H. Kim
James J. Kim
Youngjae Kim
277
1
0
16 Apr 2025
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance
Shixuan Liu
Zhenzhe Zheng
Xiaoyao Huang
Fan Wu
Guihai Chen
Jie Wu
329
1
0
15 Apr 2025
Enhancing Document Retrieval for Curating N-ary Relations in Knowledge Bases
Enhancing Document Retrieval for Curating N-ary Relations in Knowledge Bases
Xing David Wang
Ulf Leser
227
0
0
14 Apr 2025
MURR: Model Updating with Regularized Replay for Searching a Document Stream
MURR: Model Updating with Regularized Replay for Searching a Document StreamEuropean Conference on Information Retrieval (ECIR), 2025
Eugene Yang
Nicola Tonellotto
Dawn J Lawrie
Sean MacAvaney
James Mayfield
Douglas W. Oard
Scott Miller
KELM
227
0
0
14 Apr 2025
Understanding and Optimizing Multi-Stage AI Inference Pipelines
Understanding and Optimizing Multi-Stage AI Inference Pipelines
Abhimanyu Bambhaniya
Hanjiang Wu
Suvinay Subramanian
Sudarshan Srinivasan
Souvik Kundu
Amir Yazdanbakhsh
Suvinay Subramanian
Madhu Kumar
Tushar Krishna
1.0K
0
0
14 Apr 2025
VectorLiteRAG: Latency-Aware and Fine-Grained Resource Partitioning for Efficient RAG
VectorLiteRAG: Latency-Aware and Fine-Grained Resource Partitioning for Efficient RAG
Joo-Young Kim
Divya Mahajan
VLM
788
0
0
11 Apr 2025
Automating quantum feature map design via large language models
Automating quantum feature map design via large language models
Kenya Sakka
K. Mitarai
Keisuke Fujii
209
7
0
10 Apr 2025
Previous
123...567...414243
Next