Billion-scale similarity search with GPUs

IEEE Transactions on Big Data (TBD), 2017

28 February 2017

Papers citing "Billion-scale similarity search with GPUs"

50 / 2,117 papers shown

R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning

367

26 May 2025

Optimized Text Embedding Models and Benchmarks for Amharic Passage RetrievalAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

320

25 May 2025

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM

182

25 May 2025

Enhancing Training Data Attribution with Representational Optimization

466

24 May 2025

Improving Ad matching via Cluster-Adaptive Keyword Expansion and Relevance tuning

149

24 May 2025

VIBE: Vector Index Benchmark for Embeddings

343

23 May 2025

Less Context, Same Performance: A RAG Framework for Resource-Efficient LLM-Based Clinical NLP

Satya Narayana Cheetirala

...

Randolph M. Steinhagen

Eyal Klang

Prem Timsina

RALM

186

23 May 2025

Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation

368

23 May 2025

Neighbour-Driven Gaussian Process Variational Autoencoders for Scalable Structured Latent Modelling

416

22 May 2025

ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning

296

21 May 2025

HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving

601

21 May 2025

Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data

Faeze Ghorbanpour

Daryna Dementieva

Kangyang Luo

345

20 May 2025

LightRetriever: A LLM-based Text Retrieval Architecture with Extremely Faster Query Inference

366

18 May 2025

Telco-oRAG: Optimizing Retrieval-augmented Generation for Telecom Queries via Hybrid Retrieval and Neural RoutingIEEE Journal on Selected Areas in Communications (JSAC), 2025

Andrei-Laurentiu Bornea

246

17 May 2025

Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models

236

16 May 2025

Nearest Neighbor Multivariate Time Series ForecastingIEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024

273

16 May 2025

Boosting Text-to-Chart Retrieval through Training with Synthesized Semantic Insights

521

15 May 2025

Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration

Rishabh Agrawal

Himanshu Kumar

356

13 May 2025

VLM-KG: Multimodal Radiology Knowledge Graph Generation

Abdullah Abdullah

Seong Tae Kim

199

13 May 2025

Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency

271

13 May 2025

References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation

561

10 May 2025

OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal RetrievalAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

349

10 May 2025

Cost-Effective, Low Latency Vector Search with Azure Cosmos DBProceedings of the VLDB Endowment (PVLDB), 2025

Balachandar Perumalswamy

...

215

09 May 2025

Neural Catalog: Scaling Species Recognition with Catalog of Life-Augmented Generation

398

08 May 2025

RAN Cortex: Memory-Augmented Intelligence for Context-Aware Decision-Making in AI-Native Networks

Sebastian Barros

AI4TS

210

06 May 2025

Polar Coordinate-Based 2D Pose Prior with Neural Distance Field

242

06 May 2025

Leveraging LLMs to Create Content Corpora for Niche Domains

162

02 May 2025

Efficient Recommendation with Millions of Items by Dynamic Pruning of Sub-Item EmbeddingsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

Aleksandr V. Petrov

Craig MacDonald

Nicola Tonellotto

239

01 May 2025

Clustering Internet Memes Through Template Matching and Multi-Dimensional SimilarityInternational Conference on Web and Social Media (ICWSM), 2025

Tygo Bloem

Filip Ilievski

317

30 Apr 2025

Efficient Conversational Search via Topical Locality in Dense RetrievalAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

Cristina Ioana Muntean

171

30 Apr 2025

Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training

292

29 Apr 2025

Building Scalable AI-Powered Applications with Cloud Databases: Architectures, Best Practices and Performance Considerations

Santosh Bhupathi

AI4TS GNN

112

26 Apr 2025

A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation

...

238

24 Apr 2025

DataS^3: Dataset Subset Selection for Specialization

...

260

22 Apr 2025

Intent-aware Diffusion with Contrastive Learning for Sequential RecommendationAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

Yuanpeng Qu

Hajime Nobuhara

DiffM AI4TS

264

22 Apr 2025

From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs

380

22 Apr 2025

ColBERT-serve: Efficient Multi-Stage Memory-Mapped ScoringEuropean Conference on Information Retrieval (ECIR), 2025

...

381

21 Apr 2025

Event2Vec: Processing Neuromorphic Events directly by Representations in Vector Space

Wei Fang

Priyadarshini Panda

AI4TS

300

21 Apr 2025

FinSage: A Multi-aspect RAG System for Financial Filings Question Answering

...

514

20 Apr 2025

From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs

495

18 Apr 2025

Towards Lossless Token Pruning in Late-Interaction Retrieval ModelsAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2025

Yuxuan Zong

Benjamin Piwowarski

305

17 Apr 2025