Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1702.08734
Cited By
Billion-scale similarity search with GPUs
28 February 2017
Jeff Johnson
Matthijs Douze
Hervé Jégou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Billion-scale similarity search with GPUs"
50 / 1,819 papers shown
Title
Boosting Text-to-Chart Retrieval through Training with Synthesized Semantic Insights
Yifan Wu
Lutao Yan
Yizhang Zhu
Yinan Mei
Jiannan Wang
Nan Tang
Yuyu Luo
19
0
0
15 May 2025
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration
Rishabh Agrawal
Himanshu Kumar
21
0
0
13 May 2025
Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency
Adel Ammar
Anis Koubaa
Omer Nacar
W. Boulila
RALM
3DV
35
0
0
13 May 2025
References Indeed Matter? Reference-Free Preference Optimization for Conversational Query Reformulation
Doyoung Kim
Youngjun Lee
Joeun Kim
Jihwan Bang
Hwanjun Song
Susik Yoon
Jae-Gil Lee
29
0
0
10 May 2025
OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval
Wei Yang
Jingjing Fu
R. Wang
Jinyu Wang
Lei Song
Jiang Bian
24
0
0
10 May 2025
Cost-Effective, Low Latency Vector Search with Azure Cosmos DB
Nitish Upreti
Krishnan Sundaram
Hari Sudan Sundar
Samer Boshra
Balachandar Perumalswamy
...
Kevin Pilch
Simon Moreno
Aayush Kataria
Vipul Vishal
H. Simhadri
19
0
0
09 May 2025
VR-RAG: Open-vocabulary Species Recognition with RAG-Assisted Large Multi-Modal Models
F. Khan
Jun Chen
Youssef Mohamed
Chun-Mei Feng
Mohamed Elhoseiny
VLM
33
0
0
08 May 2025
RAN Cortex: Memory-Augmented Intelligence for Context-Aware Decision-Making in AI-Native Networks
Sebastian Barros
AI4TS
26
0
0
06 May 2025
Polar Coordinate-Based 2D Pose Prior with Neural Distance Field
Qi Gan
Sao Mai Nguyen
Eric Fenaux
Stephan Clémençon
Mounîm El Yacoubi
3DH
50
0
0
06 May 2025
30DayGen: Leveraging LLMs to Create a Content Corpus for Habit Formation
Franklin Zhang
Sonya Zhang
Alon Halevy
CLL
34
0
0
02 May 2025
Efficient Recommendation with Millions of Items by Dynamic Pruning of Sub-Item Embeddings
Aleksandr V. Petrov
Craig MacDonald
Nicola Tonellotto
29
0
0
01 May 2025
Efficient Conversational Search via Topical Locality in Dense Retrieval
Cristina Ioana Muntean
F. M. Nardini
R. Perego
Guido Rocchietti
Cosimo Rulli
24
0
0
30 Apr 2025
Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity
Tygo Bloem
Filip Ilievski
19
0
0
30 Apr 2025
Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training
Linjuan Wu
H. Wei
Huan Lin
Tianhao Li
Baosong Yang
Weiming Lu
26
0
0
29 Apr 2025
Building Scalable AI-Powered Applications with Cloud Databases: Architectures, Best Practices and Performance Considerations
Santosh Bhupathi
AI4TS
GNN
27
0
0
26 Apr 2025
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation
Yangxinyu Xie
Bowen Jiang
Tanwi Mallick
Joshua Bergerson
John K Hutchison
...
Robert B. Ross
Yan Feng
L. Levy
Weijie J. Su
Camillo J. Taylor
32
0
0
24 Apr 2025
Intent-aware Diffusion with Contrastive Learning for Sequential Recommendation
Yuanpeng Qu
Hajime Nobuhara
DiffM
AI4TS
27
0
0
22 Apr 2025
DataS^3: Dataset Subset Selection for Specialization
Neha Hulkund
Alaa Maalouf
Levi Cai
Daniel Yang
T. Wang
...
Ken Goldberg
Hannah Kerner
Irene Chen
Yogesh A. Girdhar
Sara Beery
28
0
0
22 Apr 2025
From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs
Yaxiong Wu
Sheng Liang
Chen Zhang
Y. Wang
Y. Zhang
Huifeng Guo
Ruiming Tang
Y. Liu
KELM
38
1
0
22 Apr 2025
ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring
Kaili Huang
Thejas Venkatesh
Uma Dingankar
Antonio Mallia
Daniel Campos
...
Matei A. Zaharia
Kwabena Boahen
Omar Khattab
Saarthak Sarup
Keshav Santhanam
32
0
0
21 Apr 2025
FinSage: A Multi-aspect RAG System for Financial Filings Question Answering
X. Wang
Jijun Chi
Zhenghan Tai
Tung Sum Thomas Kwok
Muzhi Li
...
Suyuchen Wang
Yihong Wu
Jerry Huang
Jingrui Tian
Ling Zhou
67
0
0
20 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
Jiliang Ni
Jiachen Pu
Zhongyi Yang
Kun Zhou
Hui Wang
Xiaoliang Xiao
Dakui Wang
Xin Li
Jingfeng Luo
Conggang Hu
32
0
0
18 Apr 2025
CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval
Hao Deng
Haibo Xing
Kanefumi Matsuyama
Moyu Zhang
Jinxin Hu
Hong Wen
Yu Zhang
Xiaoyi Zeng
Jing-Xuan Zhang
29
0
0
17 Apr 2025
Towards Lossless Token Pruning in Late-Interaction Retrieval Models
Yuxuan Zong
Benjamin Piwowarski
34
0
0
17 Apr 2025
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao
Yu Yang
Y. Fu
Xin Dong
Dan Su
...
Hongxu Yin
M. Patwary
Yingyan
Jan Kautz
Pavlo Molchanov
33
0
0
17 Apr 2025
Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs
Hyungwoo Lee
Kihyun Kim
Jinwoo Kim
Jungmin So
Myung-Hoon Cha
H. Kim
James J. Kim
Youngjae Kim
30
0
0
16 Apr 2025
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance
S. Liu
Zhenzhe Zheng
Xiaoyao Huang
Fan Wu
Guihai Chen
Jie Wu
30
0
0
15 Apr 2025
Enhancing Document Retrieval for Curating N-ary Relations in Knowledge Bases
Xing David Wang
Ulf Leser
26
0
0
14 Apr 2025
Understanding and Optimizing Multi-Stage AI Inference Pipelines
A. Bambhaniya
Hanjiang Wu
Suvinay Subramanian
S. Srinivasan
Souvik Kundu
Amir Yazdanbakhsh
Midhilesh Elavazhagan
Madhu Kumar
Tushar Krishna
112
0
0
14 Apr 2025
MURR: Model Updating with Regularized Replay for Searching a Document Stream
Eugene Yang
Nicola Tonellotto
Dawn J Lawrie
Sean MacAvaney
James Mayfield
Douglas W. Oard
Scott Miller
KELM
33
0
0
14 Apr 2025
An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline
J. Kim
Divya Mahajan
VLM
117
0
0
11 Apr 2025
Impact of Language Guidance: A Reproducibility Study
Cherish Puniani
Advika Sinha
Shree Singhi
Aayan Yadav
VLM
44
0
0
10 Apr 2025
Automating quantum feature map design via large language models
Kenya Sakka
K. Mitarai
Keisuke Fujii
31
2
0
10 Apr 2025
Decentralizing AI Memory: SHIMI, a Semantic Hierarchical Memory Index for Scalable Agent Reasoning
Tooraj Helmi
24
0
0
08 Apr 2025
RETROcode: Leveraging a Code Database for Improved Natural Language to Code Generation
Nathanael Beau
Benoît Crabbé
23
0
0
08 Apr 2025
MicroNN: An On-device Disk-resident Updatable Vector Database
Jeffrey Pound
Floris Chabert
Arjun Bhushan
Ankur Goswami
Anil Pacaci
S. R. Chowdhury
24
0
0
08 Apr 2025
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Hengran Zhang
Keping Bi
J. Guo
Xiaojie Sun
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
RALM
123
0
0
07 Apr 2025
Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG
Hengran Zhang
Minghao Tang
Keping Bi
J. Guo
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
19
0
0
07 Apr 2025
Efficient Constant-Space Multi-Vector Retrieval
Sean MacAvaney
Antonio Mallia
Nicola Tonellotto
28
1
0
02 Apr 2025
Knowledge-Base based Semantic Image Transmission Using CLIP
Chongyang Li
Yanmei He
Tianqian Zhang
Mingjian He
Shouyin Liu
31
0
0
01 Apr 2025
LLM-Assisted Proactive Threat Intelligence for Automated Reasoning
Shuva Paul
Farhad Alemi
Richard Macwan
46
0
0
01 Apr 2025
MetaCLBench: Meta Continual Learning Benchmark on Resource-Constrained Edge Devices
Sijia Li
Young D. Kwon
Lik-Hang Lee
Pan Hui
34
0
0
31 Mar 2025
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
J. Lin
Tian Wang
Kun Qian
LRM
40
2
0
31 Mar 2025
LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN Search
Ximu Zeng
Liwei Deng
Penghao Chen
Xu Chen
Han Su
Kai Zheng
39
0
0
30 Mar 2025
Long-Tail Crisis in Nearest Neighbor Language Models
Yuto Nishida
Makoto Morishita
Hiroyuki Deguchi
Hidetaka Kamigaito
Taro Watanabe
RALM
61
0
0
28 Mar 2025
MemInsight: Autonomous Memory Augmentation for LLM Agents
Rana Salama
Jason (Jinglun) Cai
Michelle Yuan
Anna Currey
Monica Sunkara
Yi Zhang
Yassine Benajiba
LLMAG
RALM
84
1
0
27 Mar 2025
Training-Free Personalization via Retrieval and Reasoning on Fingerprints
Deepayan Das
Davide Talon
Yiming Wang
Massimiliano Mancini
Elisa Ricci
VLM
LRM
45
0
0
24 Mar 2025
GridMind: A Multi-Agent NLP Framework for Unified, Cross-Modal NFL Data Insights
Jordan Chipka
Chris Moyer
Clay Troyer
Tyler Fuelling
Jeremy Hochstedler
AI4CE
27
0
0
24 Mar 2025
What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images
Dongheng Lin
Han Hu
Jianbo Jiao
46
0
0
23 Mar 2025
RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation
Linxi Liang
Jing Gong
Mingwei Liu
Chong Wang
Guangsheng Ou
Yanlin Wang
Xin Peng
Zibin Zheng
ALM
59
0
0
21 Mar 2025
1
2
3
4
...
35
36
37
Next