1702.08734
Cited By
Billion-scale similarity search with GPUs
IEEE Transactions on Big Data (TBD), 2017
28 February 2017
Jeff Johnson
Matthijs Douze
Edouard Grave
Papers citing "Billion-scale similarity search with GPUs"
50 / 2,113 papers shown
Retrieval-Augmented Multimodal Depression Detection
Ruibo Hou
Shiyu Teng
Jiaqing Liu
Shurong Chai
Yinhao Li
Lanfen Lin
Yen-Wei Chen
RALM
126
0
0
29 Oct 2025
Instance-Level Composed Image Retrieval
Bill Psomas
George Retsinas
Nikos Efthymiadis
P. Filntisis
Yannis Avrithis
Petros Maragos
Ondřej Chum
Giorgos Tolias
124
1
0
29 Oct 2025
AttnCache: Accelerating Self-Attention Inference for LLM Prefill via Attention Cache
IACR Cryptology ePrint Archive (IACR ePrint), 2025
Dinghong Song
Yuan Feng
Y. Wang
S. Chen
Cyril Guyot
F. Blagojevic
Hyeran Jeon
Pengfei Su
Dong Li
179
0
0
29 Oct 2025
Category-Aware Semantic Caching for Heterogeneous LLM Workloads
Chen Wang
Xunzhuo Liu
Yue Zhu
Alaa Youssef
Priya Nagpurkar
Huamin Chen
81
0
0
29 Oct 2025
Iterative Critique-Refine Framework for Enhancing LLM Personalization
Durga Prasad Maram
Dhruvin Gandhi
Z. Yao
Gayathri Akkinapalli
Franck Dernoncourt
Yu Wang
Ryan Rossi
Nesreen K. Ahmed
112
0
0
28 Oct 2025
DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts
Binbin Li
Guimiao Yang
Zisen Qi
Haiping Wang
Yu Ding
VLM
315
0
0
28 Oct 2025
ChessQA: Evaluating Large Language Models for Chess Understanding
Qianfeng Wen
Zhenwei Tang
Ashton Anderson
ELM
LRM
185
1
0
28 Oct 2025
Talk2Ref: A Dataset for Reference Prediction from Scientific Talks
Frederik Broy
Maike Züfle
Jan Niehues
60
0
0
28 Oct 2025
SwiftEmbed: Ultra-Fast Text Embeddings via Static Token Lookup for Real-Time Applications
Edouard Lansiaux
117
0
0
27 Oct 2025
FAIR-RAG: Faithful Adaptive Iterative Refinement for Retrieval-Augmented Generation
Mohammad Aghajani Asl
Majid Asgari-Bidhendi
B. Minaei-Bidgoli
88
1
0
25 Oct 2025
Large Language Models Meet Text-Attributed Graphs: A Survey of Integration Frameworks and Applications
Guangxin Su
Hanchen Wang
Jianwei Wang
Wenjie Zhang
Ying Zhang
Jian Pei
244
1
0
24 Oct 2025
Generative Reasoning Recommendation via LLMs
Minjie Hong
Zetong Zhou
Zirun Guo
Ziang Zhang
Ruofan Hu
Weinan Gan
Jieming Zhu
Zhou Zhao
LRM
96
0
0
23 Oct 2025
From Answers to Guidance: A Proactive Dialogue System for Legal Documents
Ashish Chouhan
Michael Gertz
AILaw
321
0
0
22 Oct 2025
Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets
Jiashi Feng
Xiu Li
Jing Lin
Jiahang Liu
Gaohong Liu
...
S. S. Wang
Qianyi Wu
Fan Yang
J. Zhang
Xuanmeng Zhang
VGen
111
2
0
22 Oct 2025
LLMs as Sparse Retrievers: A Framework for First-Stage Product Search
Hongru Song
Yu-an Liu
Ruqing Zhang
Jiafeng Guo
Maarten de Rijke
Sen Li
W. Peng
Fuyu Lv
Xueqi Cheng
143
0
0
21 Oct 2025
Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection
Ji Du
Xin Wang
Fangwei Hao
Mingyang Yu
Chunyuan Chen
Jiesheng Wu
Bin Wang
Jing Xu
Ping Li
180
0
0
21 Oct 2025
Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents
Guangfu Guo
Xiaoqian Lu
Yue Feng
LRM
176
0
0
21 Oct 2025
Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation
Wei-Chia Chang
Yan-Ann Chen
VLM
44
0
0
21 Oct 2025
LIME: Link-based user-item Interaction Modeling with decoupled xor attention for Efficient test time scaling
Yunjiang Jiang
Ayush Agarwal
Yang Liu
Bi Xue
OffRL
110
0
0
21 Oct 2025
DVAGen: Dynamic Vocabulary Augmented Generation
Wei Du
Nuowei Liu
Jie Wang
Jiahao Kuang
Tao Ji
X. Wang
Y. Wu
56
0
0
20 Oct 2025
Rethinking On-policy Optimization for Query Augmentation
Zhichao Xu
Shengyao Zhuang
Xueguang Ma
Bingsen Chen
Yijun Tian
Fengran Mo
Jie Cao
Vivek Srikumar
RALM
LRM
135
0
0
20 Oct 2025
Cross-Genre Authorship Attribution via LLM-Based Retrieve-and-Rerank
Shantanu Agarwal
Joel Barry
Steven Fincke
Scott Miller
72
0
0
19 Oct 2025
Exact Nearest-Neighbor Search on Energy-Efficient FPGA Devices
Patrizio Dazzi
William Guglielmo
F. M. Nardini
R. Perego
Salvatore Trani
68
0
0
19 Oct 2025
TACLA: An LLM-Based Multi-Agent Tool for Transactional Analysis Training in Education
Monika Zamojska
Jarosław A. Chudziak
LLMAG
120
0
0
19 Oct 2025
Blending Learning to Rank and Dense Representations for Efficient and Effective Cascades
F. M. Nardini
R. Perego
Nicola Tonellotto
Salvatore Trani
104
0
0
18 Oct 2025
Selecting and Combining Large Language Models for Scalable Code Clone Detection
Muslim Chochlov
Gul Aftab Ahmed
James Vincent Patten
Yuanhua Han
Guoxian Lu
David Gregg
J. Buckley
125
0
0
17 Oct 2025
GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework
Yijia Sun
Shanshan Huang
Zhiyuan Guan
Qiang Luo
Ruiming Tang
Kun Gai
Guorui Zhou
68
0
0
17 Oct 2025
BiMax: Bidirectional MaxSim Score for Document-Level Alignment
Xiaotian Wang
T. Utsuro
Masaaki Nagata
96
0
0
17 Oct 2025
Operationalising Extended Cognition: Formal Metrics for Corporate Knowledge and Legal Accountability
Elija Perrier
53
0
0
17 Oct 2025
JEDA: Query-Free Clinical Order Search from Ambient Dialogues
Praphul Singh
Corey D Barrett
Sumana Srivasta
Amitabh Saikia
Irfan Bulu
Sri Gadde
Krishnaram Kenthapadi
118
0
0
16 Oct 2025
Large Scale Retrieval for the LinkedIn Feed using Causal Language Models
Sudarshan Srinivasa Ramanujam
Antonio Alonso
Saurabh Kataria
Siddharth Dangi
Akhilesh Gupta
...
Annie Xiao
Caitlin Kolb
Thomas Kistler
Zach Moore
Hamed Firooz
RALM
90
0
0
16 Oct 2025
GemiRec: Interest Quantization and Generation for Multi-Interest Recommendation
Zhibo Wu
Yunfan Wu
Quan Liu
Lin Jiang
Ping Yang
Yao Hu
81
0
0
16 Oct 2025
Assessing Web Search Credibility and Response Groundedness in Chat Assistants
Ivan Vykopal
Matúš Pikuliak
Simon Ostermann
Marian Simko
72
0
0
15 Oct 2025
Beyond Static LLM Policies: Imitation-Enhanced Reinforcement Learning for Recommendation
Yi Zhang
Lili Xie
Ruihong Qiu
Jiajun Liu
Sen Wang
OffRL
74
0
0
15 Oct 2025
SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression
Biao Zhang
Lixin Chen
Tong Liu
Bo Zheng
92
0
0
14 Oct 2025
VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering
Zhenghan Tai
Hanwei Wu
Qingchen Hu
Jijun Chi
Hailin He
...
Fengran Mo
Xinyue Yu
Yufei Cui
Ling Zhou
Xinyu Wang
109
0
0
12 Oct 2025
RECON: Reasoning with Condensation for Efficient Retrieval-Augmented Generation
Zhichao Xu
Minheng Wang
Y. X. R. Wang
Wenqian Ye
Yuntao Du
Yunpu Ma
Yijun Tian
OffRL
RALM
178
0
0
12 Oct 2025
Real2USD: Scene Representations in Universal Scene Description Language
Christopher D. Hsu
Pratik Chaudhari
LM&Ro
134
0
0
12 Oct 2025
EA4LLM: A Gradient-Free Approach to Large Language Model Optimization via Evolutionary Algorithms
Wentao Liu
Siyu Song
Hao Hao
Aimin Zhou
137
0
0
12 Oct 2025
Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs
SuYang Xi
Chenxi Yang
Hong Ding
Yiqing Ni
Catherine C. Liu
Yunhao Liu
Chengqi Zhang
LRM
98
0
0
12 Oct 2025
PrediQL: Automated Testing of GraphQL APIs with LLMs
Shaolun Liu
Sina Marefat
Omar Tsai
Yu Chen
Zecheng Deng
Jia Wang
Mohammad A. Tayebi
93
0
0
12 Oct 2025
Context-Aware Visual Prompting: Automating Geospatial Web Dashboards with Large Language Models and Agent Self-Validation for Decision Support
Haowen Xu
Jose Tupayachi
Xiao-Ying Yu
48
0
0
10 Oct 2025
NL2GenSym: Natural Language to Generative Symbolic Rules for SOAR Cognitive Architecture via Large Language Models
Fang Yuan
Junjie Zeng
Yue Hu
Zhengqiu Zhu
Quanjun Yin
Yuxiang Xie
LLMAG
120
0
0
10 Oct 2025
RAG4Tickets: AI-Powered Ticket Resolution via Retrieval-Augmented Generation on JIRA and GitHub Data
Mohammad Baqar
40
0
0
09 Oct 2025
Gaze on the Prize: Shaping Visual Attention with Return-Guided Contrastive Learning
Andrew Lee
Ian Chuang
D. Gao
Kai Fukazawa
Iman Soltani
132
0
0
09 Oct 2025
ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval
Jianlyu Chen
Junwei Lan
Chaofan Li
Defu Lian
Zheng Liu
RALM
ReLM
LRM
138
0
0
09 Oct 2025
The Effect of Attention Head Count on Transformer Approximation
Penghao Yu
Haotian Jiang
Zeyu Bao
Ruoxi Yu
Qianxiao Li
40
0
0
08 Oct 2025
Evaluating Fundus-Specific Foundation Models for Diabetic Macular Edema Detection
Franco Javier Arellano
José Ignacio Orlando
MedIm
96
0
0
08 Oct 2025
Towards Reliable Retrieval in RAG Systems for Large Legal Datasets
Markus Reuter
Tobias Lingenberg
Rūta Liepiņa
F. Lagioia
Marco Lippi
Giovanni Sartor
Andrea Passerini
Burcu Sayin
AILaw
RALM
216
1
0
08 Oct 2025
Relative Positioning Based Code Chunking Method For Rich Context Retrieval In Repository Level Code Completion Task With Code Language Model
Imranur Rahman
Md Rayhanur Rahman
61
1
0
07 Oct 2025