Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
1702.08734
Cited By

Billion-scale similarity search with GPUs

Billion-scale similarity search with GPUs

IEEE Transactions on Big Data (TBD), 2017

28 February 2017

ArXiv (abs)PDF HTML

Papers citing "Billion-scale similarity search with GPUs"

50 / 2,114 papers shown

Similarity-Distance-Magnitude Language Models

Similarity-Distance-Magnitude Language Models

84

0

0

30 Oct 2025

Instance-Level Composed Image Retrieval

Instance-Level Composed Image Retrieval

George Retsinas

Nikos Efthymiadis

Yannis Avrithis

160

1

0

29 Oct 2025

AttnCache: Accelerating Self-Attention Inference for LLM Prefill via Attention Cache

AttnCache: Accelerating Self-Attention Inference for LLM Prefill via Attention CacheIACR Cryptology ePrint Archive (IACR ePrint), 2025

208

0

0

29 Oct 2025

Retrieval-Augmented Multimodal Depression Detection

Retrieval-Augmented Multimodal Depression Detection

146

0

0

29 Oct 2025

Category-Aware Semantic Caching for Heterogeneous LLM Workloads

Category-Aware Semantic Caching for Heterogeneous LLM Workloads

Priya Nagpurkar

97

0

0

29 Oct 2025

Iterative Critique-Refine Framework for Enhancing LLM Personalization

Iterative Critique-Refine Framework for Enhancing LLM Personalization

Durga Prasad Maram

Gayathri Akkinapalli

Franck Dernoncourt

Nesreen K. Ahmed

132

0

0

28 Oct 2025

DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts

DualCap: Enhancing Lightweight Image Captioning via Dual Retrieval with Similar Scenes Visual Prompts

327

0

0

28 Oct 2025

ChessQA: Evaluating Large Language Models for Chess Understanding

ChessQA: Evaluating Large Language Models for Chess Understanding

Ashton Anderson

197

1

0

28 Oct 2025

Talk2Ref: A Dataset for Reference Prediction from Scientific Talks

Talk2Ref: A Dataset for Reference Prediction from Scientific Talks

72

0

0

28 Oct 2025

SwiftEmbed: Ultra-Fast Text Embeddings via Static Token Lookup for Real-Time Applications

SwiftEmbed: Ultra-Fast Text Embeddings via Static Token Lookup for Real-Time Applications

Edouard Lansiaux

Antoine Simonet

Eric Wiel

129

0

0

27 Oct 2025

FAIR-RAG: Faithful Adaptive Iterative Refinement for Retrieval-Augmented Generation

FAIR-RAG: Faithful Adaptive Iterative Refinement for Retrieval-Augmented Generation

Mohammad Aghajani Asl

Majid Asgari-Bidhendi

B. Minaei-Bidgoli

108

1

0

25 Oct 2025

Large Language Models Meet Text-Attributed Graphs: A Survey of Integration Frameworks and Applications

Large Language Models Meet Text-Attributed Graphs: A Survey of Integration Frameworks and Applications

288

1

0

24 Oct 2025

Generative Reasoning Recommendation via LLMs

Generative Reasoning Recommendation via LLMs

124

0

0

23 Oct 2025

From Answers to Guidance: A Proactive Dialogue System for Legal Documents

From Answers to Guidance: A Proactive Dialogue System for Legal Documents

325

1

0

22 Oct 2025

Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets

Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets

...

119

3

0

22 Oct 2025

LLMs as Sparse Retrievers:A Framework for First-Stage Product Search

LLMs as Sparse Retrievers:A Framework for First-Stage Product Search

Maarten de Rijke

165

0

0

21 Oct 2025

Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents

Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents

180

0

0

21 Oct 2025

Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection

Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection

196

1

0

21 Oct 2025

Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation

Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation

44

0

0

21 Oct 2025

LIME: Link-based user-item Interaction Modeling with decoupled xor attention for Efficient test time scaling

LIME: Link-based user-item Interaction Modeling with decoupled xor attention for Efficient test time scaling

122

0

0

21 Oct 2025

DVAGen: Dynamic Vocabulary Augmented Generation

DVAGen: Dynamic Vocabulary Augmented Generation

76

0

0

20 Oct 2025

Rethinking On-policy Optimization for Query Augmentation

Rethinking On-policy Optimization for Query Augmentation

Shengyao Zhuang

179

0

0

20 Oct 2025

Cross-Genre Authorship Attribution via LLM-Based Retrieve-and-Rerank

Cross-Genre Authorship Attribution via LLM-Based Retrieve-and-Rerank

Shantanu Agarwal

92

0

0

19 Oct 2025

Exact Nearest-Neighbor Search on Energy-Efficient FPGA Devices

Exact Nearest-Neighbor Search on Energy-Efficient FPGA Devices

William Guglielmo

Salvatore Trani

80

0

0

19 Oct 2025

TACLA: An LLM-Based Multi-Agent Tool for Transactional Analysis Training in Education

TACLA: An LLM-Based Multi-Agent Tool for Transactional Analysis Training in Education

Monika Zamojska

Jarosław A. Chudziak

124

0

0

19 Oct 2025

Blending Learning to Rank and Dense Representations for Efficient and Effective Cascades

Blending Learning to Rank and Dense Representations for Efficient and Effective Cascades

Nicola Tonellotto

Salvatore Trani

124

0

0

18 Oct 2025

Selecting and Combining Large Language Models for Scalable Code Clone Detection

Selecting and Combining Large Language Models for Scalable Code Clone Detection

Muslim Chochlov

Gul Aftab Ahmed

James Vincent Patten

141

0

0

17 Oct 2025

GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework

GRank: Towards Target-Aware and Streamlined Industrial Retrieval with a Generate-Rank Framework

80

1

0

17 Oct 2025

BiMax: Bidirectional MaxSim Score for Document-Level Alignment

BiMax: Bidirectional MaxSim Score for Document-Level Alignment

108

0

0

17 Oct 2025

Operationalising Extended Cognition: Formal Metrics for Corporate Knowledge and Legal Accountability

Operationalising Extended Cognition: Formal Metrics for Corporate Knowledge and Legal Accountability

77

0

0

17 Oct 2025

JEDA: Query-Free Clinical Order Search from Ambient Dialogues

JEDA: Query-Free Clinical Order Search from Ambient Dialogues

Corey D Barrett

Sumana Srivasta

Krishnaram Kenthapadi

122

0

0

16 Oct 2025

Large Scale Retrieval for the LinkedIn Feed using Causal Language Models

Large Scale Retrieval for the LinkedIn Feed using Causal Language Models

Sudarshan Srinivasa Ramanujam

Saurabh Kataria

Siddharth Dangi

...

102

0

0

16 Oct 2025

GemiRec: Interest Quantization and Generation for Multi-Interest Recommendation

GemiRec: Interest Quantization and Generation for Multi-Interest Recommendation

101

0

0

16 Oct 2025

Assessing Web Search Credibility and Response Groundedness in Chat Assistants

Assessing Web Search Credibility and Response Groundedness in Chat Assistants

Matúš Pikuliak

Simon Ostermann

88

0

0

15 Oct 2025

Beyond Static LLM Policies: Imitation-Enhanced Reinforcement Learning for Recommendation

Beyond Static LLM Policies: Imitation-Enhanced Reinforcement Learning for Recommendation

106

0

0

15 Oct 2025

SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression

SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression

128

0

0

14 Oct 2025

VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering

VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering

...

117

0

0

12 Oct 2025

RECON: Reasoning with Condensation for Efficient Retrieval-Augmented Generation

RECON: Reasoning with Condensation for Efficient Retrieval-Augmented Generation

206

0

0

12 Oct 2025

Real2USD: Scene Representations in Universal Scene Description Language

Real2USD: Scene Representations in Universal Scene Description Language

Christopher D. Hsu

Pratik Chaudhari

154

0

0

12 Oct 2025

EA4LLM: A Gradient-Free Approach to Large Language Model Optimization via Evolutionary Algorithms

EA4LLM: A Gradient-Free Approach to Large Language Model Optimization via Evolutionary Algorithms

145

0

0

12 Oct 2025

Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs

Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs

Catherine C. Liu

118

0

0

12 Oct 2025

PrediQL: Automated Testing of GraphQL APIs with LLMs

PrediQL: Automated Testing of GraphQL APIs with LLMs

Mohammad A. Tayebi

109

0

0

12 Oct 2025

Context-Aware Visual Prompting: Automating Geospatial Web Dashboards with Large Language Models and Agent Self-Validation for Decision Support

Context-Aware Visual Prompting: Automating Geospatial Web Dashboards with Large Language Models and Agent Self-Validation for Decision Support

76

0

0

10 Oct 2025

NL2GenSym: Natural Language to Generative Symbolic Rules for SOAR Cognitive Architecture via Large Language Models

NL2GenSym: Natural Language to Generative Symbolic Rules for SOAR Cognitive Architecture via Large Language Models

132

0

0

10 Oct 2025

RAG4Tickets: AI-Powered Ticket Resolution via Retrieval-Augmented Generation on JIRA and GitHub Data

RAG4Tickets: AI-Powered Ticket Resolution via Retrieval-Augmented Generation on JIRA and GitHub Data

40

0

0

09 Oct 2025

Gaze on the Prize: Shaping Visual Attention with Return-Guided Contrastive Learning

Gaze on the Prize: Shaping Visual Attention with Return-Guided Contrastive Learning

188

0

0

09 Oct 2025

ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval

ReasonEmbed: Enhanced Text Embeddings for Reasoning-Intensive Document Retrieval

154

0

0

09 Oct 2025

The Effect of Attention Head Count on Transformer Approximation

The Effect of Attention Head Count on Transformer Approximation

52

0

0

08 Oct 2025

Evaluating Fundus-Specific Foundation Models for Diabetic Macular Edema Detection

Evaluating Fundus-Specific Foundation Models for Diabetic Macular Edema Detection

Franco Javier Arellano

José Ignacio Orlando

96

0

0

08 Oct 2025

Towards Reliable Retrieval in RAG Systems for Large Legal Datasets

Towards Reliable Retrieval in RAG Systems for Large Legal Datasets

Tobias Lingenberg

Giovanni Sartor

Andrea Passerini

232

4

0

08 Oct 2025

1 2 3 4 5...41 42 43