LLMmap: Fingerprinting For Large Language Models

22 July 2024

Dario Pasquini

Evgenios M. Kornaropoulos

G. Ateniese

ArXiv (abs)PDF HTML

Main:13 Pages

13 Figures

Bibliography:3 Pages

11 Tables

Appendix:9 Pages

Abstract

We introduce LLMmap, a first-generation fingerprinting attack targeted at LLM-integrated applications. LLMmap employs an active fingerprinting approach, sending carefully crafted queries to the application and analyzing the responses to identify the specific LLM model in use. With as few as 8 interactions, LLMmap can accurately identify LLMs with over 95% accuracy. More importantly, LLMmap is designed to be robust across different application layers, allowing it to identify LLMs operating under various system prompts, stochastic sampling hyperparameters, and even complex generation frameworks such as RAG or Chain-of-Thought.

View on arXiv

Comments on this paper