Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding

International Conference on Computational Linguistics (COLING), 2024

5 September 2024

Cheng Wang

Yiwei Wang

Bryan Hooi

Yujun Cai

Nanyun Peng

Kai-Wei Chang

Papers citing "Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding"

False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize

180

04 Sep 2025

SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks

...

372

12 Jun 2025

Exploring the limits of strong membership inference attacks on large language models

Jamie Hayes

Ilia Shumailov

Christopher A. Choquette-Choo

Matthew Jagielski

G. Kaissis

...

Matthieu Meeus

Yves-Alexandre de Montjoye

Franziska Boenisch

Adam Dziedzic

A. Feder Cooper

340

24 May 2025

On Membership Inference Attacks in Knowledge Distillation

Ziyao Cui

Minxing Zhang

Jian Pei

249

17 May 2025

Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

1.8K

31 Oct 2024

ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods

576

23 Jun 2024

Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding

North American Chapter of the Association for Computational Linguistics (NAACL), 2024

215

04 May 2024

Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models

Jianyi Zhang

Hao Frank Yang

Hai "Helen" Li

675

03 Apr 2024

DE-COP: Detecting Copyrighted Content in Language Models Training Data

André V. Duarte

Xuandong Zhao

Arlindo L. Oliveira

Lei Li

377

15 Feb 2024

Low-Cost High-Power Membership Inference Attacks

International Conference on Machine Learning (ICML), 2023

Sajjad Zarifzadeh

Philippe Liu

Reza Shokri

332

06 Dec 2023

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Albert Gu

Tri Dao

Mamba

558

5,168

01 Dec 2023

NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Oscar Sainz

Jon Ander Campos

Iker García-Ferrero

Julen Etxaniz

Oier López de Lacalle

Eneko Agirre

233

260

27 Oct 2023

Proving Test Set Contamination in Black Box Language Models

International Conference on Learning Representations (ICLR), 2023

Yonatan Oren

Nicole Meister

Niladri Chatterji

Faisal Ladhak

Tatsunori B. Hashimoto

HILM

359

197

26 Oct 2023

Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

International Conference on Learning Representations (ICLR), 2023

Fatemehsadat Mireshghallah

394

21 Sep 2023

Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities

221

107

24 Aug 2023

Scalable Membership Inference Attacks via Quantile Regression

Neural Information Processing Systems (NeurIPS), 2023

257

07 Jul 2023

Membership Inference Attacks against Language Models via Neighbourhood Comparison

Annual Meeting of the Association for Computational Linguistics (ACL), 2023

Justus Mattern

Fatemehsadat Mireshghallah

Zhijing Jin

Bernhard Schölkopf

Mrinmaya Sachan

Taylor Berg-Kirkpatrick

MIALM

471

269

29 May 2023

Trusting Your Evidence: Hallucinate Less with Context-aware Decoding

North American Chapter of the Association for Computational Linguistics (NAACL), 2023

Weijia Shi

Luke Zettlemoyer

234

290

24 May 2023

Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

582

161

28 Apr 2023

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

International Conference on Machine Learning (ICML), 2023

...

384

1,621

03 Apr 2023

LLaMA: Open and Efficient Foundation Language Models

...

5.1K

17,636

27 Feb 2023

Extracting Training Data from Diffusion Models

USENIX Security Symposium (USENIX Security), 2023

473

800

30 Jan 2023

GPT-NeoX-20B: An Open-Source Autoregressive Language Model

...

362

949

14 Apr 2022

Quantifying Privacy Risks of Masked Language Models Using Membership Inference Attacks

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022

Fatemehsadat Mireshghallah

Kartik Goyal

Archit Uniyal

Taylor Berg-Kirkpatrick

Reza Shokri

MIALM

464

207

08 Mar 2022

Membership Inference Attacks From First Principles

672

910

07 Dec 2021

On the Importance of Difficulty Calibration in Membership Inference Attacks

International Conference on Learning Representations (ICLR), 2021

Lauren Watson

Chuan Guo

Graham Cormode

Alex Sablayrolles

289

172

15 Nov 2021

DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts

Annual Meeting of the Association for Computational Linguistics (ACL), 2021

Yejin Choi

569

443

07 May 2021

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

...

894

2,535

31 Dec 2020

Membership Inference Attacks against Machine Learning Models

898

4,792

18 Oct 2016