Scalable Extraction of Training Data from (Production) Language Models

28 November 2023

Christopher A. Choquette-Choo

Papers citing "Scalable Extraction of Training Data from (Production) Language Models"

50 / 281 papers shown

Voice Jailbreak Attacks Against GPT-4o

Michael Backes

287

29 May 2024

Aya 23: Open Weight Releases to Further Multilingual Progress

...

452

125

23 May 2024

Data Contamination Calibration for Black-box LLMs

207

20 May 2024

Token-wise Influential Training Data Retrieval for Large Language Models

242

20 May 2024

Risks and Opportunities of Open-Source Generative AI

...

428

14 May 2024

Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

Jindong Gu

315

09 May 2024

Revisiting character-level adversarial attacks

244

07 May 2024

SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning

689

28 Apr 2024

Near to Mid-term Risks and Opportunities of Open-Source Generative AI

Francisco Eiras

Aleksandar Petrov

Bertie Vidgen

Christian Schroeder de Witt

Fabio Pizzati

...

Paul Röttger

291

25 Apr 2024

Evaluating the Efficacy of Large Language Models in Identifying Phishing Attempts

Het Patel

Umair Rehman

Farkhund Iqbal

283

23 Apr 2024

Rethinking LLM Memorization through the Lens of Adversarial Compression

J. Zico Kolter

501

23 Apr 2024

Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?

297

19 Apr 2024

Private Attribute Inference from Images with Vision-Language Models

Martin Vechev

253

16 Apr 2024

LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation Models

202

12 Apr 2024

AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs

Zeyi Liao

Huan Sun

AAML

310

151

11 Apr 2024

Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

294

09 Apr 2024

Initial Exploration of Zero-Shot Privacy Utility Tradeoffs in Tabular Data Using GPT-4

Bishwas Mandal

G. Amariucai

Shuangqing Wei

231

07 Apr 2024

Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

Jingyu Zhang

Marc Marone

Tianjian Li

Benjamin Van Durme

Daniel Khashabi

581

05 Apr 2024

Digital Forgetting in Large Language Models: A Survey of Unlearning Methods

Artificial Intelligence Review (Artif Intell Rev), 2024

Alberto Blanco-Justicia

N. Jebreel

Benet Manzanares-Salor

333

02 Apr 2024

DOCMASTER: A Unified Platform for Annotation, Training, & Inference in Document Question-Answering

200

30 Mar 2024

Localizing Paragraph Memorization in Language Models

209

28 Mar 2024

A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish

Masahiro Kaneko

Timothy Baldwin

PILM

264

24 Mar 2024

Dated Data: Tracing Knowledge Cutoffs in Large Language Models

Daniel Khashabi

Benjamin Van Durme

283

19 Mar 2024

MELTing point: Mobile Evaluation of Language Transformers

Stefanos Laskaridis

Kleomenis Katevas

Lorenzo Minto

Hamed Haddadi

301

19 Mar 2024

Securing Large Language Models: Threats, Vulnerabilities and Responsible Practices

429

19 Mar 2024

What Was Your Prompt? A Remote Keylogging Attack on AI Assistants

USENIX Security Symposium (USENIX Security), 2024

236

14 Mar 2024

Gemma: Open Models Based on Gemini Research and Technology

Gemma Team

Thomas Mesnard

...

593

836

13 Mar 2024

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Swapnaja Achintalwar

Adriana Alvarado Garcia

...

Shalisha Witherspoon

Marcel Zalmanovici

307

09 Mar 2024

On Protecting the Data Privacy of Large Language Models (LLMs): A Survey

International Conference on Mathematics and Computing (ICMC), 2024

408

158

08 Mar 2024

A Safe Harbor for AI Evaluation and Red Teaming

International Conference on Machine Learning (ICML), 2024

...

255

07 Mar 2024

Here Comes The AI Worm: Unleashing Zero-click Worms that Target GenAI-Powered Applications

Stav Cohen

Ron Bitton

Ben Nassi

434

05 Mar 2024

Training Machine Learning models at the Edge: A Survey

Aymen Rayane Khouas

Mohamed Reda Bouadjenek

Hakim Hacid

Sunil Aryal

425

05 Mar 2024

Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs

Aly M. Kassem

Omar Mahmoud

Niloofar Mireshghallah

Yejin Choi

415

05 Mar 2024

Large language models surpass human experts in predicting neuroscience results

...

224

121

04 Mar 2024

Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy

359

02 Mar 2024

Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap

295

29 Feb 2024

Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction

Yinpeng Dong

247

105

28 Feb 2024

On the Challenges and Opportunities in Generative AI

...

759

28 Feb 2024

Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

Zhenting Qi

269

27 Feb 2024

Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models

Marvin Li

421

26 Feb 2024

Fast Adversarial Attacks on Language Models In One GPU Minute

Vinu Sankar Sadasivan

Shoumik Saha

Gaurang Sriramanan

Priyatham Kattakinda

Atoosa Malemir Chegini

Soheil Feizi

MIALM

335

23 Feb 2024

Watermarking Makes Language Models Radioactive

Pierre Fernandez

184

22 Feb 2024

Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment

288

21 Feb 2024

Privacy-Preserving Instructions for Aligning Large Language Models

456

21 Feb 2024

Generative AI Security: Challenges and Countermeasures

219

20 Feb 2024

Unveiling the Magic: Investigating Attention Distillation in Retrieval-augmented Generation

262

19 Feb 2024

How Susceptible are Large Language Models to Ideological Manipulation?

Taiwei Shi

314

18 Feb 2024

Chain-of-Thought Reasoning Without Prompting

Xuezhi Wang

Denny Zhou

ReLM LRM

618

205

15 Feb 2024

DE-COP: Detecting Copyrighted Content in Language Models Training Data

André V. Duarte

Xuandong Zhao

Arlindo L. Oliveira

Lei Li

378

15 Feb 2024

Copyright Traps for Large Language Models

Matthieu Meeus

Igor Shilov

Manuel Faysse

Yves-Alexandre de Montjoye

345

14 Feb 2024