Self-Consuming Generative Models Go MAD

International Conference on Learning Representations (ICLR), 2023

4 July 2023

Sina Alemohammad

Josue Casco-Rodriguez

Richard G. Baraniuk

Papers citing "Self-Consuming Generative Models Go MAD"

50 / 122 papers shown

Aligning Instruction Tuning with Pre-training

...

16 Jan 2025

Spatial Information Integration in Small Language Models for Document Layout Generation and Classification

ACM Symposium on Applied Computing (SAC), 2025

Pablo Melendez

Clemens Havas

09 Jan 2025

Malware Classification using a Hybrid Hidden Markov Model-Convolutional Neural Network

Ritik Mehta

Olha Jurecková

Mark Stamp

25 Dec 2024

Rate of Model Collapse in Recursive Training

International Conference on Artificial Intelligence and Statistics (AISTATS), 2024

A. Suresh

A. Thangaraj

Aditya Nanda Kishore Khandavally

23 Dec 2024

Why Does ChatGPT "Delve" So Much? Exploring the Sources of Lexical Overrepresentation in Large Language Models

International Conference on Computational Linguistics (COLING), 2024

Tom S. Juzek

Zina B. Ward

16 Dec 2024

The Superalignment of Superhuman Intelligence with Large Language Models

Science China Information Sciences (Sci. China Inf. Sci.), 2024

15 Dec 2024

Image Generation Diversity Issues and How to Tame Them

Computer Vision and Pattern Recognition (CVPR), 2024

25 Nov 2024

Uncovering Hidden Subspaces in Video Diffusion Models Using Re-Identification

07 Nov 2024

DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios

Neural Information Processing Systems (NeurIPS), 2024

31 Oct 2024

Universality of the π²/6 Pathway in Avoiding Model Collapse

Apratim Dey

D. Donoho

30 Oct 2024

Shallow Diffuse: Robust and Invisible Watermarking through Low-Dimensional Subspaces in Diffusion Models

28 Oct 2024

Intention Is All You Need

Advait Sarkar

24 Oct 2024

Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World

Joshua Kazdan

Rylan Schaeffer

Apratim Dey

Matthias Gerstgrasser

Rafael Rafailov

D. Donoho

Sanmi Koyejo

22 Oct 2024

Bias Amplification: Large Language Models as Increasingly Biased Media

Adriano Soares Koshiyama

19 Oct 2024

Data Diversity as Implicit Regularization: How Does Diversity Shape the Weight Space of Deep Neural Networks?

Yang Ba

M. Mancenido

Rong Pan

18 Oct 2024

Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts

18 Oct 2024

Towards Reliable Verification of Unauthorized Data Usage in Personalized Text-to-Image Diffusion Models

IEEE Symposium on Security and Privacy (S&P), 2024

Boheng Li

Yiming Li

14 Oct 2024

Will the Inclusion of Generated Data Amplify Bias Across Generations in Future Image Classification Models?

14 Oct 2024

Maximizing the Potential of Synthetic Data: Insights from Random Matrix Theory

International Conference on Learning Representations (ICLR), 2024

11 Oct 2024

Strong Model Collapse

International Conference on Learning Representations (ICLR), 2024

Elvis Dohmatob

Yunzhen Feng

Arjun Subramonian

Julia Kempe

07 Oct 2024

Self-Improving Diffusion Models with Synthetic Data

29 Aug 2024

Self-Directed Synthetic Dialogues and Revisions Technical Report

Nathan Lambert

Luca Soldaini

25 Jul 2024

DataDream: Few-shot Guided Dataset Generation

15 Jul 2024

Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling

02 Jul 2024

A survey on the impacts of recommender systems on users, items, and human-AI ecosystems

Luca Pappalardo

...

Gabriele Barlacchi

Virginia Morini

Valentina Pansanella

D. Pedreschi

Emanuele Ferragina

29 Jun 2024

RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold

Amrith Rajagopal Setlur

20 Jun 2024

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models

Jie Chen

Yupeng Zhang

Bingning Wang

Wayne Xin Zhao

Ji-Rong Wen

Weipeng Chen

18 Jun 2024

Understanding Hallucinations in Diffusion Models through Mode Interpolation

13 Jun 2024

Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement

11 Jun 2024

JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits

Xue Lin

Cho-Jui Hsieh

Ruoxi Jia

06 Jun 2024

Exploring the Escalation of Source Bias in User, Data, and Recommender System Feedback Loop

28 May 2024

Sociotechnical Implications of Generative Artificial Intelligence for Information Access

Bhaskar Mitra

Henriette Cramer

Olya Gurevich

19 May 2024

Crowdsourcing with Enhanced Data Quality Assurance: An Efficient Approach to Mitigate Resource Scarcity Challenges in Training Large Language Models for Healthcare

16 May 2024

At the edge of a generative cultural precipice

Diego Porres

Alex Gomez-Villa

30 Apr 2024

A Survey on Self-Evolution of Large Language Models

Ting-En Lin

Fei Huang

Jingren Zhou

22 Apr 2024

RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

Bruno Castro da Silva

12 Apr 2024

Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models

Xiaoxue Yang

Akane Sano

12 Apr 2024

G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images

Qi Wu

11 Apr 2024

Heat Death of Generative Models in Closed-Loop Learning

IEEE Conference on Decision and Control (CDC), 2024

02 Apr 2024

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

Matthias Gerstgrasser

...

Diyi Yang

01 Apr 2024

Structured Evaluation of Synthetic Tabular Data

Scott Cheng-Hsin Yang

Baxter S. Eaves

Michael Schmidt

Ken Swanson

Patrick Shafto

15 Mar 2024

Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias

Conference on Fairness, Accountability and Transparency (FAccT), 2024

Sierra Wyllie

Ilia Shumailov

Nicolas Papernot

12 Mar 2024

Large Language Models for Data Annotation: A Survey

Huan Liu

21 Feb 2024

Towards Theoretical Understandings of Self-Consuming Generative Models

19 Feb 2024

How to Train Data-Efficient LLMs

Julian McAuley

15 Feb 2024

Model Collapse Demystified: The Case of Regression

Elvis Dohmatob

Yunzhen Feng

Julia Kempe

12 Feb 2024

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

Haoyu Wang

Guozheng Ma

Ziqiao Meng

Zeyu Qin

Li Shen

...

12 Feb 2024

Self-Correcting Self-Consuming Loops for Generative Model Training

International Conference on Machine Learning (ICML), 2024

11 Feb 2024

A Tale of Tails: Model Collapse as a Change of Scaling Laws

International Conference on Machine Learning (ICML), 2024

10 Feb 2024

Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

...

Nikolay Malkin

09 Feb 2024