Evaluating Distributional Distortion in Neural Language Modeling

International Conference on Learning Representations (ICLR), 2022

24 March 2022

Papers citing "Evaluating Distributional Distortion in Neural Language Modeling"

Why Less is More (Sometimes): A Theory of Data Curation

Elvis Dohmatob

Mohammad Pezeshki

Reyhane Askari Hemmat

157

05 Nov 2025

FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline

104

22 Aug 2025

LLM as a Broken Telephone: Iterative Generation Distorts Information

Annual Meeting of the Association for Computational Linguistics (ACL), 2025

Amr Mohamed

Mingmeng Geng

Michalis Vazirgiannis

Guokan Shang

422

27 Feb 2025

The Best Instruction-Tuning Data are Those That Fit

575

06 Feb 2025

Maximizing the Potential of Synthetic Data: Insights from Random Matrix Theory

International Conference on Learning Representations (ICLR), 2024

349

11 Oct 2024

Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement

256

11 Jun 2024

ModelShield: Adaptive and Robust Watermark against Model Extraction Attack

IEEE Transactions on Information Forensics and Security (IEEE TIFS), 2024

569

03 May 2024

Predict the Next Word: Humans exhibit uncertainty in this task and language models _____

Evgenia Ilia

Wilker Aziz

280

27 Feb 2024

A Tale of Tails: Model Collapse as a Change of Scaling Laws

International Conference on Machine Learning (ICML), 2024

321

107

10 Feb 2024

On Using Distribution-Based Compositionality Assessment to Evaluate Compositional Generalisation in Machine Translation

234

14 Nov 2023

EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling

International Conference on Learning Representations (ICLR), 2023

Siyu Ren

Zhiyong Wu

Kenny Q. Zhu

360

07 Oct 2023

Tailoring Language Generation Models under Total Variation Distance

International Conference on Learning Representations (ICLR), 2023

247

26 Feb 2023

Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning

Neural Information Processing Systems (NeurIPS), 2023

Xinyi Wang

Wanrong Zhu

Michael Stephen Saxon

Mark Steyvers

William Yang Wang

BDL

543

163

27 Jan 2023

Neural-Symbolic Inference for Robust Autoregressive Graph Parsing via Compositional Uncertainty Quantification

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

218

26 Jan 2023

Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics

International Conference on Learning Representations (ICLR), 2022

Shoaib Ahmed Siddiqui

267

20 Sep 2022

How much do language models copy from their training data? Evaluating linguistic novelty in text generation using RAVEN

235

161

18 Nov 2021