v1v2 (latest)

The Curious Case of Neural Text Degeneration

22 April 2019

Yejin Choi

Papers citing "The Curious Case of Neural Text Degeneration"

50 / 2,402 papers shown

Jekyll-and-Hyde Tipping Point in an AI's Behavior

Neil F. Johnson

Frank Yingjie Huo

182

29 Apr 2025

Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts

310

29 Apr 2025

LZ Penalty: An information-theoretic repetition penalty for autoregressive language models

373

28 Apr 2025

TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation

890

25 Apr 2025

Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection

Atharva Kulkarni

Yuan-kang Zhang

Joel Ruben Antony Moniz

385

25 Apr 2025

Energy Considerations of Large Language Model Inference and Efficiency OptimizationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

493

24 Apr 2025

ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data

337

23 Apr 2025

What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token PatternsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

337

22 Apr 2025

EmoSEM: Segment and Explain Emotion Stimuli in Visual Art

306

20 Apr 2025

Understanding the Repeat Curse in Large Language Models from a Feature PerspectiveAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

611

19 Apr 2025

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

719

466

18 Apr 2025

LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM LeaderboardInterfaces to Database Systems (IDS), 2025

297

17 Apr 2025

Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation

...

230

17 Apr 2025

Sparks of Science: Hypothesis Generation Using Structured Paper Data

277

17 Apr 2025

MAIN: Mutual Alignment Is Necessary for instruction tuning

251

17 Apr 2025

Provable Secure Steganography Based on Adaptive Dynamic Sampling

Kaiyi Pang

Minhao Bai

DiffM

156

17 Apr 2025

Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models -

354

16 Apr 2025

Multilingual Contextualization of Large Language Models for Document-Level Machine Translation

351

16 Apr 2025

Evaluating the Diversity and Quality of LLM Generated Content

278

16 Apr 2025

Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance

329

15 Apr 2025

Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data

Shuai Zhao

Linchao Zhu

Yi Yang

456

14 Apr 2025

Analysis of Attention in Video Diffusion Transformers

278

14 Apr 2025

Weight Ensembling Improves Reasoning in Language Models

625

14 Apr 2025

Alleviating the Fear of Losing Alignment in LLM Fine-tuningIEEE Symposium on Security and Privacy (S&P), 2025

280

13 Apr 2025

Parameterized Synthetic Text Generation with SimpleStories

349

12 Apr 2025

On The Landscape of Spoken Language Models: A Comprehensive Survey

368

11 Apr 2025

Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation

Alireza Salemi

Chris Samarinas

Hamed Zamani

192

10 Apr 2025

Cellular Development Follows the Path of Minimum Action

252

10 Apr 2025

Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning

608

07 Apr 2025

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models

256

03 Apr 2025

LLM Social Simulations Are a Promising Research Method

507

03 Apr 2025

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

412

02 Apr 2025

Repetitions are not all alike: distinct mechanisms sustain repetition in language models

Matéo Mahaut

Francesca Franzon

344

01 Apr 2025

Collaborative LLM Numerical Reasoning with Local Data Protection

367

01 Apr 2025

Model Hemorrhage and the Robustness Limits of Large Language Models

317

31 Mar 2025

The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning

253

31 Mar 2025

Local Normalization Distortion and the Thermodynamic Formalism of Decoding Strategies for Large Language Models

Tom Kempton

Stuart Burrell

229

27 Mar 2025

Latent Beam Diffusion Models for Generating Visual Sequences

385

26 Mar 2025

TempTest: Local Normalization Distortion and the Detection of Machine-generated TextInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025

272

26 Mar 2025

SparSamp: Efficient Provably Secure Steganography Based on Sparse Sampling

190

25 Mar 2025

SG-Tailor: Inter-Object Commonsense Relationship Reasoning for Scene Graph Manipulation

310

23 Mar 2025

Modifying Large Language Model Post-Training for Diverse Creative Writing

John Joon Young Chung

227

21 Mar 2025

Aligned Probing: Relating Toxic Behavior and Model Internals

286

17 Mar 2025

DAPI: Domain Adaptive Toxicity Probe Vector Intervention for Fine-Grained DetoxificationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

226

17 Mar 2025

Investigating Human-Aligned Large Language Model Uncertainty

253

16 Mar 2025

Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs

289

13 Mar 2025

Can LLMs Understand Time Series Anomalies?International Conference on Learning Representations (ICLR), 2024

Zihao Zhou

Rose Yu

AI4TS

403

13 Mar 2025

Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence Generation

268

12 Mar 2025

Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework

Miguel R. D. Rodrigues

LRM

266

11 Mar 2025

Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

224

11 Mar 2025