v1v2 (latest)

The Curious Case of Neural Text Degeneration

22 April 2019

Yejin Choi

Papers citing "The Curious Case of Neural Text Degeneration"

50 / 2,402 papers shown

Probability-Consistent Preference Optimization for Enhanced LLM ReasoningAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

227

29 May 2025

Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness

257

29 May 2025

Does Machine Unlearning Truly Remove Knowledge?

...

271

29 May 2025

Large Language Model Meets Constraint PropagationInternational Joint Conference on Artificial Intelligence (IJCAI), 2024

102

29 May 2025

Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal TransportAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Yuu Jinnai

199

29 May 2025

First Steps Towards Overhearing LLM Agents: A Case Study With Dungeons & Dragons Gameplay

249

28 May 2025

Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives

Ander Artola Velasco

Stratis Tsirtsis

William Orchard

Manuel Gomez Rodriguez

388

27 May 2025

RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models

203

27 May 2025

Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies

Terrance Liu

Shuyi Wang

Daniel Preotiuc-Pietro

Yash Chandarana

Chirag Gupta

278

27 May 2025

Frictional Agent Alignment Framework: Slow Down and Don't Break ThingsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

308

26 May 2025

Learning to Reason without External Rewards

429

26 May 2025

Foundations of Top-

k

Decoding For Language Models

208

25 May 2025

LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models

...

431

25 May 2025

Self-Training Large Language Models with Confident Reasoning

183

23 May 2025

Distilling LLM Agent into Small Models with Retrieval and Code Tools

765

23 May 2025

Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression

257

22 May 2025

Exploring the Relationship Between Diversity and Quality in Ad Text Generation

230

22 May 2025

CASTILLO: Characterizing Response Length Distributions of Large Language Models

Daniel F. Perez-Ramirez

Dejan Kostic

Magnus Boman

185

22 May 2025

Optimal Policy Minimum Bayesian Risk

Ramón Fernandez Astudillo

246

22 May 2025

CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models

202

22 May 2025

The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning

468

21 May 2025

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

233

21 May 2025

Advancing LLM Safe Alignment with Safety Representation Ranking

222

21 May 2025

Unraveling Interwoven Roles of Large Language Models in Authorship Privacy: Obfuscation, Mimicking, and Verification

313

20 May 2025

RLVR-World: Training World Models with Reinforcement Learning

508

20 May 2025

Text Generation Beyond Discrete Token Sampling

514

20 May 2025

Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs

Morgan Lindsay Heisler

181

20 May 2025

AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models

442

20 May 2025

GuRE:Generative Query REwriter for Legal Passage Retrieval

367

19 May 2025

Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification

294

19 May 2025

Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?

400

19 May 2025

Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce

Haojin Wang

Zining Zhu

Freda Shi

279

18 May 2025

Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission

276

17 May 2025

Induction Head Toxicity Mechanistically Explains Repetition Curse in Large Language Models

276

17 May 2025

CCNU at SemEval-2025 Task 3: Leveraging Internal and External Knowledge of Large Language Models for Multilingual Hallucination Annotation

Xu Liu

Guanyi Chen

HILM LRM

192

17 May 2025

ShiQ: Bringing back Bellman to LLMs

...

Pierre Harvey Richemond

Florian Strub

Matthieu Geist

OffRL

240

16 May 2025

Rethinking Repetition Problems of LLMs in Code GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

243

15 May 2025

ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention

320

15 May 2025

Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language ModelsJournal of Systems and Software (JSS), 2025

Junda Zhao

Yuliang Song

Eldan Cohen

343

14 May 2025

Alignment Drift in CEFR-prompted LLMs for Interactive Spanish TutoringWorkshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025

Mina Almasi

Ross Deans Kristensen-McLachlan

330

13 May 2025

Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language ModelsInternational Conference on Learning Representations (ICLR), 2025

422

13 May 2025

Towards Foundation Models for Experimental Readout Systems Combining Discrete and Continuous Data

J. Giroux

C. Fanelli

234

13 May 2025

One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models

314

12 May 2025

Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions

517

09 May 2025

Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs

Chetan Pathade

AAML SILM

492

07 May 2025

Lossless Compression of Large Language Model-Generated Text via Next-Token Prediction

Yu Mao

Holger Pirk

Chun Jason Xue

196

07 May 2025

Semantic Probabilistic Control of Language Models

306

04 May 2025

What do Language Model Probabilities Represent? From Distribution Estimation to Response Prediction

Eitan Wagner

Omri Abend

458

04 May 2025

Multi-agents based User Values Mining for Recommendation

Lawrence Yunliang Chen

Wei Yuan

Tong Chen

Xiangyu Zhao

Nguyen Quoc Viet Hung

Hongzhi Yin

OffRL

289

02 May 2025

Focus on Likely Classes for Test-Time Prediction

Johannes Schneider

237

02 May 2025