Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1904.09751
Cited By
v1
v2 (latest)
The Curious Case of Neural Text Degeneration
22 April 2019
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Curious Case of Neural Text Degeneration"
50 / 2,402 papers shown
Probability-Consistent Preference Optimization for Enhanced LLM Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yunqiao Yang
Houxing Ren
Zimu Lu
Ke Wang
Weikang Shi
A-Long Zhou
Junting Pan
Mingjie Zhan
Hongsheng Li
LRM
227
0
0
29 May 2025
Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness
Yongjin Yang
Euiin Yi
Jongwoo Ko
Kimin Lee
Zhijing Jin
Se-Young Yun
LLMAG
257
9
0
29 May 2025
Does Machine Unlearning Truly Remove Knowledge?
Haokun Chen
Y. Zhang
Yuan Bi
Yao Zhang
Tong Liu
...
Jindong Gu
Claudia Grosser
Denis Krompass
Nassir Navab
Volker Tresp
MU
271
8
0
29 May 2025
Large Language Model Meets Constraint Propagation
International Joint Conference on Artificial Intelligence (IJCAI), 2024
Alexandre Bonlarron
Florian Régin
Elisabetta De Maria
Jean-Charles Régin
102
0
0
29 May 2025
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yuu Jinnai
OT
199
0
0
29 May 2025
First Steps Towards Overhearing LLM Agents: A Case Study With Dungeons & Dragons Gameplay
Andrew Zhu
Evan Osgood
Chris Callison-Burch
LLMAG
249
0
0
28 May 2025
Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives
Ander Artola Velasco
Stratis Tsirtsis
William Orchard
Manuel Gomez Rodriguez
388
3
0
27 May 2025
RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models
Dario Satriani
Enzo Veltri
Donatello Santoro
Paolo Papotti
LMTD
HILM
203
0
0
27 May 2025
Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies
Terrance Liu
Shuyi Wang
Daniel Preotiuc-Pietro
Yash Chandarana
Chirag Gupta
278
2
0
27 May 2025
Frictional Agent Alignment Framework: Slow Down and Don't Break Things
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Abhijnan Nath
Carine Graff
Andrei Bachinin
Nikhil Krishnaswamy
308
4
0
26 May 2025
Learning to Reason without External Rewards
Xuandong Zhao
Zhewei Kang
Aosong Feng
Sergey Levine
Dawn Song
OffRL
ReLM
LRM
429
99
0
26 May 2025
Foundations of Top-
k
k
k
Decoding For Language Models
Georgy Noarov
Soham Mallick
Tao Wang
Sunay Joshi
Yan Sun
Yangxinyu Xie
Mengxin Yu
Edgar Dobriban
208
7
0
25 May 2025
LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models
Fengqi Zhu
Rongzhen Wang
Shen Nie
Xiaolu Zhang
Chunwei Wu
...
Jun Zhou
Jianfei Chen
Yankai Lin
Ji-Rong Wen
Chongxuan Li
431
96
0
25 May 2025
Self-Training Large Language Models with Confident Reasoning
Hyosoon Jang
Yunhui Jang
Sungjae Lee
Jungseul Ok
SungSoo Ahn
ReLM
LRM
183
3
0
23 May 2025
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Minki Kang
Jongwon Jeong
Seanie Lee
Jaewoong Cho
Sung Ju Hwang
LRM
765
11
0
23 May 2025
Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression
Sreetama Sarkar
Yue Che
Alex Gavin
Peter A. Beerel
Souvik Kundu
MLLM
VLM
257
6
0
22 May 2025
Exploring the Relationship Between Diversity and Quality in Ad Text Generation
Yoichi Aoki
Soichiro Murakami
Ukyo Honda
Akihiko Kato
230
0
0
22 May 2025
CASTILLO: Characterizing Response Length Distributions of Large Language Models
Daniel F. Perez-Ramirez
Dejan Kostic
Magnus Boman
185
1
0
22 May 2025
Optimal Policy Minimum Bayesian Risk
Ramón Fernandez Astudillo
Md Arafat Sultan
Aashka Trivedi
Yousef El-Kurdi
Tahira Naseem
Radu Florian
Salim Roukos
OffRL
246
2
0
22 May 2025
CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models
Arnav Verma
Kushin Mukherjee
Christopher Potts
Elisa Kreiss
Judith E. Fan
202
1
0
22 May 2025
The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
Shivam Agarwal
Zimin Zhang
Lifan Yuan
Jiawei Han
Yuan Yao
468
84
0
21 May 2025
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Siyue Zhang
Yilun Zhao
Liyuan Geng
Arman Cohan
Anh Tuan Luu
Chen Zhao
233
7
0
21 May 2025
Advancing LLM Safe Alignment with Safety Representation Ranking
Tianqi Du
Zeming Wei
Quan Chen
Chenheng Zhang
Yisen Wang
ALM
222
6
0
21 May 2025
Unraveling Interwoven Roles of Large Language Models in Authorship Privacy: Obfuscation, Mimicking, and Verification
Tuc Nguyen
Yifan Hu
Thai Le
DeLMO
313
0
0
20 May 2025
RLVR-World: Training World Models with Reinforcement Learning
Jialong Wu
Shaofeng Yin
Ningya Feng
Mingsheng Long
OffRL
VGen
508
16
0
20 May 2025
Text Generation Beyond Discrete Token Sampling
Yufan Zhuang
Liyuan Liu
Chandan Singh
Jingbo Shang
Jianfeng Gao
OOD
514
9
0
20 May 2025
Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs
Morgan Lindsay Heisler
Linzi Xing
Ge Shi
Hanieh Sadri
Gursimran Singh
Weiwei Zhang
Tao Ye
Ying Xiong
Yong Zhang
Zhenan Fan
181
1
0
20 May 2025
AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models
Guangke Chen
Fu Song
Zhe Zhao
Xiaojun Jia
Yang Liu
Yanchen Qiao
Weizhe Zhang
AuLLM
AAML
442
4
0
20 May 2025
GuRE:Generative Query REwriter for Legal Passage Retrieval
Daehee Kim
Deokhyung Kang
Jonghwi Kim
Sangwon Ryu
Gary Geunbae Lee
RALM
AILaw
367
0
0
19 May 2025
Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification
Jikai Wang
Zhenxu Tian
Jilong Li
Qingrong Xia
Xinyu Duan
Zhefeng Wang
Baoxing Huai
Min Zhang
294
3
0
19 May 2025
Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?
Zilu Tang
Afra Feyza Akyürek
Ekin Akyürek
Derry Wijaya
400
0
0
19 May 2025
Distribution Prompting: Understanding the Expressivity of Language Models Through the Next-Token Distributions They Can Produce
Haojin Wang
Zining Zhu
Freda Shi
279
1
0
18 May 2025
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission
Seungeun Oh
Jinhyuk Kim
Jihong Park
Seung-Woo Ko
Jinho Choi
Tony Q. S. Quek
Seong-Lyun Kim
276
1
0
17 May 2025
Induction Head Toxicity Mechanistically Explains Repetition Curse in Large Language Models
Shuxun Wang
Qingyu Yin
Chak Tou Leong
Qiang Zhang
Linyi Yang
276
3
0
17 May 2025
CCNU at SemEval-2025 Task 3: Leveraging Internal and External Knowledge of Large Language Models for Multilingual Hallucination Annotation
Xu Liu
Guanyi Chen
HILM
LRM
192
0
0
17 May 2025
ShiQ: Bringing back Bellman to LLMs
Pierre Clavier
Nathan Grinsztajn
Raphaël Avalos
Yannis Flet-Berliac
Irem Ergun
...
Eugene Tarassov
Olivier Pietquin
Pierre Harvey Richemond
Florian Strub
Matthieu Geist
OffRL
240
1
0
16 May 2025
Rethinking Repetition Problems of LLMs in Code Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yihong Dong
Yuchen Liu
Xue Jiang
Zhi Jin
Ge Li
243
4
0
15 May 2025
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
Jintian Shao
Hongyi Huang
Hongyi Huang
Beiwen Zhang
ZhiYu Wu
You Shan
MingKai Zheng
320
0
0
15 May 2025
Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models
Journal of Systems and Software (JSS), 2025
Junda Zhao
Yuliang Song
Eldan Cohen
343
3
0
14 May 2025
Alignment Drift in CEFR-prompted LLMs for Interactive Spanish Tutoring
Workshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2025
Mina Almasi
Ross Deans Kristensen-McLachlan
330
5
0
13 May 2025
Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models
International Conference on Learning Representations (ICLR), 2025
Donghoon Kim
Minji Bae
Kyuhong Shim
B. Shim
422
5
0
13 May 2025
Towards Foundation Models for Experimental Readout Systems Combining Discrete and Continuous Data
J. Giroux
C. Fanelli
234
1
0
13 May 2025
One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models
Haoran Gu
Handing Wang
Yi Mei
Mengjie Zhang
Yaochu Jin
314
0
0
12 May 2025
Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
Dhruvesh Patel
Aishwarya Sahoo
Avinash Amballa
Tahira Naseem
Tim G. J. Rudner
Andrew McCallum
KELM
517
2
0
09 May 2025
Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs
Chetan Pathade
AAML
SILM
492
23
0
07 May 2025
Lossless Compression of Large Language Model-Generated Text via Next-Token Prediction
Yu Mao
Holger Pirk
Chun Jason Xue
196
2
0
07 May 2025
Semantic Probabilistic Control of Language Models
Kareem Ahmed
Catarina G Belém
Padhraic Smyth
Sameer Singh
306
4
0
04 May 2025
What do Language Model Probabilities Represent? From Distribution Estimation to Response Prediction
Eitan Wagner
Omri Abend
458
2
0
04 May 2025
Multi-agents based User Values Mining for Recommendation
Lawrence Yunliang Chen
Wei Yuan
Tong Chen
Xiangyu Zhao
Nguyen Quoc Viet Hung
Hongzhi Yin
OffRL
289
1
0
02 May 2025
Focus on Likely Classes for Test-Time Prediction
Johannes Schneider
237
0
0
02 May 2025
Previous
1
2
3
...
5
6
7
...
47
48
49
Next