1904.09751
Cited By
v1
v2 (latest)
The Curious Case of Neural Text Degeneration
22 April 2019
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
Papers citing
"The Curious Case of Neural Text Degeneration"
50 / 2,400 papers shown
Fine-tuning Flow Matching Generative Models with Intermediate Feedback
Jiajun Fan
Chaoran Cheng
Shuaike Shen
Xiangxin Zhou
Ge Liu
EGVM
160
1
0
20 Oct 2025
MEG-GPT: A transformer-based foundation model for magnetoencephalography data
Rukuang Huang
Sungjun Cho
Chetan Gohil
Oiwi Parker Jones
M. Woolrich
MedIm
128
0
0
20 Oct 2025
Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models
Jiajun Fan
Tong Wei
Chaoran Cheng
Yuxin Chen
Ge Liu
100
1
0
20 Oct 2025
Online Learning Defense against Iterative Jailbreak Attacks via Prompt Optimization
Masahiro Kaneko
Zeerak Talat
Timothy Baldwin
AAML
141
2
0
19 Oct 2025
Vocab Diet: Reshaping the Vocabulary of LLMs with Vector Arithmetic
Yuval Reif
Guy Kaplan
Roy Schwartz
KELM
170
0
0
19 Oct 2025
Bits Leaked per Query: Information-Theoretic Bounds on Adversarial Attacks against LLMs
Masahiro Kaneko
Timothy Baldwin
AAML
186
0
0
19 Oct 2025
MuseTok: Symbolic Music Tokenization for Generation and Semantic Understanding
Jingyue Huang
Zachary Novack
Phillip Long
Yupeng Hou
Ke Chen
Taylor Berg-Kirkpatrick
Julian McAuley
MGen
172
0
0
18 Oct 2025
Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation
Fei Wang
Li Shen
Liang Ding
Chao Xue
Ye Liu
Changxing Ding
149
0
0
17 Oct 2025
Perfect Prediction or Plenty of Proposals? What Matters Most in Planning for Autonomous Driving
Aron Distelzweig
Faris Janjoš
Oliver Scheel
Sirish Reddy Varra
Raghu Rajan
Joschka Boedecker
131
1
0
17 Oct 2025
Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive Patterns in Language Models
Samuel Paech
Allen Roush
Judah Goldfeder
Ravid Shwartz-Ziv
227
0
0
16 Oct 2025
Adaptive Rescheduling in Prefill-Decode Disaggregated LLM Inference
Zhibin Wang
Zetao Hong
Xue Li
Zibo Wang
Shipeng Li
...
Qing Wang
Chengying Huan
Rong Gu
Sheng Zhong
Chen Tian
AI4TS
110
0
0
15 Oct 2025
Stable LLM Ensemble: Interaction between Example Representativeness and Diversity
Junichiro Niimi
133
0
0
15 Oct 2025
How Sampling Affects the Detectability of Machine-written texts: A Comprehensive Study
Matthieu Dubois
François Yvon
Pablo Piantanida
DeLMO
199
1
0
15 Oct 2025
From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails
Ravi Pandya
Madison Bland
D. Nguyen
Changliu Liu
J. F. Fisac
Andrea V. Bajcsy
138
1
0
15 Oct 2025
Self-Augmented Visual Contrastive Decoding
Eun Woo Im
M. K. Ali
Vivek Gupta
133
0
0
15 Oct 2025
ESI: Epistemic Uncertainty Quantification via Semantic-preserving Intervention for Large Language Models
Mingda Li
Xinyu Li
Weinan Zhang
Longxuan Ma
136
0
0
15 Oct 2025
Too Open for Opinion? Embracing Open-Endedness in Large Language Models for Social Simulation
Bolei Ma
Yong Cao
Indira Sen
Anna Haensch
Frauke Kreuter
Barbara Plank
Daniel Hershcovich
AI4CE
134
0
0
14 Oct 2025
Traveling Salesman-Based Token Ordering Improves Stability in Homomorphically Encrypted Language Models
Donghwan Rho
Sieun Seo
Hyewon Sung
Chohong Min
Ernest K. Ryu
125
1
0
14 Oct 2025
A Multilingual, Large-Scale Study of the Interplay between LLM Safeguards, Personalisation, and Disinformation
João A. Leite
Arnav Arora
Silvia Gargova
João Luz
Gustavo Sampaio
Ian Roberts
Carolina Scarton
Kalina Bontcheva
167
1
0
14 Oct 2025
Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability
Bianca Raimondi
Daniela Dalbagno
Maurizio Gabbrielli
AI4CE
95
0
0
14 Oct 2025
Teaching Language Models to Faithfully Express their Uncertainty
Bryan Eikema
Evgenia Ilia
José G. C. de Souza
Chrysoula Zerva
Wilker Aziz
HILM
162
0
0
14 Oct 2025
Reference-Specific Unlearning Metrics Can Hide the Truth: A Reality Check
Sungjun Cho
Dasol Hwang
Frederic Sala
Sangheum Hwang
Kyunghyun Cho
Sungmin Cha
DiffM
MU
166
0
0
14 Oct 2025
CoSPED: Consistent Soft Prompt Targeted Data Extraction and Defense
Yang Zhuochen
Fok Kar Wai
Thing Vrizlynn
AAML
SILM
250
0
0
13 Oct 2025
Representation-Based Exploration for Language Models: From Test-Time to Post-Training
Jens Tuyls
Dylan J. Foster
A. Krishnamurthy
Jordan T. Ash
134
1
0
13 Oct 2025
EAGER: Entropy-Aware GEneRation for Adaptive Inference-Time Scaling
Daniel Scalena
Leonidas Zotos
Elisabetta Fersini
Malvina Nissim
Ahmet Üstün
LRM
85
1
0
13 Oct 2025
When Images Speak Louder: Mitigating Language Bias-induced Hallucinations in VLMs through Cross-Modal Guidance
Jinjin Cao
Zhiyang Chen
Zijun Wang
Liyuan Ma
Weijian Luo
Guojun Qi
VLM
161
1
0
12 Oct 2025
Conformal Sparsification for Bandwidth-Efficient Edge-Cloud Speculative Decoding
Payel Bhattacharjee
Fengwei Tian
Meiyu Zhong
Guangyi Zhang
Osvaldo Simeone
Ravi Tandon
124
0
0
11 Oct 2025
You only need 4 extra tokens: Synergistic Test-time Adaptation for LLMs
Yijie Xu
Huizai Yao
Zhiyu Guo
Weiyu Guo
Pengteng Li
Aiwei Liu
Xuming Hu
Hui Xiong
162
1
0
11 Oct 2025
Autonomous Soft Robotic Guidewire Navigation via Imitation Learning
Noah Barnes
Ji Woong Kim
Lingyun Di
Hannah Qu
Anuruddha Bhattacharjee
...
Olivia Young
Mark Fuge
Ryan D. Sochol
Jeremy D. Brown
A. Krieger
56
0
0
10 Oct 2025
Prompting Test-Time Scaling Is A Strong LLM Reasoning Data Augmentation
Sondos Mahmoud Bsharat
Zhiqiang Shen
ReLM
LRM
100
0
0
10 Oct 2025
Don't Throw Away Your Pretrained Model
Shangbin Feng
Wenhao Yu
Yike Wang
Hongming Zhang
Yulia Tsvetkov
Dong Yu
MoMe
215
1
0
10 Oct 2025
FLToP CTC: Frame-Level Token Pruning via Relative Threshold for Efficient and Memory-Saving Decoding on Diverse Platforms
Atul Shree
Harshith Jupuru
98
0
0
10 Oct 2025
Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy
Xiaoxiao Ma
Feng Zhao
Pengyang Ling
Haibo Qiu
Zhixiang Wei
Hu Yu
Jie Huang
Zhixiong Zeng
Lin Ma
164
2
0
10 Oct 2025
Contrastive Decoding for Synthetic Data Generation in Low-Resource Language Modeling
Jannek Ulm
Kevin Du
Vésteinn Snæbjarnarson
SyDa
158
1
0
09 Oct 2025
Foundations of LLM Knowledge Materialization: Termination, Reproducibility, Robustness
Luca Giordano
Simon Razniewski
145
0
0
08 Oct 2025
Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
Wafaa Mohammed
Vlad Niculae
Chrysoula Zerva
130
0
0
08 Oct 2025
Sample Smart, Not Hard: Correctness-First Decoding for Better Reasoning in LLMs
Xueyan Li
Guinan Su
Mrinmaya Sachan
Jonas Geiping
LRM
100
0
0
07 Oct 2025
D³QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection
Yanran Zhang
Bingyao Yu
Yu Zheng
Wenzhao Zheng
Yueqi Duan
Lei Chen
Jie Zhou
Jiwen Lu
MQ
185
1
0
07 Oct 2025
Modeling Student Learning with 3.8 Million Program Traces
Alexis Ross
Megha Srivastava
Jeremiah Blanchard
Jacob Andreas
93
5
0
06 Oct 2025
Speak, Edit, Repeat: High-Fidelity Voice Editing and Zero-Shot TTS with Cross-Attentive Mamba
Baher Mohammad
Magauiya Zhussip
Stamatios Lefkimmiatis
Mamba
150
0
0
06 Oct 2025
Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning
Chenghao Yang
Lin Gui
Chenxiao Yang
Victor Veitch
Lizhu Zhang
Zhuokai Zhao
OffRL
182
0
0
06 Oct 2025
Large Language Models Preserve Semantic Isotopies in Story Continuations
Marc Cavazza
CLL
111
0
0
06 Oct 2025
TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
Chanjoo Jung
Jaehyung Kim
146
0
0
06 Oct 2025
Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation
Mohsen Hariri
Amirhossein Samandar
Michael Hinczewski
Vipin Chaudhary
ALM
338
0
0
05 Oct 2025
LLM Microscope: What Model Internals Reveal About Answer Correctness and Context Utilization
Jiarui Liu
Jivitesh Jain
Mona T. Diab
Nishant Subramani
148
0
0
05 Oct 2025
Auditing Pay-Per-Token in Large Language Models
Ander Artola Velasco
Stratis Tsirtsis
Manuel Gomez Rodriguez
MLAU
229
0
0
05 Oct 2025
The Artificial Intelligence Cognitive Examination: A Survey on the Evolution of Multimodal Evaluation from Recognition to Reasoning
Mayank Ravishankara
Varindra V. Persad Maharaj
ELM
197
1
0
05 Oct 2025
From Filters to VLMs: Benchmarking Defogging Methods through Object Detection and Segmentation Performance
Ardalan Aryashad
Parsa Razmara
Amin Mahjoub
Seyedarmin Azizi
Mahdi Salmani
Arad Firouzkouhi
VLM
111
0
0
04 Oct 2025
On the Empirical Power of Goodness-of-Fit Tests in Watermark Detection
Weiqing He
Xiang Li
Tianqi Shang
Li Shen
Weijie J. Su
Q. Long
WaLM
241
0
0
04 Oct 2025
Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation
Faruk Alpay
Hamdi Alakkad
64
0
0
03 Oct 2025