1904.09751
The Curious Case of Neural Text Degeneration
22 April 2019
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
Papers citing
"The Curious Case of Neural Text Degeneration"
50 / 2,402 papers shown
Jekyll-and-Hyde Tipping Point in an AI's Behavior
Neil F. Johnson
Frank Yingjie Huo
182
3
0
29 Apr 2025
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts
Hanhua Hong
Chenghao Xiao
Yang Wang
Y. Liu
Wenge Rong
Chenghua Lin
310
3
0
29 Apr 2025
LZ Penalty: An information-theoretic repetition penalty for autoregressive language models
Antonio A. Ginart
Naveen Kodali
Jason D. Lee
Caiming Xiong
Siyang Song
John Emmons
373
0
0
28 Apr 2025
TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation
Gwen Yidou Weng
Benjie Wang
Karen Ullrich
BDL
890
4
0
25 Apr 2025
Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection
Atharva Kulkarni
Yuan-kang Zhang
Joel Ruben Antony Moniz
Xiou Ge
Bo-Hsiang Tseng
Dhivya Piraviperumal
Siyang Song
Hong-ye Yu
HILM
385
5
0
25 Apr 2025
Energy Considerations of Large Language Model Inference and Efficiency Optimizations
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jared Fernandez
Clara Na
Vashisth Tiwari
Yonatan Bisk
Sasha Luccioni
Emma Strubell
493
19
0
24 Apr 2025
ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data
Haoran Gu
Handing Wang
Yi Mei
Mengjie Zhang
Yaochu Jin
337
3
0
23 Apr 2025
What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Michael A. Hedderich
Anyi Wang
Raoyuan Zhao
Florian Eichin
Jonas Fischer
Barbara Plank
337
3
0
22 Apr 2025
EmoSEM: Segment and Explain Emotion Stimuli in Visual Art
Jing Zhang
Dan Guo
Zhangbin Li
Meng Wang
306
0
0
20 Apr 2025
Understanding the Repeat Curse in Large Language Models from a Feature Perspective
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Junchi Yao
Shu Yang
Jianhua Xu
Lijie Hu
Mengdi Li
Di Wang
611
17
0
19 Apr 2025
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Yang Yue
Zhiqi Chen
Rui Lu
Andrew Zhao
Zhaokai Wang
Yang Yue
Shiji Song
Gao Huang
ReLM
LRM
719
466
0
18 Apr 2025
LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard
Interfaces to Database Systems (IDS), 2025
Varun Rao
Youran Sun
Mahendra Kumar
Tejas Mutneja
Agastya Mukherjee
Haizhao Yang
AIFin
297
1
0
17 Apr 2025
Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation
Wentai Deng
Hanlin Wang
Ying Wang
Xueying Du
Zuoyu Ou
...
Xin Su
Zifei Shan
Fangming Zou
Xin Peng
230
4
0
17 Apr 2025
Sparks of Science: Hypothesis Generation Using Structured Paper Data
Charles O'Neill
Tirthankar Ghosal
Roberta Răileanu
Mike Walmsley
Thang Bui
Kevin Schawinski
I. Ciucă
LRM
277
10
0
17 Apr 2025
MAIN: Mutual Alignment Is Necessary for instruction tuning
Fanyi Yang
Jianfeng Liu
Xinsong Zhang
Haoyu Liu
Xixin Cao
Yuefeng Zhan
H. Sun
Weiwei Deng
Feng Sun
Qi Zhang
ALM
251
0
0
17 Apr 2025
Provable Secure Steganography Based on Adaptive Dynamic Sampling
Kaiyi Pang
Minhao Bai
DiffM
156
1
0
17 Apr 2025
Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models -
Laura Fieback
Nishilkumar Balar
Jakob Spiegelberg
Hanno Gottschalk
MLLM
VLM
354
0
0
16 Apr 2025
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation
Miguel Moura Ramos
Patrick Fernandes
Sweta Agrawal
André F.T. Martins
351
5
0
16 Apr 2025
Evaluating the Diversity and Quality of LLM Generated Content
Alexander Shypula
Shuo Li
Botong Zhang
Vishakh Padmakumar
Kayo Yin
Osbert Bastani
278
25
0
16 Apr 2025
Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance
Shixuan Liu
Zhenzhe Zheng
Xiaoyao Huang
Fan Wu
Guihai Chen
Jie Wu
329
1
0
15 Apr 2025
Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data
Shuai Zhao
Linchao Zhu
Yi Yang
456
4
0
14 Apr 2025
Analysis of Attention in Video Diffusion Transformers
Yuxin Wen
Jim Wu
Ajay Jain
Tom Goldstein
Ashwinee Panda
278
8
0
14 Apr 2025
Weight Ensembling Improves Reasoning in Language Models
Xingyu Dang
Christina Baek
Kaiyue Wen
Zico Kolter
Aditi Raghunathan
MoMe
LRM
625
21
0
14 Apr 2025
Alleviating the Fear of Losing Alignment in LLM Fine-tuning
IEEE Symposium on Security and Privacy (S&P), 2025
Kang Yang
Guanhong Tao
X. Chen
Jun Xu
280
11
0
13 Apr 2025
Parameterized Synthetic Text Generation with SimpleStories
Lennart Finke
Chandan Sreedhara
Thomas Dooms
Mat Allen
Emerald Zhang
Juan Diego Rodriguez
Noa Nabeshima
Thomas Marshall
Dan Braun
SyDa
349
0
0
12 Apr 2025
On The Landscape of Spoken Language Models: A Comprehensive Survey
Siddhant Arora
Kai-Wei Chang
Chung-Ming Chien
Yifan Peng
Haibin Wu
Yossi Adi
Emmanuel Dupoux
Hung-yi Lee
Karen Livescu
Shinji Watanabe
368
66
0
11 Apr 2025
Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation
Alireza Salemi
Chris Samarinas
Hamed Zamani
192
0
0
10 Apr 2025
Cellular Development Follows the Path of Minimum Action
Rohola Zandie
Farhan Khodaee
Yufan Xia
Elazer R. Edelman
252
1
0
10 Apr 2025
Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning
Anja Surina
Amin Mansouri
Lars Quaedvlieg
Amal Seddas
Maryna Viazovska
Emmanuel Abbe
Çağlar Gülçehre
608
21
0
07 Apr 2025
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
Kim Sung-Bin
Jeongsoo Choi
Puyuan Peng
Joon Son Chung
Tae-Hyun Oh
David Harwath
VGen
256
6
0
03 Apr 2025
LLM Social Simulations Are a Promising Research Method
Jacy Reese Anthis
Ryan Liu
Sean M. Richardson
Austin C. Kozlowski
Bernard Koch
James A. Evans
Erik Brynjolfsson
Michael S. Bernstein
ALM
507
85
0
03 Apr 2025
OpenCodeReasoning: Advancing Data Distillation for Competitive Coding
Wasi Uddin Ahmad
Mehrzad Samadi
Somshubra Majumdar
Aleksander Ficek
Siddhartha Jain
Jocelyn Huang
Vahid Noroozi
Boris Ginsburg
LRM
412
46
0
02 Apr 2025
Repetitions are not all alike: distinct mechanisms sustain repetition in language models
Matéo Mahaut
Francesca Franzon
344
2
0
01 Apr 2025
Collaborative LLM Numerical Reasoning with Local Data Protection
Min Zhang
Yuzhe Lu
Yun Zhou
Panpan Xu
Lin Lee Cheong
Chang-Tien Lu
Haozhu Wang
367
0
0
01 Apr 2025
Model Hemorrhage and the Robustness Limits of Large Language Models
Ziyang Ma
Hui Yuan
Guang Dai
Gui-Song Xia
Bo Du
Liangpei Zhang
Dacheng Tao
317
1
0
31 Mar 2025
The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
Mingkai Tian
Guorong Li
Yuankai Qi
Amin Beheshti
Javen Qinfeng Shi
Anton van den Hengel
Qingming Huang
VGen
253
0
0
31 Mar 2025
Local Normalization Distortion and the Thermodynamic Formalism of Decoding Strategies for Large Language Models
Tom Kempton
Stuart Burrell
229
2
0
27 Mar 2025
Latent Beam Diffusion Models for Generating Visual Sequences
Guilherme Fernandes
Vasco Ramos
Regev Cohen
Idan Szpektor
João Magalhães
385
1
0
26 Mar 2025
TempTest: Local Normalization Distortion and the Detection of Machine-generated Text
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Tom Kempton
Stuart Burrell
Connor Cheverall
DeLMO
272
1
0
26 Mar 2025
SparSamp: Efficient Provably Secure Steganography Based on Sparse Sampling
Yaofei Wang
Gang Pei
Kejiang Chen
Jinyang Ding
Chao Pan
Weilong Pang
Donghui Hu
Weinan Zhang
190
6
0
25 Mar 2025
SG-Tailor: Inter-Object Commonsense Relationship Reasoning for Scene Graph Manipulation
Haoliang Shang
Hanyu Wu
Guangyao Zhai
Boyang Sun
Fangjinhua Wang
F. Tombari
Marc Pollefeys
310
1
0
23 Mar 2025
Modifying Large Language Model Post-Training for Diverse Creative Writing
John Joon Young Chung
Vishakh Padmakumar
Melissa Roemmele
Yuqian Sun
Max Kreminski
MoMe
227
20
0
21 Mar 2025
Aligned Probing: Relating Toxic Behavior and Model Internals
Andreas Waldis
Vagrant Gautam
Anne Lauscher
Dietrich Klakow
Iryna Gurevych
286
2
0
17 Mar 2025
DAPI: Domain Adaptive Toxicity Probe Vector Intervention for Fine-Grained Detoxification
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Cho Hyeonsu
Dooyoung Kim
Youngjoong Ko
MoMe
226
1
0
17 Mar 2025
Investigating Human-Aligned Large Language Model Uncertainty
Kyle Moore
Jesse Roberts
Daryl Watson
Pamela Wisniewski
253
1
0
16 Mar 2025
Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs
Chongjun Tu
Peng Ye
Dongzhan Zhou
Wenlong Zhang
Gang Yu
Tao Chen
Wanli Ouyang
289
7
0
13 Mar 2025
Can LLMs Understand Time Series Anomalies?
International Conference on Learning Representations (ICLR), 2024
Zihao Zhou
Rose Yu
AI4TS
403
31
0
13 Mar 2025
Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence Generation
Zihao Chen
H. Handa
Miho Ohsaki
Kimiaki Shirahama
268
1
0
12 Mar 2025
Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework
Zhuo Zhi
Chen Feng
Adam Daneshmend
Mine Orlu
Andreas Demosthenous
L. Yin
Da Li
Ziquan Liu
Miguel R. D. Rodrigues
LRM
266
8
0
11 Mar 2025
Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Wen Luo
Feifan Song
Wei Li
Guangyue Peng
Shaohang Wei
Houfeng Wang
AI4CE
224
2
0
11 Mar 2025