Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1904.09751
Cited By
v1
v2 (latest)
The Curious Case of Neural Text Degeneration
22 April 2019
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"The Curious Case of Neural Text Degeneration"
50 / 2,402 papers shown
Beyond the Singular: Revealing the Value of Multiple Generations in Benchmark Evaluation
Wenbo Zhang
Hengrui Cai
Wenyu Chen
338
1
0
13 Feb 2025
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches
D. Elbaz
Oren Salzman
OffRL
333
0
0
13 Feb 2025
From Haystack to Needle: Label Space Reduction for Zero-shot Classification
Nathan Vandemoortele
Bram Steenwinckel
F. Ongenae
Sofie Van Hoecke
VLM
336
1
0
12 Feb 2025
Measuring Diversity in Synthetic Datasets
Yuchang Zhu
Huizhe Zhang
Bingzhe Wu
Jintang Li
Zibin Zheng
Peilin Zhao
Liang Chen
Yatao Bian
459
0
0
12 Feb 2025
Bag of Tricks for Inference-time Computation of LLM Reasoning
Fan Liu
Wenshuo Chao
Naiqiang Tan
Hao Liu
OffRL
LRM
712
12
0
11 Feb 2025
Self-Training Large Language Models for Tool-Use Without Demonstrations
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Ne Luo
Aryo Pradipta Gema
Xuanli He
Emile van Krieken
Pietro Lesci
Pasquale Minervini
LLMAG
310
8
0
09 Feb 2025
Enabling Autoregressive Models to Fill In Masked Tokens
Daniel Israel
Aditya Grover
Karen Ullrich
AI4CE
361
7
0
09 Feb 2025
ATLAS: Autoformalizing Theorems through Lifting, Augmentation, and Synthesis of Data
Xiaoyang Liu
Kangjie Bao
Jiashuo Zhang
Yunqi Liu
Yu Chen
Yu Chen
Yang Jiao
Tao Luo
AIMat
366
13
0
08 Feb 2025
Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions
Jingxin Xu
Guoshun Nan
Sheng Guan
Sicong Leng
Wenshu Fan
Zixiao Wang
Yuyang Ma
Zhili Zhou
Yanzhao Hou
Xiaofeng Tao
LM&MA
342
1
0
08 Feb 2025
Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning
Manh Luong
Khai Nguyen
Dinh Q. Phung
Gholamreza Haffari
Zhuang Li
OT
293
0
0
08 Feb 2025
Optimizing Temperature for Language Models with Multi-Sample Inference
Weihua Du
Yiming Yang
Sean Welleck
497
14
0
07 Feb 2025
Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference
Toby Simonds
294
4
0
05 Feb 2025
Twilight: Adaptive Attention Sparsity with Hierarchical Top-
p
p
p
Pruning
Ziming Mao
Jiaming Tang
Shuo Yang
Hanshuo Wang
Tian Tang
Boyu Tian
Eric Liang
Enze Xie
Mingyu Gao
619
11
0
04 Feb 2025
Evaluation of Large Language Models via Coupled Token Generation
N. C. Benz
Stratis Tsirtsis
Eleni Straitouri
Ivi Chatzi
Ander Artola Velasco
Suhas Thejaswi
Manuel Gomez Rodriguez
374
3
0
03 Feb 2025
Latent Thought Models with Variational Bayes Inference-Time Computation
Deqian Kong
Minglu Zhao
Dehong Xu
Bo Pang
Shu Wang
...
Zhangzhang Si
Chuan Li
Jianwen Xie
Sirui Xie
Ying Nian Wu
VLM
LRM
BDL
383
10
0
03 Feb 2025
Diverse Preference Optimization
Jack Lanchantin
Angelica Chen
Shehzaad Dhuliawala
Ping Yu
Jason Weston
Sainbayar Sukhbaatar
Ilia Kulikov
739
23
0
30 Jan 2025
Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Zilu Tang
Rajen Chatterjee
Sarthak Garg
HILM
202
5
0
28 Jan 2025
AgentRec: Agent Recommendation Using Sentence Embeddings Aligned to Human Feedback
Joshua Park
Yongfeng Zhang
LLMAG
LM&Ro
298
3
0
23 Jan 2025
Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities
Florian Kankowski
Torgrim Solstad
Sina Zarriess
Oliver Bott
355
2
0
22 Jan 2025
O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
Haotian Luo
Li Shen
Haiying He
Yun Wang
Shiwei Liu
Wei Li
Naiqiang Tan
Xiaochun Cao
Dacheng Tao
VLM
LRM
533
190
0
22 Jan 2025
BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks
Zhuang Li
499
2
0
21 Jan 2025
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
International Conference on Learning Representations (ICLR), 2024
Junyu Chen
Han Cai
Junsong Chen
Enze Xie
Shang Yang
Haotian Tang
Zhekai Zhang
Yaojie Lu
Song Han
DiffM
497
14
0
20 Jan 2025
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
Ziyao Zhang
Yanlin Wang
Chong Wang
Jiachi Chen
Zibin Zheng
430
90
0
20 Jan 2025
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
Yen-Ting Lin
Di Jin
Tengyu Xu
Tianhao Wu
Sainbayar Sukhbaatar
...
Yuandong Tian
Arash Rahnama
Sinong Wang
Hao Ma
Han Fang
LRM
204
11
0
18 Jan 2025
Simplified and Generalized Masked Diffusion for Discrete Data
Neural Information Processing Systems (NeurIPS), 2024
Jiaxin Shi
Kehang Han
Zehao Wang
Arnaud Doucet
Michalis K. Titsias
DiffM
612
301
0
17 Jan 2025
LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks
International Conference on Software and Computer Applications (ICSCA), 2025
Zan-Kai Chong
Hiroyuki Ohsaki
Bryan Ng
283
10
0
13 Jan 2025
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Vladimir Bataev
Subhankar Ghosh
Vitaly Lavrukhin
Jason Chun Lok Li
AI4TS
284
4
0
10 Jan 2025
Learning the Language of Protein Structure
Benoit Gaujac
Jérémie Donà
Liviu Copoiu
Timothy Atkinson
Thomas Pierrot
Thomas D. Barrett
295
15
0
08 Jan 2025
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation
Alireza Salemi
Cheng-rong Li
Mingyang Zhang
Qiaozhu Mei
Weize Kong
Tao Chen
Zhuowan Li
Michael Bendersky
Hamed Zamani
LRM
RALM
ReLM
278
21
0
07 Jan 2025
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Helin Wang
Meng Yu
Jiarui Hai
Chen Chen
Yuchen Hu
Rilin Chen
Najim Dehak
Dong Yu
393
11
0
03 Jan 2025
Mind the Data Gap: Bridging LLMs to Enterprise Data Integration
Moe Kayali
Fabian Wenz
Nesime Tatbul
Çağatay Demiralp
213
7
0
31 Dec 2024
The Emotional Spectrum of LLMs: Leveraging Empathy and Emotion-Based Markers for Mental Health Support
Workshop on Computational Linguistics and Clinical Psychology (CLPsych), 2024
Alessandro De Grandi
Federico Ravenda
Andrea Raballo
Fabio Crestani
AI4MH
260
2
0
31 Dec 2024
Exploring and Controlling Diversity in LLM-Agent Conversation
Kuanchao Chu
Yi-Pei Chen
Hideki Nakayama
LLMAG
507
8
0
30 Dec 2024
How Evaluation Choices Distort the Outcome of Generative Drug Discovery
Journal of Cheminformatics (J Cheminform), 2024
Rıza Özçelik
F. Grisoni
265
3
0
24 Dec 2024
Human-Readable Adversarial Prompts: An Investigation into LLM Vulnerabilities Using Situational Context
Nilanjana Das
Edward Raff
Aman Chadha
Manas Gaur
AAML
606
4
0
20 Dec 2024
REFA: Reference Free Alignment for multi-preference optimization
Taneesh Gupta
Rahul Madhavan
Xuchao Zhang
Chetan Bansal
Saravan Rajmohan
494
1
0
20 Dec 2024
Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Vera Neplenbroek
Arianna Bisazza
Raquel Fernández
610
4
0
18 Dec 2024
Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
AAAI Conference on Artificial Intelligence (AAAI), 2024
Jiaqi Chen
Xiaoye Zhu
Tianyang Liu
Yuxiao Chen
Xinhui Chen
...
Lei Zhang
Chenyu Yan
Guanghao Mei
Jie M. Zhang
Guang Dai
DeLMO
224
8
0
11 Dec 2024
QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization
Shiyue Zhang
David Wan
Arie Cattan
Ayal Klein
Ido Dagan
Joey Tianyi Zhou
365
4
0
10 Dec 2024
JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLM
Pacific Asia Conference on Language, Information and Computation (PACLIC), 2024
Takuro Fujii
Satoru Katsumata
214
0
0
09 Dec 2024
Constrained Decoding with Speculative Lookaheads
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Nishanth Nakshatri
Shamik Roy
Rajarshi Das
Suthee Chaidaroon
Leonid Boytsov
Rashmi Gangadharaiah
472
3
0
09 Dec 2024
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing
Lajos Muzsai
David Imolai
András Lukács
LLMAG
264
33
0
02 Dec 2024
Paint Outside the Box: Synthesizing and Selecting Training Data for Visual Grounding
Zilin Du
Haoxin Li
Jianfei Yu
Boyang Li
1.3K
1
0
01 Dec 2024
How far can bias go? -- Tracing bias from pretraining data to alignment
Marion Thaler
Abdullatif Köksal
Alina Leidinger
Anna Korhonen
Hinrich Schutze
411
4
0
28 Nov 2024
Mixture of Cache-Conditional Experts for Efficient Mobile Device Inference
Andrii Skliar
T. V. Rozendaal
Romain Lepert
Todor Boinovski
M. V. Baalen
Markus Nagel
Paul N. Whatmough
B. Bejnordi
MoE
409
7
0
27 Nov 2024
GeoFormer: A Multi-Polygon Segmentation Transformer
British Machine Vision Conference (BMVC), 2024
Maxim Khomiakov
Michael Riis Andersen
J. Frellsen
222
1
0
25 Nov 2024
Instruct or Interact? Exploring and Eliciting LLMs' Capability in Code Snippet Adaptation Through Prompt Engineering
International Conference on Software Engineering (ICSE), 2024
Tanghaoran Zhang
Yue Yu
Xinjun Mao
Shangwen Wang
Kang Yang
Yao Lu
Zhang Zhang
Yuxin Zhao
208
10
0
23 Nov 2024
Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance
Haozhe Zhao
Shuzheng Si
L. Chen
Yichi Zhang
Maosong Sun
Mingjia Zhang
Baobao Chang
VLM
194
15
0
21 Nov 2024
Closer Look at Efficient Inference Methods: A Survey of Speculative Decoding
Hyun Ryu
Eric Kim
358
3
0
20 Nov 2024
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
Xinyan Guan
Yanjiang Liu
Xinyu Lu
Boxi Cao
Xianpei Han
...
Le Sun
Jie Lou
Bowen Yu
Yaojie Lu
Hongyu Lin
ALM
590
9
0
18 Nov 2024
Previous
1
2
3
...
8
9
10
...
47
48
49
Next
Page 9 of 49
Page
of 49
Go