v1v2 (latest)

The Curious Case of Neural Text Degeneration

22 April 2019

Yejin Choi

Papers citing "The Curious Case of Neural Text Degeneration"

50 / 2,402 papers shown

Beyond the Singular: Revealing the Value of Multiple Generations in Benchmark Evaluation

Wenbo Zhang

Hengrui Cai

Wenyu Chen

338

13 Feb 2025

Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches

D. Elbaz

Oren Salzman

OffRL

333

13 Feb 2025

From Haystack to Needle: Label Space Reduction for Zero-shot Classification

336

12 Feb 2025

Measuring Diversity in Synthetic Datasets

459

12 Feb 2025

Bag of Tricks for Inference-time Computation of LLM Reasoning

712

11 Feb 2025

Self-Training Large Language Models for Tool-Use Without DemonstrationsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

310

09 Feb 2025

Enabling Autoregressive Models to Fill In Masked Tokens

361

09 Feb 2025

ATLAS: Autoformalizing Theorems through Lifting, Augmentation, and Synthesis of Data

366

08 Feb 2025

Refining Positive and Toxic Samples for Dual Safety Self-Alignment of LLMs with Minimal Human Interventions

342

08 Feb 2025

Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning

293

08 Feb 2025

Optimizing Temperature for Language Models with Multi-Sample Inference

Weihua Du

Yiming Yang

Sean Welleck

497

07 Feb 2025

Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference

Toby Simonds

294

05 Feb 2025

Twilight: Adaptive Attention Sparsity with Hierarchical Top-

p

619

04 Feb 2025

Evaluation of Large Language Models via Coupled Token Generation

Manuel Gomez Rodriguez

374

03 Feb 2025

Latent Thought Models with Variational Bayes Inference-Time Computation

...

383

03 Feb 2025

Diverse Preference Optimization

739

30 Jan 2025

Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference OptimizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2025

202

28 Jan 2025

AgentRec: Agent Recommendation Using Sentence Embeddings Aligned to Human Feedback

Joshua Park

Yongfeng Zhang

LLMAG LM&Ro

298

23 Jan 2025

Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities

355

22 Jan 2025

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

533

190

22 Jan 2025

BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks

Zhuang Li

499

21 Jan 2025

Deep Compression Autoencoder for Efficient High-Resolution Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2024

497

20 Jan 2025

LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation

430

20 Jan 2025

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

...

204

18 Jan 2025

Simplified and Generalized Masked Diffusion for Discrete DataNeural Information Processing Systems (NeurIPS), 2024

612

301

17 Jan 2025

LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert NetworksInternational Conference on Software and Computer Applications (ICSCA), 2025

Zan-Kai Chong

Hiroyuki Ohsaki

Bryan Ng

283

13 Jan 2025

TTS-Transducer: End-to-End Speech Synthesis with Neural TransducerIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

284

10 Jan 2025

Learning the Language of Protein Structure

295

08 Jan 2025

Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation

278

07 Jan 2025

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis

393

03 Jan 2025

Mind the Data Gap: Bridging LLMs to Enterprise Data Integration

213

31 Dec 2024

The Emotional Spectrum of LLMs: Leveraging Empathy and Emotion-Based Markers for Mental Health SupportWorkshop on Computational Linguistics and Clinical Psychology (CLPsych), 2024

260

31 Dec 2024

Exploring and Controlling Diversity in LLM-Agent Conversation

507

30 Dec 2024

How Evaluation Choices Distort the Outcome of Generative Drug DiscoveryJournal of Cheminformatics (J Cheminform), 2024

Rıza Özçelik

F. Grisoni

265

24 Dec 2024

Human-Readable Adversarial Prompts: An Investigation into LLM Vulnerabilities Using Situational Context

606

20 Dec 2024

REFA: Reference Free Alignment for multi-preference optimization

494

20 Dec 2024

Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive InvestigationAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Vera Neplenbroek

Arianna Bisazza

Raquel Fernández

610

18 Dec 2024

Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text DetectionAAAI Conference on Artificial Intelligence (AAAI), 2024

...

Lei Zhang

224

11 Dec 2024

QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization

365

10 Dec 2024

JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLMPacific Asia Conference on Language, Information and Computation (PACLIC), 2024

Takuro Fujii

Satoru Katsumata

214

09 Dec 2024

Constrained Decoding with Speculative LookaheadsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

472

09 Dec 2024

HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing

264

02 Dec 2024

Paint Outside the Box: Synthesizing and Selecting Training Data for Visual Grounding

1.3K

01 Dec 2024

How far can bias go? -- Tracing bias from pretraining data to alignment

411

28 Nov 2024

Mixture of Cache-Conditional Experts for Efficient Mobile Device Inference

409

27 Nov 2024

GeoFormer: A Multi-Polygon Segmentation TransformerBritish Machine Vision Conference (BMVC), 2024

Maxim Khomiakov

Michael Riis Andersen

J. Frellsen

222

25 Nov 2024

Instruct or Interact? Exploring and Eliciting LLMs' Capability in Code Snippet Adaptation Through Prompt EngineeringInternational Conference on Software Engineering (ICSE), 2024

208

23 Nov 2024

Looking Beyond Text: Reducing Language bias in Large Vision-Language Models via Multimodal Dual-Attention and Soft-Image Guidance

194

21 Nov 2024

Closer Look at Efficient Inference Methods: A Survey of Speculative Decoding

Hyun Ryu

Eric Kim

358

20 Nov 2024

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

...

590

18 Nov 2024