1905.07830
HellaSwag: Can a Machine Really Finish Your Sentence?
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
19 May 2019
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
Papers citing "HellaSwag: Can a Machine Really Finish Your Sentence?"
50 / 2,226 papers shown
Sentry: Authenticating Machine Learning Artifacts on the Fly
Andrew Gan
Zahra Ghodsi
61
1
0
01 Oct 2025
Composer: A Search Framework for Hybrid Neural Architecture Design
Bilge Acun
Prasoon Sinha
Newsha Ardalani
Sangmin Bae
Alicia Golden
Chien-Yu Lin
Meghana Madhyastha
Fei Sun
N. Yadwadkar
Carole-Jean Wu
180
1
0
01 Oct 2025
Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability
Shojiro Yamabe
Jun Sakuma
AAML
88
0
0
01 Oct 2025
MADS: Multi-Agent Dialogue Simulation for Diverse Persuasion Data Generation
Mingjin Li
Yu Liu
Huayi Liu
Xiang Ye
Chao Jiang
Hongguang Zhang
Yu Ruan
168
0
0
30 Sep 2025
Training Matryoshka Mixture-of-Experts for Elastic Inference-Time Expert Utilization
Yaoxiang Wang
Qingguo Hu
Yucheng Ding
Ruizhe Wang
Yeyun Gong
Jian Jiao
Yelong Shen
Peng Cheng
Jinsong Su
MoE
56
0
0
30 Sep 2025
Thoughtbubbles: an Unsupervised Method for Parallel Thinking in Latent Space
Houjun Liu
Shikhar Murty
Christopher D. Manning
Róbert Csordás
ReLM
LRM
AI4CE
78
0
0
30 Sep 2025
Understanding the Mixture-of-Experts with Nadaraya-Watson Kernel
Chuanyang Zheng
Jiankai Sun
Yihang Gao
Enze Xie
Yuehao Wang
...
Kashif Rasul
Mac Schwager
Anderson Schneider
Zinan Lin
Yuriy Nevmyvaka
MoE
154
2
0
30 Sep 2025
Towards Ecologically Valid LLM Benchmarks: Understanding and Designing Domain-Centered Evaluations for Journalism Practitioners
Charlotte Li
Nick Hagar
Sachita Nishal
Jeremy Gilbert
Nick Diakopoulos
77
0
0
30 Sep 2025
LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts
Yuan Zhuang
Yi Shen
Yuexin Bian
Qing Su
Shihao Ji
Yuanyuan Shi
Fei Miao
MoE
MoMe
164
1
0
30 Sep 2025
Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training
Junlin Han
Shengbang Tong
David Fan
Yufan Ren
Koustuv Sinha
Juil Sock
Filippos Kokkinos
LRM
VLM
139
4
0
30 Sep 2025
OPPO: Accelerating PPO-based RLHF via Pipeline Overlap
Kaizhuo Yan
Yingjie Yu
Yifan Yu
Haizhong Zheng
Fan Lai
VLM
68
0
0
30 Sep 2025
Collaborative Compression for Large-Scale MoE Deployment on Edge
Yixiao Chen
Yanyue Xie
Ruining Yang
Wei Jiang
Wei Wang
Yong He
Yue Chen
Pu Zhao
Y. Wang
MQ
56
0
0
30 Sep 2025
CAST: Continuous and Differentiable Semi-Structured Sparsity-Aware Training for Large Language Models
Weiyu Huang
Yuezhou Hu
Jun Zhu
Jianfei Chen
CLL
72
0
0
30 Sep 2025
Layer-wise dynamic rank for compressing large language models
Zhendong Mi
Bian Sun
Grace Li Zhang
Shaoyi Huang
ALM
116
0
0
30 Sep 2025
The Flaw of Averages: Quantifying Uniformity of Performance on Benchmarks
Arda Uzunoglu
Tianjian Li
Daniel Khashabi
116
0
0
30 Sep 2025
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
Huu Nguyen
Victor May
Harsh Raj
Marianna Nezhurina
Yishan Wang
...
Aleksandra Krasnodębska
Christoph Schuhmann
Mats Leon Richter
Xuan-Son
J. Jitsev
127
1
0
29 Sep 2025
Pretraining with hierarchical memories: separating long-tail and common knowledge
Hadi Pouransari
David Grangier
C. Thomas
Michael Kirchhof
Oncel Tuzel
RALM
KELM
187
1
0
29 Sep 2025
Rethinking Parameter Sharing for LLM Fine-Tuning with Multiple LoRAs
Hao Ban
Kaiyi Ji
MoE
137
0
0
29 Sep 2025
LLM DNA: Tracing Model Evolution via Functional Representations
Zhaomin Wu
Haodong Zhao
Ziyang Wang
Jizhou Guo
Qian Wang
Bingsheng He
76
1
0
29 Sep 2025
Conda: Column-Normalized Adam for Training Large Language Models Faster
Junjie Wang
Pan Zhou
Yiming Dong
Huan Li
Jia Li
Xun Zhou
Qicheng Lao
Cong Fang
Zhouchen Lin
AI4CE
184
0
0
29 Sep 2025
Fingerprinting LLMs via Prompt Injection
Yuepeng Hu
Zhengyuan Jiang
Mengyuan Li
Osama Ahmed
Zhicong Huang
Cheng Hong
Neil Zhenqiang Gong
154
0
0
29 Sep 2025
AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment
Mengyu Bu
Shaolei Zhang
Zhongjun He
Hua Wu
Yang Feng
100
0
0
29 Sep 2025
Beyond Repetition: Text Simplification and Curriculum Learning for Data-Constrained Pretraining
M. R
Dan John Velasco
69
0
0
29 Sep 2025
Short window attention enables long-term memorization
Loic Cabannes
Maximilian Beck
Gergely Szilvasy
Matthijs Douze
Maria Lomeli
Jade Copet
Pierre-Emmanuel Mazaré
Gabriel Synnaeve
Hervé Jégou
104
1
0
29 Sep 2025
CURA: Size Isn't All You Need - A Compact Universal Architecture for On-Device Intelligence
Jae-Bum Seo
Muhammad Salman
Lismer Andres Caceres-Najarro
68
0
0
29 Sep 2025
Timber: Training-free Instruct Model Refining with Base via Effective Rank
Taiqiang Wu
Runming Yang
Tao Liu
Jiahao Wang
Zenan Xu
Ngai Wong
68
1
0
28 Sep 2025
Toward Preference-aligned Large Language Models via Residual-based Model Steering
Lucio La Cava
Andrea Tagarelli
LLMSV
132
0
0
28 Sep 2025
ChunkLLM: A Lightweight Pluggable Framework for Accelerating LLMs Inference
Haojie Ouyang
Jianwei Lv
Lei Ren
Chen Wei
Xiaojie Wang
Fangxiang Feng
VLM
120
0
0
28 Sep 2025
Assessing Large Language Models in Updating Their Forecasts with New Information
Zhangdie Yuan
Zifeng Ding
Andreas Vlachos
48
0
0
28 Sep 2025
Don't Settle Too Early: Self-Reflective Remasking for Diffusion Language Models
Zemin Huang
Yuhang Wang
Zhiyang Chen
Guo-Jun Qi
44
3
0
28 Sep 2025
Tequila: Trapping-free Ternary Quantization for Large Language Models
Hong Huang
Decheng Wu
Rui Cen
Guanghua Yu
Z. Li
Kai Liu
Jianchen Zhu
Peng Chen
Xue Liu
Dapeng Wu
MQ
153
2
0
28 Sep 2025
Sequential Diffusion Language Models
Yangzhou Liu
Yue Cao
Hao-Wen Li
Gen Luo
Z. Chen
...
Yuqiang Li
Tong Lu
Yu Qiao
Jifeng Dai
Wenhai Wang
72
3
0
28 Sep 2025
PT²-LLM: Post-Training Ternarization for Large Language Models
Xianglong Yan
Chengzhu Bao
Zhiteng Li
Tianao Zhang
Kaicheng Yang
Haotong Qin
Ruobing Xie
Xingwu Sun
Yulun Zhang
MQ
134
0
0
27 Sep 2025
SDQ-LLM: Sigma-Delta Quantization for 1-bit LLMs of any size
Junhao Xia
Ming Zhao
Limin Xiao
Xiujun Zhang
MQ
72
0
0
27 Sep 2025
Multiplayer Nash Preference Optimization
Fang Wu
X. Y. Huang
Weihao Xuan
Zhiwei Zhang
Yijia Xiao
...
Xiaomin Li
Bing Hu
Peng Xia
Jure Leskovec
Yejin Choi
104
1
0
27 Sep 2025
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
Vage Egiazarian
Roberto L. Castro
Denis Kuznedelev
Andrei Panferov
Eldar Kurtic
...
Alexandre Marques
Mark Kurtz
Saleh Ashkboos
Torsten Hoefler
Dan Alistarh
MQ
184
1
0
27 Sep 2025
PATCH: Learnable Tile-level Hybrid Sparsity for LLMs
Younes Hourri
Mohammad Mozaffari
M. Dehnavi
124
0
0
27 Sep 2025
A2D: Any-Order, Any-Step Safety Alignment for Diffusion Language Models
Wonje Jeung
Sangyeon Yoon
Yoonjun Cho
Dongjae Jeon
Sangwoo Shin
Hyesoo Hong
Albert No
DiffM
105
0
0
27 Sep 2025
Quant-dLLM: Post-Training Extreme Low-Bit Quantization for Diffusion Large Language Models
Tianao Zhang
Zhiteng Li
Xianglong Yan
Haotong Qin
Yong Guo
Yulun Zhang
MQ
77
0
0
27 Sep 2025
MoE-PHDS: One MoE checkpoint for flexible runtime sparsity
Lauren Hannah
Soheil Zibakhsh
K. Nishu
Arnav Kundu
Mohammad Samragh Razlighi
Mehrdad Farajtabar
Minsik Cho
MoE
68
0
0
27 Sep 2025
Dual-Space Smoothness for Robust and Balanced LLM Unlearning
Han Yan
Zheyuan Liu
Meng Jiang
MU
AAML
96
0
0
27 Sep 2025
Beyond Outliers: A Study of Optimizers Under Quantization
Georgios Vlassis
Saleh Ashkboos
Alexandra Volkova
Torsten Hoefler
Dan Alistarh
MQ
140
0
0
27 Sep 2025
Train Once, Answer All: Many Pretraining Experiments for the Cost of One
Sebastian Bordt
Martin Pawelczyk
CLL
148
1
0
27 Sep 2025
Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data
Syeda Nahida Akter
Shrimai Prabhumoye
Eric Nyberg
M. Patwary
Mohammad Shoeybi
Yejin Choi
Bryan Catanzaro
AIFin
LRM
AI4CE
92
3
0
26 Sep 2025
HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space
Ke Li
Zheng Yang
Zhongbin Zhou
Feng Xue
Zhonglin Jiang
Wenxiao Wang
MoE
77
0
0
26 Sep 2025
Elastic MoE: Unlocking the Inference-Time Scalability of Mixture-of-Experts
Naibin Gu
Zhenyu Zhang
Yuchen Feng
Yilong Chen
Peng Fu
...
Shuohuan Wang
Yu Sun
Hua Wu
Weiping Wang
Haifeng Wang
MoE
73
0
0
26 Sep 2025
IIET: Efficient Numerical Transformer via Implicit Iterative Euler Method
Xinyu Liu
Bei Li
Jiahao Liu
Junhao Ruan
Kechen Jiao
Hongyin Tang
Jingang Wang
Xiao Tong
Jingbo Zhu
110
0
0
26 Sep 2025
What Matters More For In-Context Learning under Matched Compute Budgets: Pretraining on Natural Text or Incorporating Targeted Synthetic Examples?
Mohammed Sabry
Anya Belz
67
0
0
26 Sep 2025
Rethinking RoPE Scaling in Quantized LLM: Theory, Outlier, and Channel-Band Analysis with Weight Rescaling
Ye Qiao
Haocheng Xu
Xiaofan Zhang
Sitao Huang
MQ
76
0
0
26 Sep 2025
Erase or Hide? Suppressing Spurious Unlearning Neurons for Robust Unlearning
Nakyeong Yang
Dong-Kyum Kim
Jea Kwon
Minsung Kim
Kyomin Jung
M. Cha
MU
KELM
88
0
0
26 Sep 2025