HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers, Ari Holtzman, Yonatan Bisk, Ali Farhadi, Yejin Choi
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
arXiv:1905.07830 (19 May 2019)

Papers citing "HellaSwag: Can a Machine Really Finish Your Sentence?"

Showing 50 of 2,251 citing papers.

SBVR: Summation of BitVector Representation for Efficient LLM Quantization
Wonjun Bang, Jongseok Park, Hongseung Yu, Kyungmin Bin, Kyunghan Lee
17 Sep 2025

DSFT: Inspiring Diffusion Large Language Models to Comprehend Mathematical and Logical Patterns
Ranfei Chen, Ming Chen
17 Sep 2025

ZERA: Zero-init Instruction Evolving Refinement Agent - From Zero Instructions to Structured Prompts via Principle-based Optimization
Seungyoun Yi, Minsoo Khang, Sungrae Park
17 Sep 2025

NIRVANA: Structured pruning reimagined for large language models compression
Mengting Ai, Tianxin Wei, Sirui Chen, Jingrui He
17 Sep 2025

Instance-level Randomization: Toward More Stable LLM Evaluations
Yiyang Li, Y. Wu, Ying Luo, Liangtai Sun, Zishu Qin, Lin Qiu, Xuezhi Cao, Xunliang Cai
16 Sep 2025

CBP-Tuning: Efficient Local Customization for Black-box Large Language Models
Jiaxuan Zhao, Naibin Gu, Yuchen Feng, Xiyu Liu, Peng Fu, Zheng Lin, Weiping Wang
15 Sep 2025

AMQ: Enabling AutoML for Mixed-precision Weight-Only Quantization of Large Language Models
Sangjun Lee, Seung-taek Woo, Jungyu Jin, Changhun Lee, Eunhyeok Park
15 Sep 2025

From Parameters to Performance: A Data-Driven Study on LLM Structure and Development
Suqing Wang, Zuchao Li, Luohe Shi, Bo Du, Hai Zhao, Yun Li, Qianren Wang
14 Sep 2025

Fluid Language Model Benchmarking
Valentin Hofmann, David Heineman, Ian H. Magnusson, Kyle Lo, Jesse Dodge, Maarten Sap, Pang Wei Koh, Chun Wang, Hannaneh Hajishirzi, Noah A. Smith
14 Sep 2025

Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs
Hang Guo, Yawei Li, Luca Benini
14 Sep 2025

AQUA: Attention via QUery mAgnitudes for Memory and Compute Efficient Inference in LLMs
S. Shah, Saurav Prakash, Balaraman Ravindran
14 Sep 2025

Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs
Yixiao Zhou, Ziyu Zhao, Dongzhou Cheng, Zhiliang Wu, Jie Gui, Yi-feng Yang, Fei Wu, Yu Cheng, Hehe Fan
12 Sep 2025

Towards Understanding Visual Grounding in Visual Language Models
Georgios Pantazopoulos, Eda B. Özyiğit
12 Sep 2025

ButterflyQuant: Ultra-low-bit LLM Quantization through Learnable Orthogonal Butterfly Transforms
Bingxin Xu, Zhen Dong, Oussama Elachqar, Yuzhang Shang
11 Sep 2025

LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures
Hai Huang, Yann LeCun, Randall Balestriero
11 Sep 2025

Benchmarking Energy Efficiency of Large Language Models Using vLLM
K. Pronk, Q. Zhao
10 Sep 2025

Open-sci-ref-0.01: open and reproducible reference baselines for language model and dataset comparison
Marianna Nezhurina, Jörg Franke, Taishi Nakamura, Timur Carstensen, Niccolò Ajroldi, Ville Komulainen, David Salinas, J. Jitsev
10 Sep 2025

ForTIFAI: Fending Off Recursive Training Induced Failure for AI Model Collapse
Soheil Zibakhsh Shabgahi, Pedram Aghazadeh, Azalia Mirhoseini, F. Koushanfar
10 Sep 2025

Mitigating Attention Localization in Small Scale: Self-Attention Refinement via One-step Belief Propagation
Nakyung Lee, Yeongoon Kim, Minhae Oh, Suhwan Kim, Jin Woo Koo, Hyewon Jo, Jungwoo Lee
09 Sep 2025

Causal Attention with Lookahead Keys
Zhuoqing Song, Peng Sun, Huizhuo Yuan, Quanquan Gu
09 Sep 2025

LoaQ: Layer-wise Output Approximation Quantization
Li Lin, Xiaojun Wan
08 Sep 2025

IPR: Intelligent Prompt Routing with User-Controlled Quality-Cost Trade-offs
Aosong Feng, Zhichao Xu, Xian Wu, Kang Zhou, Sheng Guan, ..., Soumya Smruti Mishra, Yifei Teng, Darren Yow-Bang Wang, Haibo Ding, Lin Lee Cheong
08 Sep 2025

COMPACT: Common-token Optimized Model Pruning Across Channels and Tokens
Eugene Kwek, Wenpeng Yin
08 Sep 2025

AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs
Debdeep Sanyal, Manodeep Ray, Murari Mandal
06 Sep 2025

Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian
Michael Hoffmann, Jophin John, Stefan Schweter, Gokul Ramakrishnan, Hoi-Fong Mak, Alice Zhang, Dmitry Gaynullin, Nicolay J. Hammer
06 Sep 2025

Hyperbolic Large Language Models
Sarang Patil, Zeyong Zhang, Yiran Huang, Tengfei Ma, Mengjia Xu
06 Sep 2025

Delta Activations: A Representation for Finetuned Large Language Models
Zhiqiu Xu, Amish Sethi, Mayur Naik, Ser-Nam Lim
04 Sep 2025

Set Block Decoding is a Language Model Inference Accelerator
Itai Gat, Heli Ben-Hamu, Marton Havasi, Daniel Haziza, Jeremy Reizenstein, Gabriel Synnaeve, David Lopez-Paz, Brian Karrer, Y. Lipman
04 Sep 2025

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth
Yang Wang, Chenghao Xiao, Chia-Yi Hsiao, Zi Yan Chang, Chi-Li Chen, Tyler Loakman, Chenghua Lin
04 Sep 2025

On Robustness and Reliability of Benchmark-Based Evaluation of LLMs
Riccardo Lunardi, V. D. Mea, Stefano Mizzaro, Kevin Roitero
04 Sep 2025

SelfAug: Mitigating Catastrophic Forgetting in Retrieval-Augmented Generation via Distribution Self-Alignment
Yuqing Huang, Rongyang Zhang, Qimeng Wang, Chengqiang Lu, Yan Gao, ..., Xuyang Zhi, Guiquan Liu, Xin Li, Hao Wang, Tong Xu
04 Sep 2025

RL's Razor: Why Online Reinforcement Learning Forgets Less
Idan Shenfeld, Jyothish Pari, Pulkit Agrawal
04 Sep 2025

Adaptive Preference Optimization with Uncertainty-aware Utility Anchor
Xiaobo Wang, Zixia Jia, Jiaqi Li, Qi Liu, Zilong Zheng
03 Sep 2025

LExI: Layer-Adaptive Active Experts for Efficient MoE Model Inference
Krishna Teja Chitty-Venkata, Sandeep Madireddy, M. Emani, V. Vishwanath
02 Sep 2025

Efficient Training-Free Online Routing for High-Volume Multi-LLM Serving
Fangzhou Wu, Sandeep Silwal
02 Sep 2025

Implicit Reasoning in Large Language Models: A Comprehensive Survey
Jindong Li, Yali Fu, Li Fan, Jiahong Liu, Yao Shu, Chengwei Qin, Menglin Yang, Irwin King, Rex Ying
02 Sep 2025

Causal Consistency Regularization: Training Verifiably Sensitive Reasoning in Large Language Models
Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma
01 Sep 2025

Dream-Coder 7B: An Open Diffusion Language Model for Code
Zhihui Xie, Jiacheng Ye, Lin Zheng, Lei Li, Jingwei Dong, ..., Xueliang Zhao, Shansan Gong, Xin Jiang, Zhenguo Li, Lingpeng Kong
01 Sep 2025

LiquidGEMM: Hardware-Efficient W4A8 GEMM Kernel for High-Performance LLM Serving
Huanqi Hu, Bowen Xiao, Shixuan Sun, Jianian Yin, Zhexi Zhang, ..., Chengquan Jiang, Weiqi Xu, Xiaoying Jia, Xin Liu, Minyi Guo
01 Sep 2025

GradES: Significantly Faster Training in Transformers with Gradient-Based Early Stopping
Qifu Wen, Xi Zeng, Zihan Zhou, Shuaijun Liu, M. Hosseinzadeh, Ningxin Su, Reza Rawassizadeh
01 Sep 2025

Flaw or Artifact? Rethinking Prompt Sensitivity in Evaluating LLMs
Andong Hua, Kenan Tang, Chenhe Gu, Jindong Gu, Eric Wong, Yao Qin
01 Sep 2025

DTRNet: Dynamic Token Routing Network to Reduce Quadratic Costs in Transformers
Aman Sharma, Saeed Najafi, Parsa Farinneya, Benyamin Jamialahmadi, Marzieh S. Tahaei, Yuhe Fan, Mehdi Rezagholizadeh, Boxing Chen, A. Jafari
31 Aug 2025

Router Upcycling: Leveraging Mixture-of-Routers in Mixture-of-Experts Upcycling
Junfeng Ran, Guangxiang Zhao, Yuhan Wu, Dawei Zhu, Longyun Wu, Yikai Zhao, Tong Yang, Lin Sun, Xiangzheng Zhang, Sujian Li
31 Aug 2025

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Zinan Tang, Xin Gao, Qizhi Pei, Zhuoshi Pan, Mengzhang Cai, Jiang Wu, Conghui He, Lijun Wu
29 Aug 2025

Standard vs. Modular Sampling: Best Practices for Reliable LLM Unlearning
Praveen Bushipaka, Lucia Passaro, Tommaso Cucinotta
29 Aug 2025

PDTrim: Targeted Pruning for Prefill-Decode Disaggregation in Inference
Hao Zhang, Mengsi Lyu, Zhuo Chen, Xingrun Xing, Yulong Ao, Yonghua Lin
29 Aug 2025

Turning the Spell Around: Lightweight Alignment Amplification via Rank-One Safety Injection
Harethah Shairah, Hasan Hammoud, G. Turkiyyah, Bernard Ghanem
28 Aug 2025

Provable Benefits of In-Tool Learning for Large Language Models
Sam Houliston, Ambroise Odonnat, Charles Arnal, Vivien A. Cabannes
28 Aug 2025

InSQuAD: In-Context Learning for Efficient Retrieval via Submodular Mutual Information to Enforce Quality and Diversity
Souradeep Nanda, Anay Majee, Rishabh K. Iyer
28 Aug 2025

UI-Bench: A Benchmark for Evaluating Design Capabilities of AI Text-to-App Tools
Sam Jung, Agustin Garcinuno, Spencer Mateega
28 Aug 2025