HellaSwag: Can a Machine Really Finish Your Sentence?

Annual Meeting of the Association for Computational Linguistics (ACL), 2019

19 May 2019

Yejin Choi

Papers citing "HellaSwag: Can a Machine Really Finish Your Sentence?"

50 / 2,253 papers shown

TwinBreak: Jailbreaking LLM Security Alignments based on Twin Prompts

T. Krauß

Hamid Dashtbani

Alexandra Dmitrienko

152

09 Jun 2025

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

...

270

09 Jun 2025

Private Memorization Editing: Turning Memorization into a Defense to Strengthen Data Privacy in Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Elena Sofia Ruzzetti

Giancarlo A. Xompero

Davide Venditti

Fabio Massimo Zanzotto

KELM PILM

291

09 Jun 2025

Learning Distribution-Wise Control in Representation Space for Language Models

Chunyuan Deng

Ruidi Chang

Hanjie Chen

268

07 Jun 2025

Not quite Sherlock Holmes: Language model predictions do not reliably differentiate impossible from improbable eventsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

205

07 Jun 2025

Adapt Once, Thrive with Updates: Transferable Parameter-Efficient Fine-Tuning on Evolving Base ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

188

07 Jun 2025

dots.llm1 Technical Report

...

198

06 Jun 2025

Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias

310

06 Jun 2025

Text-to-LoRA: Instant Transformer Adaption

275

06 Jun 2025

Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation

...

283

06 Jun 2025

DynamicMind: A Tri-Mode Thinking System for Large Language Models

175

06 Jun 2025

Selecting Demonstrations for Many-Shot In-Context Learning via Gradient MatchingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

296

05 Jun 2025

FPTQuant: Function-Preserving Transforms for LLM Quantization

268

05 Jun 2025

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training

...

Blaise Agüera y Arcas

João Sacramento

312

05 Jun 2025

Quantifying Cross-Modality Memorization in Vision-Language Models

331

05 Jun 2025

Inference-Time Hyper-Scaling with KV Cache Compression

277

05 Jun 2025

Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models

425

05 Jun 2025

MANBench: Is Your Multimodal Model Smarter than Human?Annual Meeting of the Association for Computational Linguistics (ACL), 2025

224

04 Jun 2025

RadialRouter: Structured Representation for Efficient and Robust Large Language Models Routing

275

04 Jun 2025

SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling

249

04 Jun 2025

A Statistical Physics of Language Model Reasoning

Jack David Carson

Amir Reisizadeh

LRM AI4CE

185

04 Jun 2025

TokAlign: Efficient Vocabulary Adaptation via Token AlignmentAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

207

04 Jun 2025

Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability InformationInternational Joint Conference on Artificial Intelligence (IJCAI), 2025

276

04 Jun 2025

Backbone Augmented Training for Adaptations

206

04 Jun 2025

Adaptive Task Vectors for Large Language Models

262

03 Jun 2025

PoLAR: Polar-Decomposed Low-Rank Adapter Representation

256

03 Jun 2025

ProcrustesGPT: Compressing LLMs with Structured Matrices and Orthogonal TransformationsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Ekaterina Grishina

Mikhail Gorbunov

Maxim Rakhuba

172

03 Jun 2025

EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving

...

302

03 Jun 2025

Beyond Text Compression: Evaluating Tokenizers Across ScalesAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

278

03 Jun 2025

Scaling Fine-Grained MoE Beyond 50B Parameters: Empirical Evaluation and Practical Insights

207

03 Jun 2025

Cataloguing Hugging Face Models to Software Engineering Activities: Automation and Findings

284

03 Jun 2025

StochasTok: Improving Fine-Grained Subword Understanding in LLMs

345

02 Jun 2025

Exploring the Potential of LLMs as Personalized Assistants: Dataset, Evaluation, and AnalysisAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

227

02 Jun 2025

Multilingual Definition ModelingAnnual Meeting of the Association for Computational Linguistics (ACL), 2025

Edison Marrese-Taylor

Erica K. Shimomoto

Alfredo Solano

Enrique Reid

216

02 Jun 2025

T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning

343

02 Jun 2025

TAH-QUANT: Effective Activation Quantization in Pipeline Parallelism over Slow Network

211

02 Jun 2025

Taming LLMs by Scaling Learning Rates with Gradient Grouping

230

01 Jun 2025

Mamba Drafters for Speculative Decoding

...

290

01 Jun 2025

Data Swarms: Optimizable Generation of Synthetic Evaluation Data

358

31 May 2025

Blending Complementary Memory Systems in Hybrid Quadratic-Linear Transformers

Kazuki Irie

Morris Yau

Samuel J. Gershman

221

31 May 2025

Recipes for Pre-training LLMs with MXFP8

229

30 May 2025

Stepsize anything: A unified learning rate schedule for budgeted-iteration training

631

30 May 2025

HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts

229

30 May 2025

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

197

30 May 2025

ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration

457

30 May 2025

LittleBit: Ultra Low-Bit Quantization via Latent Factorization

229

30 May 2025

SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training

180

30 May 2025

Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning

Wanyun Xie

F. Tonin

Volkan Cevher

169

30 May 2025

LoLA: Low-Rank Linear Attention With Sparse Caching

338

29 May 2025

DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration

205

29 May 2025