ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.07830
  4. Cited By
HellaSwag: Can a Machine Really Finish Your Sentence?

HellaSwag: Can a Machine Really Finish Your Sentence?

Annual Meeting of the Association for Computational Linguistics (ACL), 2019
19 May 2019
Rowan Zellers
Ari Holtzman
Yonatan Bisk
Ali Farhadi
Yejin Choi
ArXiv (abs)PDFHTML

Papers citing "HellaSwag: Can a Machine Really Finish Your Sentence?"

50 / 2,253 papers shown
UI-Bench: A Benchmark for Evaluating Design Capabilities of AI Text-to-App Tools
UI-Bench: A Benchmark for Evaluating Design Capabilities of AI Text-to-App Tools
Sam Jung
Agustin Garcinuno
Spencer Mateega
ELM
223
0
0
28 Aug 2025
Diffusion Language Models Know the Answer Before Decoding
Diffusion Language Models Know the Answer Before Decoding
Pengxiang Li
Yefan Zhou
Dilxat Muhtar
L. Yin
Shilin Yan
Li Shen
Yi Liang
Soroush Vosoughi
Shiwei Liu
178
22
0
27 Aug 2025
Benchmarking Hindi LLMs: A New Suite of Datasets and a Comparative Analysis
Benchmarking Hindi LLMs: A New Suite of Datasets and a Comparative Analysis
Anusha Kamath
Kanishk Singla
Rakesh Paul
Raviraj Joshi
Utkarsh Vaidya
Sanjay Singh Chauhan
Niranjan Wartikar
153
0
0
27 Aug 2025
Task-Stratified Knowledge Scaling Laws for Post-Training Quantized Large Language Models
Task-Stratified Knowledge Scaling Laws for Post-Training Quantized Large Language Models
Chenxi Zhou
Pengfei Cao
Jiang Li
Jun Zhao
Kang Liu
Jun Zhao
Kang Liu
MQ
188
0
0
26 Aug 2025
UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning
Zihao Huang
Yu Bao
Qiyang Min
S. Chen
Ran Guo
...
Defa Zhu
Yutao Zeng
Banggu Wu
Xun Zhou
Siyuan Qiao
MoE
181
4
0
26 Aug 2025
Predicting the Order of Upcoming Tokens Improves Language Modeling
Predicting the Order of Upcoming Tokens Improves Language Modeling
Zayd Muhammad Kawakibi Zuhri
Erland Hilman Fuadi
Alham Fikri Aji
AI4TS
48
0
0
26 Aug 2025
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Taishi Nakamura
Satoki Ishikawa
Masaki Kawamura
Takumi Okamoto
Daisuke Nohara
Jun Suzuki
Rio Yokota
MoELRM
175
0
0
26 Aug 2025
Beyond Benchmark: LLMs Evaluation with an Anthropomorphic and Value-oriented Roadmap
Beyond Benchmark: LLMs Evaluation with an Anthropomorphic and Value-oriented Roadmap
Jun Wang
Ninglun Gu
Kailai Zhang
Zijiao Zhang
Yelun Bao
...
Liwei Liu
Yihuan Liu
Pengyong Li
Gary G. Yen
Junchi Yan
ALMELM
226
0
0
26 Aug 2025
Weights-Rotated Preference Optimization for Large Language Models
Weights-Rotated Preference Optimization for Large Language Models
Chenxu Yang
Ruipeng Jia
Mingyu Zheng
Naibin Gu
Zheng Lin
Siyuan Chen
Weichong Yin
Hua Wu
Weiping Wang
142
0
0
25 Aug 2025
DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction
DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction
Weilin Cai
Le Qin
Shwai He
Junwei Cui
Ang Li
Jiayi Huang
MoE
121
0
0
25 Aug 2025
Riemannian Optimization for LoRA on the Stiefel Manifold
Riemannian Optimization for LoRA on the Stiefel Manifold
JuneYoung Park
MinJae Kang
Seongbae Lee
Haegang Lee
S. Kim
Jaeho Lee
151
1
0
25 Aug 2025
Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models
Layerwise Importance Analysis of Feed-Forward Networks in Transformer-based Language Models
Wataru Ikeda
Kazuki Yano
Ryosuke Takahashi
Jaesung Lee
Keigo Shibata
Jun Suzuki
90
1
0
25 Aug 2025
Integral Transformer: Denoising Attention, Not Too Much Not Too Little
Integral Transformer: Denoising Attention, Not Too Much Not Too Little
I. Kobyzev
Abbas Ghaddar
Dingtao Hu
Boxing Chen
128
0
0
25 Aug 2025
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Weiyun Wang
Zhangwei Gao
Lixin Gu
Hengjun Pu
Long Cui
...
Bowen Zhou
Kai Chen
Yu Qiao
Wenhai Wang
Gen Luo
MLLMLRM
304
265
0
25 Aug 2025
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
Yifan Wang
Binbin Liu
Fengze Liu
Yuanfan Guo
Jiyao Deng
Xuecheng Wu
Weidong Zhou
Xiaohuan Zhou
Taifeng Wang
141
0
0
25 Aug 2025
Randomly Removing 50% of Dimensions in Text Embeddings has Minimal Impact on Retrieval and Classification Tasks
Randomly Removing 50% of Dimensions in Text Embeddings has Minimal Impact on Retrieval and Classification Tasks
Sotaro Takeshita
Yurina Takeshita
Daniel Ruffinelli
Simone Paolo Ponzetto
147
3
0
25 Aug 2025
DropLoRA: Sparse Low-Rank Adaptation for Parameter-Efficient Fine-Tuning
DropLoRA: Sparse Low-Rank Adaptation for Parameter-Efficient Fine-Tuning
Haojie Zhang
92
2
0
24 Aug 2025
Debate or Vote: Which Yields Better Decisions in Multi-Agent Large Language Models?
Debate or Vote: Which Yields Better Decisions in Multi-Agent Large Language Models?
Hyeong Kyu Choi
Xiaojin Zhu
Yixuan Li
LRM
347
11
0
24 Aug 2025
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
Krishna Teja Chitty-Venkata
Sylvia Howland
Golara Azar
Daria Soboleva
Natalia Vassilieva
Siddhisanket Raskar
M. Emani
V. Vishwanath
MoE
113
1
0
24 Aug 2025
CEQuest: Benchmarking Large Language Models for Construction Estimation
CEQuest: Benchmarking Large Language Models for Construction Estimation
Y. Wu
L. xilinx Wang
Rui Liu
98
1
0
22 Aug 2025
Interpreting the Effects of Quantization on LLMs
Interpreting the Effects of Quantization on LLMs
Manpreet Singh
Hassan Sajjad
MQMILM
377
3
0
22 Aug 2025
RoboBuddy in the Classroom: Exploring LLM-Powered Social Robots for Storytelling in Learning and Integration Activities
RoboBuddy in the Classroom: Exploring LLM-Powered Social Robots for Storytelling in Learning and Integration Activities
Daniel Tozadore
Nur Ertug
Yasmine Chaker
Mortadha Abderrahim
50
0
0
22 Aug 2025
Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Systematic Characterization of LLM Quantization: A Performance, Energy, and Quality Perspective
Tianyao Shi
Yi Ding
MQ
133
3
0
22 Aug 2025
WISCA: A Lightweight Model Transition Method to Improve LLM Training via Weight Scaling
WISCA: A Lightweight Model Transition Method to Improve LLM Training via Weight Scaling
Jiacheng Li
Jianchao Tan
Zhidong Yang
Pingwei Sun
Feiye Huo
...
Xiangyu Zhang
Maoxin He
Guangming Tan
Weile Jia
Tong Zhao
110
3
0
21 Aug 2025
Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training
Exploiting Vocabulary Frequency Imbalance in Language Model Pre-training
Woojin Chung
Jeonghoon Kim
200
1
0
21 Aug 2025
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference
TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill and Decode Inference
Xiaojuan Tang
Fanxu Meng
Pingzhi Tang
Yuxuan Wang
Di Yin
Xing Sun
M. Zhang
197
0
0
21 Aug 2025
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
Qitao Tan
Xiaoying Song
Jin Lu
Guoming Li
Jun Liu
...
Jundong Li
Xiaoming Zhai
Shaoyi Huang
Wei Niu
Geng Yuan
MQ
226
0
0
21 Aug 2025
SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts--Extended Version
SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts--Extended Version
Nghiem Thanh Pham
Tung Kieu
Duc-Manh Nguyen
Son Ha Xuan
Nghia Duong-Trung
Danh Le-Phuoc
174
2
0
21 Aug 2025
CALR: Corrective Adaptive Low-Rank Decomposition for Efficient Large Language Model Layer Compression
CALR: Corrective Adaptive Low-Rank Decomposition for Efficient Large Language Model Layer Compression
Muchammad Daniyal Kautsar
Afra Majida Hariono
Widyawan
Syukron Abu Ishaq Alfarozi
Kuntpong Woraratpanya
161
0
0
21 Aug 2025
Dream 7B: Diffusion Large Language Models
Dream 7B: Diffusion Large Language Models
Jiacheng Ye
Zhihui Xie
Lin Zheng
Lei Li
Zirui Wu
Xin Jiang
Zhenguo Li
Lingpeng Kong
DiffMVLM
1.0K
110
0
21 Aug 2025
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
Haokun Lin
Haobo Xu
Yichen Wu
Ziyu Guo
Renrui Zhang
Zhichao Lu
Ying Wei
Gang Qu
Zhenan Sun
DiffMMQ
178
9
0
20 Aug 2025
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
Nvidia
Aarti Basant
Abhijit Khairnar
Abhijit Paithankar
Abhinav Khattar
...
Keith Wyss
Keshav Santhanam
Kezhi Kong
Krzysztof Pawelec
Kumar Anik
LRM
298
0
0
20 Aug 2025
GLASS: Test-Time Acceleration for LLMs via Global-Local Neural Importance Aggregation
GLASS: Test-Time Acceleration for LLMs via Global-Local Neural Importance Aggregation
Amirmohsen Sattarifard
Sepehr Lavasani
Ehsan Imani
Kunlin Zhang
Hanlin Xu
Fengyu Sun
Negar Hassanpour
Chao Gao
VLM
104
1
0
19 Aug 2025
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Xiao Liang
Zhongzhi Li
Yeyun Gong
Yelong Shen
Y. Wu
Zhijiang Guo
Weizhu Chen
LRM
236
24
0
19 Aug 2025
Maximum Score Routing For Mixture-of-Experts
Maximum Score Routing For Mixture-of-ExpertsAnnual Meeting of the Association for Computational Linguistics (ACL), 2025
Bowen Dong
Yilong Fan
Yutao Sun
Zhenyu Li
Tengyu Pan
Xun Zhou
Jianyong Wang
MoE
117
2
0
18 Aug 2025
Z-Pruner: Post-Training Pruning of Large Language Models for Efficiency without Retraining
Z-Pruner: Post-Training Pruning of Large Language Models for Efficiency without Retraining
Samiul Basir Bhuiyan
Md. Sazzad Hossain Adib
Mohammed Aman Bhuiyan
Muhammad Rafsan Kabir
Moshiur Farazi
Shafin Rahman
Nabeel Mohammed
179
1
0
18 Aug 2025
Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation
David Heineman
Valentin Hofmann
Ian H. Magnusson
Yuling Gu
Noah A. Smith
Hannaneh Hajishirzi
Kyle Lo
Jesse Dodge
ALM
122
4
0
18 Aug 2025
Reinforcement Learning with Rubric Anchors
Reinforcement Learning with Rubric Anchors
Zenan Huang
Yihong Zhuang
Guoshan Lu
Zeyu Qin
Haokai Xu
...
Yanmei Gu
Y Samuel Wang
Zhengkai Yang
Jianguo Li
Junbo Zhao
ALM
119
21
0
18 Aug 2025
Data Mixing Optimization for Supervised Fine-Tuning of Large Language Models
Data Mixing Optimization for Supervised Fine-Tuning of Large Language Models
Yuan Li
Zhengzhong Liu
Eric P. Xing
139
1
0
16 Aug 2025
Every 28 Days the AI Dreams of Soft Skin and Burning Stars: Scaffolding AI Agents with Hormones and Emotions
Every 28 Days the AI Dreams of Soft Skin and Burning Stars: Scaffolding AI Agents with Hormones and Emotions
Leigh Levinson
Christopher J. Agostino
56
0
0
15 Aug 2025
MSRS: Adaptive Multi-Subspace Representation Steering for Attribute Alignment in Large Language Models
MSRS: Adaptive Multi-Subspace Representation Steering for Attribute Alignment in Large Language Models
Xinyan Jiang
L. Zhang
Jiayi Zhang
Qingsong Yang
Guimin Hu
Di Wang
Lijie Hu
LLMSV
393
3
0
14 Aug 2025
A Survey on Diffusion Language Models
A Survey on Diffusion Language Models
Tianyi Li
Mingda Chen
Bowei Guo
Zhiqiang Shen
316
30
0
14 Aug 2025
EffiEval: Efficient and Generalizable Model Evaluation via Capability Coverage Maximization
EffiEval: Efficient and Generalizable Model Evaluation via Capability Coverage Maximization
Yaoning Wang
Jiahao Ying
Yixin Cao
Yubo Ma
Yugang Jiang
ELM
33
2
0
13 Aug 2025
TiMoE: Time-Aware Mixture of Language Experts
TiMoE: Time-Aware Mixture of Language Experts
Robin Faro
Dongyang Fan
Tamar Alphaidze
Martin Jaggi
MoE
140
1
0
12 Aug 2025
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models
Zhijun Tu
Hanting Chen
Siqi Liu
Chuanjian Liu
Jian Li
Jie Hu
Yunhe Wang
MQ
121
0
0
09 Aug 2025
Align, Don't Divide: Revisiting the LoRA Architecture in Multi-Task Learning
Align, Don't Divide: Revisiting the LoRA Architecture in Multi-Task Learning
Jinda Liu
Bo Cheng
Yi-Ju Chang
Yuan Wu
MoMe
80
0
0
07 Aug 2025
Pruning Large Language Models by Identifying and Preserving Functional Networks
Pruning Large Language Models by Identifying and Preserving Functional Networks
Yiheng Liu
Junhao Ning
Sichen Xia
Xiaohui Gao
Ning Qiang
Bao Ge
Junwei Han
Xiaoyan Cai
155
1
0
07 Aug 2025
iFairy: the First 2-bit Complex LLM with All Parameters in $\{\pm1, \pm i\}$
iFairy: the First 2-bit Complex LLM with All Parameters in {±1,±i}\{\pm1, \pm i\}{±1,±i}
Feiyu Wang
Guoan Wang
Yihao Zhang
S. Wang
Weitao Li
Bokai Huang
Shimao Chen
Z. L. Jiang
Rui Xu
Tong Yang
MQ
233
5
0
07 Aug 2025
TASE: Token Awareness and Structured Evaluation for Multilingual Language Models
TASE: Token Awareness and Structured Evaluation for Multilingual Language Models
Chenzhuo Zhao
Xinda Wang
Yue Huang
Junting Lu
Ziqian Liu
LRM
114
1
0
07 Aug 2025
Tensorized Clustered LoRA Merging for Multi-Task Interference
Tensorized Clustered LoRA Merging for Multi-Task Interference
Zhan Su
Fengran Mo
G. Liang
Jinghan Zhang
Bingbing Wen
Prayag Tiwari
Jian-Yun Nie
MoMe
178
0
0
06 Aug 2025
Previous
123...789...444546
Next