v1v2v3 (latest)

GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

20 April 2018

Amanpreet Singh

Papers citing "GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"

50 / 4,808 papers shown

DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction

124

25 Aug 2025

EEG-FM-Bench: A Comprehensive Benchmark for the Systematic Evaluation of EEG Foundation Models

116

25 Aug 2025

Debiasing Multilingual LLMs in Cross-lingual Latent Space

148

25 Aug 2025

Unlearning as Ablation: Toward a Falsifiable Benchmark for Generative Scientific Discovery

Robert Yang

225

25 Aug 2025

Module-Aware Parameter-Efficient Machine Unlearning on Transformers

128

24 Aug 2025

MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models

Krishna Teja Chitty-Venkata

113

24 Aug 2025

SALMAN: Stability Analysis of Language Models Through the Maps Between Graph-based Manifolds

110

23 Aug 2025

Spatio-Temporal Pruning for Compressed Spiking Large Language Models

23 Aug 2025

QFrCoLA: a Quebec-French Corpus of Linguistic Acceptability Judgments

David Beauchemin

Richard Khoury

127

23 Aug 2025

GEM: A Scale-Aware and Distribution-Sensitive Sparse Fine-Tuning Framework for Effective Downstream Adaptation

160

22 Aug 2025

Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish

158

22 Aug 2025

CALR: Corrective Adaptive Low-Rank Decomposition for Efficient Large Language Model Layer Compression

Muchammad Daniyal Kautsar

Afra Majida Hariono

Widyawan

Syukron Abu Ishaq Alfarozi

Kuntpong Woraratpanya

162

21 Aug 2025

Influence-driven Curriculum Learning for Pre-training on Limited Data

201

21 Aug 2025

SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts--Extended Version

174

21 Aug 2025

Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation

...

102

21 Aug 2025

Evaluating Multilingual and Code-Switched Alignment in LLMs via Synthetic Natural Language Inference

130

20 Aug 2025

Train Once, Deploy Anywhere: Realize Data-Efficient Dynamic Object Manipulation

109

19 Aug 2025

Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text

176

19 Aug 2025

Hallucinations in medical devices

182

18 Aug 2025

Wavy Transformer

Satoshi Noguchi

Yoshinobu Kawahara

143

18 Aug 2025

MSRS: Adaptive Multi-Subspace Representation Steering for Attribute Alignment in Large Language Models

399

14 Aug 2025

SoK: Data Minimization in Machine Learning

153

14 Aug 2025

When Explainability Meets Privacy: An Investigation at the Intersection of Post-hoc Explainability and Differential Privacy in the Context of Natural Language Processing

210

14 Aug 2025

Combating Homelessness Stigma with LLMs: A New Multi-Modal Dataset for Bias Detection

14 Aug 2025

Computational Economics in Large Language Models: Exploring Model Behavior and Incentive Design under Resource Constraints

162

14 Aug 2025

LaajMeter: A Framework for LaaJ Evaluation

183

13 Aug 2025

Dynamic Rank Adjustment for Accurate and Efficient Neural Network Training

113

12 Aug 2025

SinLlama -- A Large Language Model for SinhalaMoratuwa Engineering Research Conference (MERCon), 2025

284

12 Aug 2025

SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders

334

11 Aug 2025

Rethinking Tokenization for Rich Morphology: The Dominance of Unigram over BPE and Morphological Alignment

Saketh Reddy Vemula

Sandipan Dandapat

D. Sharma

Parameswari Krishnamurthy

236

11 Aug 2025

GVGAI-LLM: Evaluating Large Language Model Agents with Infinite Games

102

11 Aug 2025

Understanding Syntactic Generalization in Structure-inducing Language Models

David Arps

Hassan Sajjad

Laura Kallmeyer

147

11 Aug 2025

BoRA: Towards More Expressive Low-Rank Adaptation with Block Diversity

108

09 Aug 2025

Fed MobiLLM: Efficient Federated LLM Fine-Tuning over Heterogeneous Mobile Devices via Server Assisted Side-Tuning

121

09 Aug 2025

TASE: Token Awareness and Structured Evaluation for Multilingual Language Models

115

07 Aug 2025

Align, Don't Divide: Revisiting the LoRA Architecture in Multi-Task Learning

07 Aug 2025

Tesserae: Scalable Placement Policies for Deep Learning Workloads

S. Bian

Saurabh Agarwal

Md. Tareq Mahmood

Shivaram Venkataraman

140

07 Aug 2025

GeRe: Towards Efficient Anti-Forgetting in Continual Learning of LLM via General Samples Replay

140

06 Aug 2025

Adaptive Sparse Softmax: An Effective and Efficient Softmax VariantIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025

138

05 Aug 2025

FairLangProc: A Python package for fairness in NLP

Arturo Pérez-Peralta

Sandra Benítez-Peña

Rosa E. Lillo

158

05 Aug 2025

VFLAIR-LLM: A Comprehensive Framework and Benchmark for Split Learning of LLMs

134

05 Aug 2025

PLoRA: Efficient LoRA Hyperparameter Tuning for Large Models

Minghao Yan

Zhuang Wang

Zhen Jia

Shivaram Venkataraman

Yida Wang

156

04 Aug 2025

Beyond Manually Designed Pruning Policies with Second-Level Performance Prediction: A Pruning Framework for LLMs

Zuxin Ma

Yunhe Cui

Yongbin Qin

141

04 Aug 2025

Amber Pruner: Leveraging N:M Activation Sparsity for Efficient Prefill in Large Language Models

146

04 Aug 2025

LOST: Low-rank and Sparse Pre-training for Large Language Models

155

04 Aug 2025

CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis

409

04 Aug 2025

The Architecture of Trust: A Framework for AI-Augmented Real Estate Valuation in the Era of Structured Data

182

04 Aug 2025

HT-Transformer: Event Sequences Classification by Accumulating Prefix Information with History Tokens

Ivan Karpukhin

Ivan A Kireev

AI4TS

113

02 Aug 2025

FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models

116

02 Aug 2025

Interpreting Performance Profiles with Deep Learning

Zhuoran Liu

HAI

100

01 Aug 2025