Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1804.07461
Cited By
v1
v2
v3 (latest)
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
20 April 2018
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"
50 / 4,808 papers shown
DualSparse-MoE: Coordinating Tensor/Neuron-Level Sparsity with Expert Partition and Reconstruction
Weilin Cai
Le Qin
Shwai He
Junwei Cui
Ang Li
Jiayi Huang
MoE
124
0
0
25 Aug 2025
EEG-FM-Bench: A Comprehensive Benchmark for the Systematic Evaluation of EEG Foundation Models
Wei Xiong
Jiangtong Li
Jie Li
Kun Zhu
116
3
0
25 Aug 2025
Debiasing Multilingual LLMs in Cross-lingual Latent Space
Qiwei Peng
Guimin Hu
Yekun Chai
Anders Søgaard
148
1
0
25 Aug 2025
Unlearning as Ablation: Toward a Falsifiable Benchmark for Generative Scientific Discovery
Robert Yang
MU
225
0
0
25 Aug 2025
Module-Aware Parameter-Efficient Machine Unlearning on Transformers
Wenjie Bao
Jian Lou
Yuke Hu
Xiaochen Li
Zhihao Liu
Jiaqi Liu
Zhan Qin
K. Ren
MU
128
0
0
24 Aug 2025
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
Krishna Teja Chitty-Venkata
Sylvia Howland
Golara Azar
Daria Soboleva
Natalia Vassilieva
Siddhisanket Raskar
M. Emani
V. Vishwanath
MoE
113
1
0
24 Aug 2025
SALMAN: Stability Analysis of Language Models Through the Maps Between Graph-based Manifolds
Wuxinlin Cheng
Yun Feng
Jinwen Wu
K. P. Subbalakshmi
Tian Han
Zhuo Feng
AAML
110
0
0
23 Aug 2025
Spatio-Temporal Pruning for Compressed Spiking Large Language Models
Yi Jiang
Malyaban Bal
Brian Matejek
Susmit Jha
Adam D. Cobb
Abhronil Sengupta
93
0
0
23 Aug 2025
QFrCoLA: a Quebec-French Corpus of Linguistic Acceptability Judgments
David Beauchemin
Richard Khoury
127
2
0
23 Aug 2025
GEM: A Scale-Aware and Distribution-Sensitive Sparse Fine-Tuning Framework for Effective Downstream Adaptation
Sungmin Kang
Jisoo Kim
Salman Avestimehr
Sunwoo Lee
MoE
160
0
0
22 Aug 2025
Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish
Yakup Abrek Er
.Ilker Kesen
Gözde Gül Şahin
Aykut Erdem
ELM
VLM
158
1
0
22 Aug 2025
CALR: Corrective Adaptive Low-Rank Decomposition for Efficient Large Language Model Layer Compression
Muchammad Daniyal Kautsar
Afra Majida Hariono
Widyawan
Syukron Abu Ishaq Alfarozi
Kuntpong Woraratpanya
162
0
0
21 Aug 2025
Influence-driven Curriculum Learning for Pre-training on Limited Data
Loris Schoenegger
Lukas Thoma
Terra Blevins
Benjamin Roth
201
1
0
21 Aug 2025
SLM-Bench: A Comprehensive Benchmark of Small Language Models on Environmental Impacts--Extended Version
Nghiem Thanh Pham
Tung Kieu
Duc-Manh Nguyen
Son Ha Xuan
Nghia Duong-Trung
Danh Le-Phuoc
174
2
0
21 Aug 2025
Mind and Motion Aligned: A Joint Evaluation IsaacSim Benchmark for Task Planning and Low-Level Policies in Mobile Manipulation
Nikita Kachaev
Andrei Spiridonov
Andrey Gorodetsky
K. Muravyev
Nikita Oskolkov
...
Vlad Shakhuro
Dmitry Makarov
Aleksandr Panov
Polina Fedotova
A. Kovalev
LM&Ro
102
2
0
21 Aug 2025
Evaluating Multilingual and Code-Switched Alignment in LLMs via Synthetic Natural Language Inference
Samir Abdaljalil
E. Serpedin
K. Qaraqe
Hasan Kurban
130
0
0
20 Aug 2025
Train Once, Deploy Anywhere: Realize Data-Efficient Dynamic Object Manipulation
Zhuoling Li
Xiaoyang Wu
Zhenhua Xu
Hengshuang Zhao
109
0
0
19 Aug 2025
Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text
Zixin Rao
Youssef Mohamed
Shang Liu
Zeyan Liu
DeLMO
176
0
0
19 Aug 2025
Hallucinations in medical devices
Jason Granstedt
Prabhat Kc
Rucha Deshpande
Victor Garcia
Aldo Badano
182
3
0
18 Aug 2025
Wavy Transformer
Satoshi Noguchi
Yoshinobu Kawahara
143
0
0
18 Aug 2025
MSRS: Adaptive Multi-Subspace Representation Steering for Attribute Alignment in Large Language Models
Xinyan Jiang
L. Zhang
Jiayi Zhang
Qingsong Yang
Guimin Hu
Di Wang
Lijie Hu
LLMSV
399
3
0
14 Aug 2025
SoK: Data Minimization in Machine Learning
Robin Staab
Nikola Jovanović
Kimberly Mai
Prakhar Ganesh
Martin Vechev
Ferdinando Fioretto
Matthew Jagielski
153
0
0
14 Aug 2025
When Explainability Meets Privacy: An Investigation at the Intersection of Post-hoc Explainability and Differential Privacy in the Context of Natural Language Processing
Mahdi Dhaini
Stephen Meisenbacher
Ege Erdogan
Florian Matthes
Gjergji Kasneci
SILM
210
0
0
14 Aug 2025
Combating Homelessness Stigma with LLMs: A New Multi-Modal Dataset for Bias Detection
Jonathan A. Karr Jr.
Benjamin F. Herbst
Ting Hua
Matthew Hauenstein
Georgina Curto
Nitesh Chawla
81
0
0
14 Aug 2025
Computational Economics in Large Language Models: Exploring Model Behavior and Incentive Design under Resource Constraints
Sandeep Reddy
Kabir Khan
Rohit Patil
Ananya Chakraborty
Faizan A. Khan
Swati Kulkarni
Arjun Verma
Neha Singh
162
1
0
14 Aug 2025
LaajMeter: A Framework for LaaJ Evaluation
Samuel Ackerman
Gal Amram
Ora Nova Fandina
E. Farchi
Shmulik Froimovich
Raviv Gal
Wesam Ibraheem
Avi Ziv
ALM
183
1
0
13 Aug 2025
Dynamic Rank Adjustment for Accurate and Efficient Neural Network Training
Hyuntak Shin
Aecheon Jung
Sungeun Hong
Sunwoo Lee
113
0
0
12 Aug 2025
SinLlama -- A Large Language Model for Sinhala
Moratuwa Engineering Research Conference (MERCon), 2025
H.W.K.Aravinda
Rashad Sirajudeen
Samith Karunathilake
Nisansa de Silva
Surangika Ranathunga
Rishemjit Kaur
LRM
284
1
0
12 Aug 2025
SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders
Zhuohao Yu
Xingru Jiang
Weizheng Gu
Yidong Wang
Shikun Zhang
Wei Ye
Wei Ye
WaLM
334
1
0
11 Aug 2025
Rethinking Tokenization for Rich Morphology: The Dominance of Unigram over BPE and Morphological Alignment
Saketh Reddy Vemula
Sandipan Dandapat
D. Sharma
Parameswari Krishnamurthy
236
0
0
11 Aug 2025
GVGAI-LLM: Evaluating Large Language Model Agents with Infinite Games
Yuchen Li
Cong Lin
Muhammad Umair Nasir
Philip Bontrager
Jialin Liu
Julian Togelius
LLMAG
ELM
LRM
102
1
0
11 Aug 2025
Understanding Syntactic Generalization in Structure-inducing Language Models
David Arps
Hassan Sajjad
Laura Kallmeyer
147
0
0
11 Aug 2025
BoRA: Towards More Expressive Low-Rank Adaptation with Block Diversity
Shiwei Li
Xiandi Luo
Haozhao Wang
Xing Tang
Ziqiang Cui
Dugang Liu
Yuhua Li
Xiuqiang He
Ruixuan Li
108
4
0
09 Aug 2025
Fed MobiLLM: Efficient Federated LLM Fine-Tuning over Heterogeneous Mobile Devices via Server Assisted Side-Tuning
Xingke Yang
Liang Li
Sicong Li
Liwei Guan
Hao Wang
Xiaoqi Qi
Jiang-Dong Liu
Xin Fu
Miao Pan
121
1
0
09 Aug 2025
TASE: Token Awareness and Structured Evaluation for Multilingual Language Models
Chenzhuo Zhao
Xinda Wang
Yue Huang
Junting Lu
Ziqian Liu
LRM
115
1
0
07 Aug 2025
Align, Don't Divide: Revisiting the LoRA Architecture in Multi-Task Learning
Jinda Liu
Bo Cheng
Yi-Ju Chang
Yuan Wu
MoMe
83
0
0
07 Aug 2025
Tesserae: Scalable Placement Policies for Deep Learning Workloads
S. Bian
Saurabh Agarwal
Md. Tareq Mahmood
Shivaram Venkataraman
140
0
0
07 Aug 2025
GeRe: Towards Efficient Anti-Forgetting in Continual Learning of LLM via General Samples Replay
Yunan Zhang
Shuoran Jiang
Mengchen Zhao
Yuefeng Li
Yang Fan
Xiangping Wu
Qingcai Chen
KELM
CLL
140
1
0
06 Aug 2025
Adaptive Sparse Softmax: An Effective and Efficient Softmax Variant
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Qi Lv
Lei Geng
Ziqiang Cao
Min Cao
Sujian Li
Wenjie Li
Guohong Fu
138
2
0
05 Aug 2025
FairLangProc: A Python package for fairness in NLP
Arturo Pérez-Peralta
Sandra Benítez-Peña
Rosa E. Lillo
158
0
0
05 Aug 2025
VFLAIR-LLM: A Comprehensive Framework and Benchmark for Split Learning of LLMs
Zixuan Gu
Qiufeng Fan
Long Sun
Yang Liu
Xiaojun Ye
134
1
0
05 Aug 2025
PLoRA: Efficient LoRA Hyperparameter Tuning for Large Models
Minghao Yan
Zhuang Wang
Zhen Jia
Shivaram Venkataraman
Yida Wang
156
1
0
04 Aug 2025
Beyond Manually Designed Pruning Policies with Second-Level Performance Prediction: A Pruning Framework for LLMs
Zuxin Ma
Yunhe Cui
Yongbin Qin
141
0
0
04 Aug 2025
Amber Pruner: Leveraging N:M Activation Sparsity for Efficient Prefill in Large Language Models
Tai An
Ruwu Cai
Yanzhe Zhang
Yang Liu
Hao Chen
Pengcheng Xie
Sheng Chang
Jing Lin
Gongyi Wang
MoE
146
2
0
04 Aug 2025
LOST: Low-rank and Sparse Pre-training for Large Language Models
Jiaxi Li
Lu Yin
Li Shen
Jinjin Xu
Liwu Xu
Tianjin Huang
Wenwu Wang
Shiwei Liu
Xilu Wang
155
2
0
04 Aug 2025
CAMERA: Multi-Matrix Joint Compression for MoE Models via Micro-Expert Redundancy Analysis
Yuzhuang Xu
Xu Han
Yuanchi Zhang
Yixuan Wang
Yijun Liu
Shiyu Ji
Qingfu Zhu
Wanxiang Che
MoE
MQ
409
1
0
04 Aug 2025
The Architecture of Trust: A Framework for AI-Augmented Real Estate Valuation in the Era of Structured Data
Petteri Teikari
Mike Jarrell
Maryam Azh
Harri Pesola
182
1
0
04 Aug 2025
HT-Transformer: Event Sequences Classification by Accumulating Prefix Information with History Tokens
Ivan Karpukhin
Ivan A Kireev
AI4TS
113
1
0
02 Aug 2025
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
Zishan Shao
Yixiao Wang
Qinsi Wang
Ting Jiang
Zhixu Du
Hancheng Ye
Danyang Zhuo
Yiran Chen
Xue Yang
116
3
0
02 Aug 2025
Interpreting Performance Profiles with Deep Learning
Zhuoran Liu
HAI
100
0
0
01 Aug 2025
Previous
1
2
3
...
5
6
7
...
95
96
97
Next