1804.07461
Cited By
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
20 April 2018
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Papers citing "GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"
50 / 4,808 papers shown
FedReFT: Federated Representation Fine-Tuning with All-But-Me Aggregation
Fatema Siddika
Md Anwar Hossen
J. P. Muñoz
Tanya Roosta
Anuj Sharma
Ali Jannesari
FedML
174
1
0
24 Dec 2025
Technical Report on Text Dataset Distillation
Keith Ando Ogawa
Bruno Yamamoto
Lucas Lauton de Alcantara
Victor Zacarias
Edson Bollis
Lucas Pellicer
Rosimeire Pereira Costa
A. H. R. Costa
Artur Jordao
DD
276
0
0
03 Dec 2025
Evaluating Hydro-Science and Engineering Knowledge of Large Language Models
S. Hu
Wenbo Shan
Yingjia Li
Zhiqi Wan
Xinpeng Yu
...
Chee Hui Lai
Wei Luo
Yubin He
Bin Xu
Jianshi Zhao
ELM
AI4CE
178
0
0
03 Dec 2025
PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models
Róbert Belanec
Ivan Srba
Maria Bielikova
ALM
440
0
0
02 Dec 2025
An Empirical Survey of Model Merging Algorithms for Social Bias Mitigation
Daiki Shirafuji
Tatsuhiko Saito
Yasutomo Kimura
MoMe
KELM
132
0
0
02 Dec 2025
ESACT: An End-to-End Sparse Accelerator for Compute-Intensive Transformers via Local Similarity
Hongxiang Liu
Zhifang Deng
Tong Pu
Shengli Lu
161
0
0
02 Dec 2025
InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages
Mamadou K. Keita
Sébastien Diarra
Christopher Homan
Seydou Diallo
45
0
0
01 Dec 2025
Stay Unique, Stay Efficient: Preserving Model Personality in Multi-Task Merging
Kuangpu Guo
Yuhe Ding
Jian Liang
Zilei Wang
Ran He
MoMe
136
0
0
01 Dec 2025
Low-Rank Prehab: Preparing Neural Networks for SVD Compression
Haoran Qin
Shansita D. Sharma
Ali Abbasi
Chayne Thrash
Soheil Kolouri
149
0
0
01 Dec 2025
Breaking It Down: Domain-Aware Semantic Segmentation for Retrieval Augmented Generation
Aparajitha Allamraju
Maitreya Prafulla Chitale
Hiranmai Sri Adibhatla
Rahul Mishra
Manish Shrivastava
71
0
0
29 Nov 2025
EduEval: A Hierarchical Cognitive Benchmark for Evaluating Large Language Models in Chinese Education
Guoqing Ma
Jia Zhu
Hanghui Guo
Weijie Shi
Yue Cui
Jiawei Shen
Zilong Li
Yidan Liang
AI4Ed
ELM
332
0
0
29 Nov 2025
From Coefficients to Directions: Rethinking Model Merging with Directional Alignment
Zhikang Chen
Sen Cui
Deheng Ye
Min Zhang
Gang Niu
Yu Zhang
Masashi Sugiyama
Tingting Zhu
MoMe
185
0
0
29 Nov 2025
FedSGT: Exact Federated Unlearning via Sequential Group-based Training
Bokang Zhang
Hong Guan
Hong kyu Lee
Ruixuan Liu
Jia Zou
Li Xiong
MU
FedML
247
0
0
28 Nov 2025
Instruction Tuning of Large Language Models for Tabular Data Generation - in One Day
Milad Abdollahzadeh
Abdul Raheem
Zilong Zhao
Uzair Javaid
Kevin Yee
Nalam Venkata Abhishek
Tram Truong-Huu
Biplab Sikdar
LMTD
ALM
239
0
0
28 Nov 2025
SuRe: Surprise-Driven Prioritised Replay for Continual LLM Learning
Hugo Hazard
Zafeirios Fountas
Martin A Benfeghoul
Adnan Oomerjee
Jun Wang
Haitham Bou-Ammar
CLL
KELM
195
0
0
27 Nov 2025
CacheTrap: Injecting Trojans in LLMs without Leaving any Traces in Inputs or Weights
Mohaiminul Al Nahian
Abeer Matar A. Almalky
Gamana Aragonda
Ranyang Zhou
Sabbir Ahmed
Dmitry Ponomarev
Li Yang
Shaahin Angizi
Adnan Siraj Rakin
59
0
0
27 Nov 2025
Decomposed Trust: Exploring Privacy, Adversarial Robustness, Fairness, and Ethics of Low-Rank LLMs
Daniel Agyei Asante
Md Mokarram Chowdhury
Yang Li
89
0
0
27 Nov 2025
Masks Can Be Distracting: On Context Comprehension in Diffusion Language Models
Julianna Piskorz
Cristina Pinneri
Alvaro H.C. Correia
Motasem Alfarra
Risheek Garrepalli
Christos Louizos
DiffM
209
1
0
26 Nov 2025
PEFT-Bench: A Parameter-Efficient Fine-Tuning Methods Benchmark
Róbert Belanec
Branislav Pecher
Ivan Srba
Maria Bielikova
121
1
0
26 Nov 2025
Structured Prompting Enables More Robust Evaluation of Language Models
Asad Aali
Muhammad Ahmed Mohsin
Vasiliki Bikia
Arnav Singhvi
Richard Gaus
...
Sanmi Koyejo
Emily Alsentzer
Christopher Potts
N. Shah
Akshay Chaudhari
ELM
LRM
284
0
0
25 Nov 2025
Stragglers Can Contribute More: Uncertainty-Aware Distillation for Asynchronous Federated Learning
Yujia Wang
Fenglong Ma
Jinghui Chen
FedML
282
0
0
25 Nov 2025
Towards Trustworthy Wi-Fi Sensing: Systematic Evaluation of Deep Learning Model Robustness to Adversarial Attacks
Shreevanth Krishnaa Gopalakrishnan
Stephen Hailes
AAML
OOD
221
0
0
25 Nov 2025
CrypTorch: PyTorch-based Auto-tuning Compiler for Machine Learning with Multi-party Computation
Jinyu Liu
Gang Tan
Kiwan Maeng
88
0
0
24 Nov 2025
ABM-LoRA: Activation Boundary Matching for Fast Convergence in Low-Rank Adaptation
Dongha Lee
Jinhee Park
Minjun Kim
Junseok Kwon
AI4CE
402
0
0
24 Nov 2025
OceanForecastBench: A Benchmark Dataset for Data-Driven Global Ocean Forecasting
Haoming Jia
Yi Han
X. Wang
Huizan Wang
Wei Wu
Jianming Zheng
Peikun Xiao
AI4Cl
420
0
0
24 Nov 2025
CoreEval: Automatically Building Contamination-Resilient Datasets with Real-World Knowledge toward Reliable LLM Evaluation
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Jingqian Zhao
Bingbing Wang
Geng Tu
Y. Zhang
Qianlong Wang
Bin Liang
Jing Li
Ruifeng Xu
83
0
0
24 Nov 2025
CAMformer: Associative Memory is All You Need
Tergel Molom-Ochir
Benjamin Morris
Mark Horton
Chiyue Wei
Cong Guo
...
Peter Liu
Shan X. Wang
Deliang Fan
Hai Helen Li
Yiran Chen
101
0
0
24 Nov 2025
Exploiting the Experts: Unauthorized Compression in MoE-LLMs
Pinaki Prasad Guha Neogi
Ahmad Mohammadshirazi
Dheeraj Kulshrestha
R. Ramnath
MoE
140
0
0
22 Nov 2025
DeepCoT: Deep Continual Transformers for Real-Time Inference on Data Streams
Ginés Carreto Picón
Peng Yuan Zhou
Qi Zhang
Alexandros Iosifidis
AI4TS
196
0
0
21 Nov 2025
ILoRA: Federated Learning with Low-Rank Adaptation for Heterogeneous Client Aggregation
Junchao Zhou
Junkang Liu
Fanhua Shang
183
0
0
20 Nov 2025
TS-PEFT: Unveiling Token-Level Redundancy in Parameter-Efficient Fine-Tuning
Dabiao Ma
Ziming Dai
Zhimin Xin
Shu Wang
Ye Wang
Haojun Fei
178
0
0
20 Nov 2025
Multimodal Evaluation of Russian-language Architectures
Artem Chervyakov
Ulyana Isaeva
Anton A. Emelyanov
Artem Safin
Maria Tikhonova
...
Ilseyar Alimova
A. Kapitanov
Alena Fenogenova
320
1
0
19 Nov 2025
Breaking Expert Knowledge Limits: Self-Pruning for Large Language Models
Haidong Kang
Lihong Lin
Enneng Yang
Hongning Dai
Hao Wang
LRM
217
0
0
19 Nov 2025
An Operational Kardashev-Style Scale for Autonomous AI - Towards AGI and Superintelligence
Przemyslaw Chojecki
ELM
103
3
0
17 Nov 2025
Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
Chenglong Wang
Yifu Huo
Yang Gan
Yongyu Mu
Qiaozhi He
...
Tongran Liu
Anxiang Ma
Zhengtao Yu
Jingbo Zhu
Tong Xiao
108
0
0
16 Nov 2025
Dynamic Temperature Scheduler for Knowledge Distillation
Sibgat Ul Islam
Jawad Ibn Ahad
Fuad Rahman
M. R. Amin
Nabeel Mohammed
Shafin Rahman
102
0
0
14 Nov 2025
Towards Outcome-Oriented, Task-Agnostic Evaluation of AI Agents
Waseem Alshikh
Muayad Ali
Brian Kennedy
Dmytro Mozolevskyi
80
0
0
11 Nov 2025
DP-AdamW: Investigating Decoupled Weight Decay and Bias Correction in Private Deep Learning
Jay Chooi
Kevin Cong
Russell Li
Lillian Sun
167
1
0
11 Nov 2025
Low-Rank Curvature for Zeroth-Order Optimization in LLM Fine-Tuning
Hyunseok Seung
Jaewoo Lee
Hyunsuk Ko
73
0
0
11 Nov 2025
Re-coding for Uncertainties: Edge-awareness Semantic Concordance for Resilient Event-RGB Segmentation
Nan Bao
Yifan Zhao
Lin Zhu
Jia Li
94
0
0
11 Nov 2025
LoRA on the Go: Instance-level Dynamic LoRA Selection and Merging
Seungeon Lee
Soumi Das
Manish Gupta
Krishna P. Gummadi
MoMe
622
1
0
10 Nov 2025
TuckA: Hierarchical Compact Tensor Experts for Efficient Fine-Tuning
Qifeng Lei
Zhiyong Yang
Qianqian Xu
Cong Hua
Peisong Wen
Qingming Huang
106
0
0
10 Nov 2025
Probabilities Are All You Need: A Probability-Only Approach to Uncertainty Estimation in Large Language Models
Manh Trong Nguyen
Sunil R. Gupta
Hung Le
167
0
0
10 Nov 2025
QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations
Zhixiong Zhao
Haomin Li
Fangxin Liu
Yuncheng Lu
Zongwu Wang
Tao Yang
Li Jiang
Haibing Guan
260
2
0
10 Nov 2025
Selecting Auxiliary Data via Neural Tangent Kernels for Low-Resource Domains
P. Wang
Hongcheng Liu
Yusheng Liao
Ziqing Fan
Yaxin Du
Shuo Tang
Y. Wang
Y Samuel Wang
132
1
0
10 Nov 2025
DyKAF: Dynamical Kronecker Approximation of the Fisher Information Matrix for Gradient Preconditioning
Nikolay Yudin
Ekaterina Grishina
Andrey Veprikov
Alexandr Beznosikov
Maxim Rakhuba
177
0
0
09 Nov 2025
LPFQA: A Long-Tail Professional Forum-based Benchmark for LLM Evaluation
Liya Zhu
Peizhuang Cong
Aowei Ji
Wenya Wu
Jiani Hou
...
Jingzhe Ding
Tong Yang
Z. Wang
Ge Zhang
Wenhao Huang
ALM
ELM
573
0
0
09 Nov 2025
Steering Language Models with Weight Arithmetic
Constanza Fierro
Fabien Roger
MoMe
LLMSV
515
0
0
07 Nov 2025
ManufactuBERT: Efficient Continual Pretraining for Manufacturing
Robin Armingaud
Romaric Besançon
81
0
0
07 Nov 2025
First is Not Really Better Than Last: Evaluating Layer Choice and Aggregation Strategies in Language Model Data Influence Estimation
Dmytro Vitel
Anshuman Chhabra
TDI
365
0
0
06 Nov 2025