1804.07461
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
20 April 2018
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Papers citing "GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"
50 / 4,447 papers shown
Title
Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark
Sondos Mahmoud Bsharat
Mukul Ranjan
Aidar Myrzakhan
Jiacheng Liu
Bowei Guo
Shengkun Tang
Zhuang Liu
Yuanzhi Li
Zhiqiang Shen
ELM
118
1
0
26 Mar 2025
Cyborg Data: Merging Human with AI Generated Training Data
Kai North
Christopher Ormerod
68
0
0
26 Mar 2025
VectorFit: Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models
Suhas G Hegde
S. K
Aruna Tiwari
88
1
0
25 Mar 2025
Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs
Chang Gao
Kang Zhao
Jianfei Chen
Liping Jing
83
0
0
24 Mar 2025
LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment
Jong Myoung Kim
Young-Jun Lee
Ho-Jin Choi
Sangkeun Jung
102
0
0
24 Mar 2025
ZeroLM: Data-Free Transformer Architecture Search for Language Models
Zhen-Song Chen
Hong-Wei Ding
Xian-Jia Wang
Witold Pedrycz
96
0
0
24 Mar 2025
PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment
Jong Myoung Kim
Young-Jun Lee
Ho-Jin Choi
Sangkeun Jung
100
0
0
24 Mar 2025
Dynamic Task Vector Grouping for Efficient Multi-Task Prompt Tuning
Pieyi Zhang
Richong Zhang
Zhijie Nie
VLM
138
0
0
23 Mar 2025
DeLoRA: Decoupling Angles and Strength in Low-rank Adaptation
Massimo Bini
Leander Girrbach
Zeynep Akata
215
1
0
23 Mar 2025
CausalRivers -- Scaling up benchmarking of causal discovery for real-world time-series
Gideon Stein
M. Shadaydeh
Jan Blunk
Niklas Penzel
Joachim Denzler
AI4TS
78
1
0
21 Mar 2025
Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models
Suho Yoo
Hyunjong Ok
Jaeho Lee
AuLLM
RALM
103
0
0
21 Mar 2025
InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer
Tony Zhang
Rickard Brännvall
79
0
0
20 Mar 2025
Unified Enhancement of the Generalization and Robustness of Language Models via Bi-Stage Optimization
Yizhou Sun
Juan Yin
Juan Zhao
Fan Zhang
Yongheng Liu
Hongji Chen
62
0
0
19 Mar 2025
Measuring AI Ability to Complete Long Tasks
Thomas Kwa
Ben West
Joel Becker
Amy Deng
Katharyn Garcia
...
Lucas Jun Koba Sato
H. Wijk
Daniel M. Ziegler
Elizabeth Barnes
Lawrence Chan
ELM
284
18
0
18 Mar 2025
Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English
Duke Nguyen
Aditya Joshi
Flora D. Salim
81
0
0
17 Mar 2025
Triad: Empowering LMM-based Anomaly Detection with Vision Expert-guided Visual Tokenizer and Manufacturing Process
Yuanze Li
Shihao Yuan
Haolin Wang
Qizhang Li
Ming-Yu Liu
Chen Xu
Guangming Shi
Wangmeng Zuo
83
2
0
17 Mar 2025
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization
Hao Mark Chen
S. Hu
Wayne Luk
Timothy M. Hospedales
Hongxiang Fan
MoMe
119
1
0
16 Mar 2025
TLUE: A Tibetan Language Understanding Evaluation Benchmark
Fan Gao
Cheng Huang
Nyima Tashi
Xiangxiang Wang
Thupten Tsering
...
Gadeng Luosang
Rinchen Dongrub
Dorje Tashi
Xiao Feng
Yongbin Yu
ELM
253
2
0
15 Mar 2025
A Survey on Federated Fine-tuning of Large Language Models
Yebo Wu
Chunlin Tian
Jingguang Li
He Sun
Kahou Tam
Zhanting Zhou
Haicheng Liao
Zhijiang Guo
Li Li
Chengzhong Xu
FedML
154
5
0
15 Mar 2025
LAG-MMLU: Benchmarking Frontier LLM Understanding in Latvian and Giriama
Naome A. Etori
Kevin Lu
Randu Karisa
Arturs Kanepajs
LRM
ELM
479
0
0
14 Mar 2025
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling
R. Teo
T. Nguyen
MoE
149
2
0
14 Mar 2025
Key, Value, Compress: A Systematic Exploration of KV Cache Compression Techniques
Neusha Javidnia
B. Rouhani
F. Koushanfar
549
0
0
14 Mar 2025
reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs
Zhaofeng Wu
Michihiro Yasunaga
Andrew Cohen
Yoon Kim
Asli Celikyilmaz
Marjan Ghazvininejad
90
3
0
14 Mar 2025
Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity
Chi Xu
Gefei Zhang
Yantong Zhu
Luca Benini
Guosheng Hu
Yawei Li
Zhihong Zhang
58
1
0
14 Mar 2025
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation
Weihao Xuan
Rui Yang
Heli Qi
Qingcheng Zeng
Yunze Xiao
...
Edison Marrese-Taylor
Shijian Lu
Yusuke Iwasawa
Yutaka Matsuo
Irene Li
ELM
213
7
0
13 Mar 2025
Do I look like a `cat.n.01` to you? A Taxonomy Image Generation Benchmark
Viktor Moskvoretskii
Alina Lobanova
Ekaterina Neminova
Chris Biemann
Alexander Panchenko
Irina Nikishina
82
0
0
13 Mar 2025
Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout
Shilong Wang
Jianchun Liu
Hongli Xu
Jiaming Yan
Xianjun Gao
147
1
0
13 Mar 2025
Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors
Runxi Cheng
Feng Xiong
Yongxian Wei
Wanyun Zhu
Chun Yuan
MoMe
133
1
0
11 Mar 2025
Multi-Cue Adaptive Visual Token Pruning for Large Vision-Language Models
Bozhi Luan
Wengang Zhou
Hao Feng
Zhe Wang
Xiaosong Li
Haoyang Li
VLM
123
0
0
11 Mar 2025
Assessing the Macro and Micro Effects of Random Seeds on Fine-Tuning Large Language Models
Hao Zhou
Guergana Savova
Lijing Wang
122
0
0
10 Mar 2025
Gender Encoding Patterns in Pretrained Language Model Representations
Mahdi Zakizadeh
Mohammad Taher Pilehvar
208
0
0
09 Mar 2025
Green Prompting
Marta Adamska
Daria Smirnova
Hamid Nasiri
Zhengxin Yu
Peter Garraghan
527
1
0
09 Mar 2025
Seeing Delta Parameters as JPEG Images: Data-Free Delta Compression with Discrete Cosine Transform
Chenyu Huang
Peng Ye
Xinyu Wang
Shenghe Zheng
Biqing Qi
Lei Bai
Wanli Ouyang
Tao Chen
59
2
0
09 Mar 2025
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts
Shwai He
Weilin Cai
Jiayi Huang
Ang Li
MoE
185
2
0
07 Mar 2025
Quantum-PEFT: Ultra parameter-efficient fine-tuning
Toshiaki Koike-Akino
F. Tonin
Yongtao Wu
Frank Zhengqing Wu
Leyla Naz Candogan
Volkan Cevher
MQ
221
5
0
07 Mar 2025
Toward an Evaluation Science for Generative AI Systems
Laura Weidinger
Deb Raji
Hanna M. Wallach
Margaret Mitchell
Angelina Wang
Olawale Salaudeen
Rishi Bommasani
Sayash Kapoor
Deep Ganguli
Sanmi Koyejo
EGVM
ELM
110
10
0
07 Mar 2025
AutoTestForge: A Multidimensional Automated Testing Framework for Natural Language Processing Models
Hengrui Xing
Cong Tian
Liang Zhao
Zejun Ma
WenSheng Wang
N. Zhang
Chao Huang
Zhenhua Duan
74
0
0
07 Mar 2025
Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs
Zara Siddique
Irtaza Khalid
Liam D. Turner
Luis Espinosa-Anke
LLMSV
161
2
0
07 Mar 2025
Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning
Mohammad Amin Ghanizadeh
Mohammad Javad Dousti
82
1
0
06 Mar 2025
TGEA: An error-annotated dataset and benchmark tasks for text generation from pretrained language models
Jie He
Bo Peng
Yi-Lun Liao
Qun Liu
Deyi Xiong
109
8
0
06 Mar 2025
PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Hybrid Secret Sharing
Zhichao You
Xuewen Dong
Ke Cheng
Xutong Mu
Jiaxuan Fu
Shiyang Ma
Qiang Qu
Yulong Shen
FedML
113
0
0
05 Mar 2025
Personalized Federated Fine-tuning for Heterogeneous Data: An Automatic Rank Learning Approach via Two-Level LoRA
Jie Hao
Yuman Wu
Ali Payani
Myungjin Lee
Mingrui Liu
124
2
0
05 Mar 2025
Token-Level Privacy in Large Language Models
Re'em Harel
Niv Gilboa
Yuval Pinter
89
0
0
05 Mar 2025
Sarcasm Detection as a Catalyst: Improving Stance Detection with Cross-Target Capabilities
Gibson Nkhata
Shi Yin Hong
Susan Gauch
91
0
0
05 Mar 2025
Privacy and Accuracy-Aware AI/ML Model Deduplication
Hong Guan
Lei Yu
Lixi Zhou
Li Xiong
Kanchan Chowdhury
Lulu Xie
Xusheng Xiao
Jia Zou
79
0
0
04 Mar 2025
MindBridge: Scalable and Cross-Model Knowledge Editing via Memory-Augmented Modality
Shuaike Li
Kai Zhang
Qiang Liu
Enhong Chen
KELM
128
1
0
04 Mar 2025
Parameter-Efficient Fine-Tuning of Large Language Models via Deconvolution in Subspace
Jia-Chen Zhang
Yu-Jie Xiong
Chun-Ming Xia
Dong-Hai Zhu
Xi-He Qiu
105
4
0
03 Mar 2025
Analyzing the Safety of Japanese Large Language Models in Stereotype-Triggering Prompts
Akito Nakanishi
Yukie Sano
Geng Liu
Francesco Pierri
95
0
0
03 Mar 2025
Revisiting Large Language Model Pruning using Neuron Semantic Attribution
Yizhuo Ding
Xinwei Sun
Yanwei Fu
Guosheng Hu
73
2
0
03 Mar 2025
AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Nicholas Carlini
Javier Rando
Edoardo Debenedetti
Milad Nasr
F. Tramèr
AAML
ELM
92
3
0
03 Mar 2025