v1v2v3 (latest)

GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

20 April 2018

Amanpreet Singh

Papers citing "GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"

50 / 4,808 papers shown

HyperAdaLoRA: Accelerating LoRA Rank Allocation During Training via Hypernetworks without Sacrificing Performance

...

Tianyang Wang

Hao Xu

105

03 Oct 2025

Hierarchical Semantic Retrieval with Cobweb

Anant Gupta

Karthik Singaravadivelan

Zekun Wang

108

02 Oct 2025

Automated Evaluation can Distinguish the Good and Bad AI Responses to Patient Questions about Hospitalization

Sarvesh Soni

Dina Demner-Fushman

AI4MH

173

01 Oct 2025

The Social Laboratory: A Psychometric Framework for Multi-Agent LLM Evaluation

Zarreen Reza

LLMAG

01 Oct 2025

Facilitating Cognitive Accessibility with LLMs: A Multi-Task Approach to Easy-to-Read Text Generation

01 Oct 2025

ALARB: An Arabic Legal Argument Reasoning Benchmark

102

01 Oct 2025

Sample-Efficient Differentially Private Fine-Tuning via Gradient Matrix Denoising

Ali Dadsetan

Frank Rudzicz

100

01 Oct 2025

Curiosity-Driven LLM-as-a-judge for Personalized Creative Judgment

Vanya Bannihatti Kumar

01 Oct 2025

QFrBLiMP: a Quebec-French Benchmark of Linguistic Minimal Pairs

131

30 Sep 2025

When Hallucination Costs Millions: Benchmarking AI Agents in High-Stakes Adversarial Financial Markets

30 Sep 2025

AI Playing Business Games: Benchmarking Large Language Models on Managerial Decision-Making in Dynamic Simulations

Berdymyrat Ovezmyradov

102

30 Sep 2025

LD-MoLE: Learnable Dynamic Routing for Mixture of LoRA Experts

234

30 Sep 2025

Emergent evaluation hubs in a decentralizing large language model ecosystem

Manuel Cebrian

Tomomi Kito

Raul Castro Fernandez

ALM ELM

136

30 Sep 2025

PrunedLoRA: Robust Gradient-Based structured pruning for Low-rank Adaptation in Fine-tuning

247

30 Sep 2025

Mitigating Biases in Language Models via Bias Unlearning

214

30 Sep 2025

Conda: Column-Normalized Adam for Training Large Language Models Faster

246

29 Sep 2025

Knowledge Editing with Subspace-Aware Key-Value Mappings

294

29 Sep 2025

Mechanisms of Matter: Language Inferential Benchmark on Physicochemical Hypothesis in Materials Synthesis

Yingming Pu

Tao Lin

Hongyu Chen

157

29 Sep 2025

A Hierarchical Error Framework for Reliable Automated Coding in Communication Research: Applications to Health and Political Communication

Zhilong Zhao

Yindi Liu

AILaw

202

29 Sep 2025

Dynamic Orthogonal Continual Fine-tuning for Mitigating Catastrophic Forgettings

149

28 Sep 2025

Open-DeBias: Toward Mitigating Open-Set Bias in Language Models

178

28 Sep 2025

Beyond Greedy Exits: Improved Early Exit Decisions for Risk Control and Reliability

Divya J. Bajpai

M. Hanawal

28 Sep 2025

More Data or Better Algorithms: Latent Diffusion Augmentation for Deep Imbalanced Regression

Shayan Alahyari

DiffM

27 Sep 2025

Breaking the MoE LLM Trilemma: Dynamic Expert Clustering with Structured Compression

150

27 Sep 2025

Knowledge distillation through geometry-aware representational alignment

177

27 Sep 2025

Bridging Fairness and Explainability: Can Input-Based Explanations Promote Fairness in Hate Speech Detection?

175

26 Sep 2025

Diagnosing the Performance Trade-off in Moral Alignment: A Case Study on Gender Stereotypes

180

25 Sep 2025

Null-Space Filtering for Data-Free Continual Model Merging: Preserving Transparency, Promoting Fidelity

128

25 Sep 2025

Performance Consistency of Learning Methods for Information Retrieval Tasks

Meng Yuan

Justin Zobel

25 Sep 2025

SoM-1K: A Thousand-Problem Benchmark Dataset for Strength of Materials

25 Sep 2025

Investigating the Representation of Backchannels and Fillers in Fine-tuned Language Models

24 Sep 2025

Detoxifying Large Language Models via Autoregressive Reward Guided Representation Editing

153

24 Sep 2025

Faster Than SVD, Smarter Than SGD: The OPLoRA Alternating Update

Abdulla Jasem Almansoori

109

24 Sep 2025

TsqLoRA: Towards Sensitivity and Quality Low-Rank Adaptation for Efficient Fine-Tuning

168

23 Sep 2025

Extractive Fact Decomposition for Interpretable Natural Language Inference in one Forward Pass

Nicholas Popovic

Michael Färber

113

23 Sep 2025

HyperAdapt: Simple High-Rank Adaptation

Abel Gurung

Joseph Campbell

167

23 Sep 2025

Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning

162

23 Sep 2025

A Pipeline to Assess Merging Methods via Behavior and Internals

Yutaro Sigris

Andreas Waldis

MoMe

296

23 Sep 2025

Uncertainty in Semantic Language Modeling with PIXELS

Stefania Radu

Marco Zullich

Matias Valdenegro-Toro

147

23 Sep 2025

MSCoRe: A Benchmark for Multi-Stage Collaborative Reasoning in LLM Agents

22 Sep 2025

TensLoRA: Tensor Alternatives for Low-Rank Adaptation

François Leduc-Primeau

22 Sep 2025

TASO: Task-Aligned Sparse Optimization for Parameter-Efficient Model Adaptation

22 Sep 2025

SEQR: Secure and Efficient QR-based LoRA Routing

William Fleshman

Benjamin Van Durme

165

22 Sep 2025

AIMMerging: Adaptive Iterative Model Merging Using Training Trajectories for Language Model Continual Learning

...

163

22 Sep 2025

On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs

Rongguang Ye

Ming Tang

Edith C. H. Ngai

22 Sep 2025

Accurate and Efficient Low-Rank Model Merging in Core Space

Bartłomiej Twardowski

272

22 Sep 2025

Can an Individual Manipulate the Collective Decisions of Multi-Agents?

210

20 Sep 2025

MCP: A Control-Theoretic Orchestration Framework for Synergistic Efficiency and Interpretability in Multimodal Large Language Models

Luyan Zhang

20 Sep 2025

BEFT: Bias-Efficient Fine-Tuning of Language Models

Baichuan Huang

Ananth Balashankar

Amir Aminifar

127

19 Sep 2025

Toward Efficient Influence Function: Dropout as a Compression Tool

Yuchen Zhang

Mohammad Mohammadi Amiri

TDI

243

19 Sep 2025