ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.07461
  4. Cited By
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
v1v2v3 (latest)

GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

20 April 2018
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
    ELM
ArXiv (abs)PDFHTML

Papers citing "GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"

50 / 4,447 papers shown
Title
MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection
Yixian Shen
Qi Bi
Jia-Hong Huang
Hongyi Zhu
Andy D. Pimentel
Anuj Pathania
17
0
0
29 May 2025
Beyond Zero Initialization: Investigating the Impact of Non-Zero Initialization on LoRA Fine-Tuning Dynamics
Beyond Zero Initialization: Investigating the Impact of Non-Zero Initialization on LoRA Fine-Tuning Dynamics
Shiwei Li
Xiandi Luo
Xing Tang
Haozhao Wang
Hao Chen
Weihong Luo
Yuhua Li
Xiuqiang He
Ruixuan Li
AI4CE
45
0
0
29 May 2025
Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning
Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning
Yongkang Liu
Xingle Xu
Ercong Nie
Zijing Wang
Shi Feng
Daling Wang
Qian Li
Hinrich Schutze
21
0
0
28 May 2025
Budget-Adaptive Adapter Tuning in Orthogonal Subspaces for Continual Learning in LLMs
Budget-Adaptive Adapter Tuning in Orthogonal Subspaces for Continual Learning in LLMs
Zhiyi Wan
Wanrou Du
Liang Li
Miao Pan
Xiaoqi Qin
CLL
30
0
0
28 May 2025
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
Christopher Ormerod
15
0
0
28 May 2025
Limited Generalizability in Argument Mining: State-Of-The-Art Models Learn Datasets, Not Arguments
Limited Generalizability in Argument Mining: State-Of-The-Art Models Learn Datasets, Not Arguments
Marc Feger
Katarina Boland
Stefan Dietze
28
0
0
28 May 2025
Improving Continual Pre-training Through Seamless Data Packing
Improving Continual Pre-training Through Seamless Data Packing
Ruicheng Yin
Xuan Gao
Changze Lv
Xiaohua Wang
Xiaoqing Zheng
Xuanjing Huang
21
0
0
28 May 2025
ACE: Exploring Activation Cosine Similarity and Variance for Accurate and Calibration-Efficient LLM Pruning
ACE: Exploring Activation Cosine Similarity and Variance for Accurate and Calibration-Efficient LLM Pruning
Zhendong Mi
Zhenglun Kong
Geng Yuan
Shaoyi Huang
43
0
0
28 May 2025
MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning
MoRE: A Mixture of Low-Rank Experts for Adaptive Multi-Task Learning
Dacao Zhang
Kun Zhang
Shimao Chu
Le Wu
Xin Li
Si Wei
MoEALMOffRL
32
0
0
28 May 2025
Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets
Efficient Ensemble for Fine-tuning Language Models on Multiple Datasets
Dongyue Li
Ziniu Zhang
Lu Wang
Hongyang R. Zhang
28
1
0
28 May 2025
Update Your Transformer to the Latest Release: Re-Basin of Task Vectors
Update Your Transformer to the Latest Release: Re-Basin of Task Vectors
Filippo Rinaldi
Giacomo Capitani
Lorenzo Bonicelli
Donato Crisostomi
Federico Bolelli
E. Ficarra
Emanuele Rodolà
Simone Calderara
Angelo Porrello
12
0
0
28 May 2025
Revisiting Bayesian Model Averaging in the Era of Foundation Models
Revisiting Bayesian Model Averaging in the Era of Foundation Models
Mijung Park
UQCVMoMe
17
0
0
28 May 2025
Advancing Expert Specialization for Better MoE
Advancing Expert Specialization for Better MoE
Hongcan Guo
Haolang Lu
Guoshun Nan
Bolun Chu
Jialin Zhuang
Yuan Yang
Wenhao Che
Sicong Leng
Qimei Cui
Xudong Jiang
MoEMoMe
84
0
0
28 May 2025
Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging
Unraveling LoRA Interference: Orthogonal Subspaces for Robust Model Merging
Haobo Zhang
Jiayu Zhou
MoMe
40
0
0
28 May 2025
How Humans and LLMs Organize Conceptual Knowledge: Exploring Subordinate Categories in Italian
How Humans and LLMs Organize Conceptual Knowledge: Exploring Subordinate Categories in Italian
Andrea Pedrotti
Giulia Rambelli
Caterina Villani
Marianna Bolognesi
19
0
0
27 May 2025
DLP: Dynamic Layerwise Pruning in Large Language Models
DLP: Dynamic Layerwise Pruning in Large Language Models
Yuli Chen
B. Cheng
Jiale Han
Yingying Zhang
Yingting Li
Shuhao Zhang
42
0
0
27 May 2025
Leaner Transformers: More Heads, Less Depth
Leaner Transformers: More Heads, Less Depth
Hemanth Saratchandran
Damien Teney
Simon Lucey
22
0
0
27 May 2025
LLMs are Frequency Pattern Learners in Natural Language Inference
LLMs are Frequency Pattern Learners in Natural Language Inference
Liang Cheng
Zhaowei Wang
Mark Steedman
36
0
0
27 May 2025
SHE-LoRA: Selective Homomorphic Encryption for Federated Tuning with Heterogeneous LoRA
SHE-LoRA: Selective Homomorphic Encryption for Federated Tuning with Heterogeneous LoRA
Jianmin Liu
Li Yan
Borui Li
Lei Yu
Chao Shen
21
0
0
27 May 2025
AutoSGD: Automatic Learning Rate Selection for Stochastic Gradient Descent
AutoSGD: Automatic Learning Rate Selection for Stochastic Gradient Descent
Nikola Surjanovic
Alexandre Bouchard-Côté
Trevor Campbell
25
0
0
27 May 2025
Research Community Perspectives on "Intelligence" and Large Language Models
Research Community Perspectives on "Intelligence" and Large Language Models
Bertram Højer
Terne Sasha Thorn Jakobsen
Anna Rogers
Stefan Heinrich
41
0
0
27 May 2025
Information-Theoretic Complementary Prompts for Improved Continual Text Classification
Information-Theoretic Complementary Prompts for Improved Continual Text Classification
Duzhen Zhang
Yong Ren
Chenxing Li
Dong Yu
Tielin Zhang
CLLVLM
93
0
0
27 May 2025
LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
LayerIF: Estimating Layer Quality for Large Language Models using Influence Functions
Hadi Askari
Shivanshu Gupta
Fei Wang
Anshuman Chhabra
Muhao Chen
TDI
41
0
0
27 May 2025
We Need to Measure Data Diversity in NLP -- Better and Broader
We Need to Measure Data Diversity in NLP -- Better and Broader
Dong Nguyen
Esther Ploeger
43
0
0
26 May 2025
DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response
DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response
Bilel Cherif
Tamás Bisztray
Richard A. Dubniczky
Aaesha Aldahmani
Saeed Alshehhi
Norbert Tihanyi
ELM
24
0
0
26 May 2025
Learning to Select In-Context Demonstration Preferred by Large Language Model
Learning to Select In-Context Demonstration Preferred by Large Language Model
Zheng Zhang
Shaocheng Lan
Lei Song
Jiang Bian
Yexin Li
Kan Ren
24
0
0
26 May 2025
ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models
ExAnte: A Benchmark for Ex-Ante Inference in Large Language Models
Yachuan Liu
Xiaochun Wei
Lin Shi
Xinnuo Li
Bohan Zhang
Paramveer S. Dhillon
Qiaozhu Mei
55
0
0
26 May 2025
Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning
Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning
Jaehun Jung
Seungju Han
Ximing Lu
Skyler Hallinan
David Acuna
Shrimai Prabhumoye
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
Yejin Choi
SyDa
11
1
0
26 May 2025
Parameter-Efficient Fine-Tuning with Column Space Projection
Parameter-Efficient Fine-Tuning with Column Space Projection
Junseo Hwang
Wonguk Cho
Taesup Kim
48
0
0
26 May 2025
Rethinking the Understanding Ability across LLMs through Mutual Information
Rethinking the Understanding Ability across LLMs through Mutual Information
Shaojie Wang
Sirui Ding
Na Zou
27
0
0
25 May 2025
Safety Alignment via Constrained Knowledge Unlearning
Safety Alignment via Constrained Knowledge Unlearning
Zesheng Shi
Yucheng Zhou
Jing Li
MUKELMAAML
68
2
0
24 May 2025
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
Yilang Zhang
Bingcong Li
G. Giannakis
236
1
0
24 May 2025
PLUMAGE: Probabilistic Low rank Unbiased Min Variance Gradient Estimator for Efficient Large Model Training
PLUMAGE: Probabilistic Low rank Unbiased Min Variance Gradient Estimator for Efficient Large Model Training
Matan Haroush
Daniel Soudry
181
0
0
23 May 2025
LCD: Advancing Extreme Low-Bit Clustering for Large Language Models via Knowledge Distillation
LCD: Advancing Extreme Low-Bit Clustering for Large Language Models via Knowledge Distillation
Fangxin Liu
Ning Yang
Junping Zhao
Tao Yang
Haibing Guan
Li Jiang
MQ
31
0
0
23 May 2025
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?
Takashi Ishida
Thanawat Lodkaew
Ikko Yamane
206
0
0
23 May 2025
TRIM: Achieving Extreme Sparsity with Targeted Row-wise Iterative Metric-driven Pruning
TRIM: Achieving Extreme Sparsity with Targeted Row-wise Iterative Metric-driven Pruning
Florentin Beck
William Rudman
Carsten Eickhoff
59
0
0
22 May 2025
Bayesian Optimization for Enhanced Language Models: Optimizing Acquisition Functions
Bayesian Optimization for Enhanced Language Models: Optimizing Acquisition Functions
Zishuo Bao
Yibo Liu
Changyutao Qiu
200
0
0
22 May 2025
Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation
Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation
Seamus Somerstep
Vinod Raman
Unique Subedi
Yuekai Sun
76
0
0
22 May 2025
EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scenarios
EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scenarios
Bin Xu
Yu Bai
Huashan Sun
Yiguan Lin
Siming Liu
Xinyue Liang
Yaolin Li
Yang Gao
Heyan Huang
AI4EdELM
198
0
0
22 May 2025
Understanding Differential Transformer Unchains Pretrained Self-Attentions
Understanding Differential Transformer Unchains Pretrained Self-Attentions
Chaerin Kong
Jiho Jang
Nojun Kwak
82
0
0
22 May 2025
SPaRC: A Spatial Pathfinding Reasoning Challenge
SPaRC: A Spatial Pathfinding Reasoning Challenge
Lars Benedikt Kaesberg
Jan Philip Wahle
Terry Ruas
Bela Gipp
LRM
57
0
0
22 May 2025
ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts
ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts
Dongwon Noh
Donghyeok Koh
Junghun Yuk
Gyuwan Kim
Jaeyong Lee
Kyungtae Lim
Cheoneum Park
ELM
71
0
0
22 May 2025
Transfer of Structural Knowledge from Synthetic Languages
Transfer of Structural Knowledge from Synthetic Languages
Mikhail Budnikov
Ivan Yamshchikov
59
0
0
21 May 2025
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing
Peng Wang
Biyu Zhou
Xuehai Tang
Jizhong Han
Songlin Hu
KELM
118
0
0
21 May 2025
MaxPoolBERT: Enhancing BERT Classification via Layer- and Token-Wise Aggregation
MaxPoolBERT: Enhancing BERT Classification via Layer- and Token-Wise Aggregation
Maike Behrendt
Stefan Sylvius Wagner
Stefan Harmeling
SSeg
168
0
0
21 May 2025
Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders
Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders
Agam Goyal
Vedant Rathi
William Yeh
Yian Wang
Yuen Chen
Hari Sundaram
100
0
0
20 May 2025
Enhancing LLMs via High-Knowledge Data Selection
Enhancing LLMs via High-Knowledge Data Selection
Feiyu Duan
Xuemiao Zhang
Sirui Wang
Haoran Que
Yuqi Liu
Wenge Rong
Xunliang Cai
227
0
0
20 May 2025
Low-Cost FlashAttention with Fused Exponential and Multiplication Hardware Operators
Low-Cost FlashAttention with Fused Exponential and Multiplication Hardware Operators
K. Alexandridis
Vasileios Titopoulos
G. Dimitrakopoulos
63
0
0
20 May 2025
BeamClean: Language Aware Embedding Reconstruction
BeamClean: Language Aware Embedding Reconstruction
Kaan Kale
Kyle Mylonakis
Jay Roberts
Sidhartha Roy
AAML
147
1
0
19 May 2025
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Mamba-Adaptor: State Space Model Adaptor for Visual Recognition
Fei Xie
Jiahao Nie
Yujin Tang
W. Zhang
Hongshen Zhao
Mamba
142
0
0
19 May 2025
Previous
12345...878889
Next