ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1804.07461
  4. Cited By
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
v1v2v3 (latest)

GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

20 April 2018
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
    ELM
ArXiv (abs)PDFHTML

Papers citing "GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"

50 / 4,447 papers shown
Title
RomeBERT: Robust Training of Multi-Exit BERT
RomeBERT: Robust Training of Multi-Exit BERT
Shijie Geng
Peng Gao
Zuohui Fu
Yongfeng Zhang
81
28
0
24 Jan 2021
WangchanBERTa: Pretraining transformer-based Thai Language Models
WangchanBERTa: Pretraining transformer-based Thai Language Models
Lalita Lowphansirikul
Charin Polpanumas
Nawat Jantrakulchai
Sarana Nutanong
58
76
0
24 Jan 2021
Debiasing Pre-trained Contextualised Embeddings
Debiasing Pre-trained Contextualised Embeddings
Masahiro Kaneko
Danushka Bollegala
269
143
0
23 Jan 2021
The heads hypothesis: A unifying statistical approach towards
  understanding multi-headed attention in BERT
The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT
Madhura Pande
Aakriti Budhraja
Preksha Nema
Pratyush Kumar
Mitesh M. Khapra
66
19
0
22 Jan 2021
Distilling Large Language Models into Tiny and Effective Students using
  pQRNN
Distilling Large Language Models into Tiny and Effective Students using pQRNN
P. Kaliamoorthi
Aditya Siddhant
Edward Li
Melvin Johnson
MQ
60
17
0
21 Jan 2021
Adv-OLM: Generating Textual Adversaries via OLM
Adv-OLM: Generating Textual Adversaries via OLM
Vijit Malik
A. Bhat
Ashutosh Modi
134
6
0
21 Jan 2021
Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual
  Retrieval
Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval
Robert Litschko
Ivan Vulić
Simone Paolo Ponzetto
Goran Glavaš
74
23
0
21 Jan 2021
Zero-shot Generalization in Dialog State Tracking through Generative
  Question Answering
Zero-shot Generalization in Dialog State Tracking through Generative Question Answering
Shuyang Li
Jin Cao
Mukund Sridhar
Henghui Zhu
Shang-Wen Li
Wael Hamza
Julian McAuley
BDL
72
46
0
20 Jan 2021
Classifying Scientific Publications with BERT -- Is Self-Attention a
  Feature Selection Method?
Classifying Scientific Publications with BERT -- Is Self-Attention a Feature Selection Method?
Andrés García-Silva
José Manuél Gómez-Pérez
43
11
0
20 Jan 2021
Learning to Augment for Data-Scarce Domain BERT Knowledge Distillation
Learning to Augment for Data-Scarce Domain BERT Knowledge Distillation
Lingyun Feng
Minghui Qiu
Yaliang Li
Haitao Zheng
Ying Shen
90
10
0
20 Jan 2021
Situation and Behavior Understanding by Trope Detection on Films
Situation and Behavior Understanding by Trope Detection on Films
Chen-Hsi Chang
Hung-Ting Su
Jui-Heng Hsu
Yu-Siang Wang
Yu-Cheng Chang
Zhe-Yu Liu
Ya-Liang Chang
Wen-Feng Cheng
Ke-Jyun Wang
Winston H. Hsu
110
7
0
19 Jan 2021
Leveraging Local Variation in Data: Sampling and Weighting Schemes for
  Supervised Deep Learning
Leveraging Local Variation in Data: Sampling and Weighting Schemes for Supervised Deep Learning
Paul Novello
Gaël Poëtte
D. Lugato
P. Congedo
90
0
0
19 Jan 2021
Teach me how to Label: Labeling Functions from Natural Language with
  Text-to-text Transformers
Teach me how to Label: Labeling Functions from Natural Language with Text-to-text Transformers
Yannis Papanikolaou
30
0
0
18 Jan 2021
Joint Energy-based Model Training for Better Calibrated Natural Language
  Understanding Models
Joint Energy-based Model Training for Better Calibrated Natural Language Understanding Models
Tianxing He
Bryan McCann
Caiming Xiong
Ehsan Hosseini-Asl
60
22
0
18 Jan 2021
What Makes Good In-Context Examples for GPT-$3$?
What Makes Good In-Context Examples for GPT-333?
Jiachang Liu
Dinghan Shen
Yizhe Zhang
Bill Dolan
Lawrence Carin
Weizhu Chen
AAMLRALM
400
1,400
0
17 Jan 2021
Understanding in Artificial Intelligence
Understanding in Artificial Intelligence
S. Maetschke
D. M. Iraola
Pieter Barnard
Elaheh Shafieibavani
Peter Zhong
Ying Xu
Antonio Jimeno Yepes
ELMVLM
46
0
0
17 Jan 2021
Transformer-Based Models for Question Answering on COVID19
Transformer-Based Models for Question Answering on COVID19
Hillary Ngai
Yoona Park
John Chen
Mahboobeh Parsapoor
OOD
48
21
0
16 Jan 2021
To Understand Representation of Layer-aware Sequence Encoders as
  Multi-order-graph
To Understand Representation of Layer-aware Sequence Encoders as Multi-order-graph
Sufeng Duan
Hai Zhao
MILM
64
0
0
16 Jan 2021
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset:
  Collection, Insights and Improvements
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements
Lukas Stappen
Alice Baird
Lea Schumann
Björn Schuller
96
62
0
15 Jan 2021
KDLSQ-BERT: A Quantized Bert Combining Knowledge Distillation with
  Learned Step Size Quantization
KDLSQ-BERT: A Quantized Bert Combining Knowledge Distillation with Learned Step Size Quantization
Jing Jin
Cai Liang
Tiancheng Wu
Li Zou
Zhiliang Gan
MQ
59
27
0
15 Jan 2021
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal
  Classification Paradigm
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm
Akshay Krishna Sheshadri
Anvesh Rao Vijjini
S. Kharbanda
43
8
0
14 Jan 2021
Benchmarking Simulation-Based Inference
Benchmarking Simulation-Based Inference
Jan-Matthis Lueckmann
Jan Boelts
David S. Greenberg
P. J. Gonçalves
Jakob H. Macke
267
198
0
12 Jan 2021
Of Non-Linearity and Commutativity in BERT
Of Non-Linearity and Commutativity in BERT
Sumu Zhao
Damian Pascual
Gino Brunner
Roger Wattenhofer
105
17
0
12 Jan 2021
Switch Transformers: Scaling to Trillion Parameter Models with Simple
  and Efficient Sparsity
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
W. Fedus
Barret Zoph
Noam M. Shazeer
MoE
156
2,247
0
11 Jan 2021
BERT & Family Eat Word Salad: Experiments with Text Understanding
BERT & Family Eat Word Salad: Experiments with Text Understanding
Ashim Gupta
Giorgi Kvernadze
Vivek Srikumar
260
73
0
10 Jan 2021
Applying Transfer Learning for Improving Domain-Specific Search
  Experience Using Query to Question Similarity
Applying Transfer Learning for Improving Domain-Specific Search Experience Using Query to Question Similarity
Ankush Chopra
S. Agrawal
Sohom Ghosh
RALM
21
4
0
07 Jan 2021
I-BERT: Integer-only BERT Quantization
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
179
354
0
05 Jan 2021
CDLM: Cross-Document Language Modeling
CDLM: Cross-Document Language Modeling
Avi Caciularu
Arman Cohan
Iz Beltagy
Matthew E. Peters
Arie Cattan
Ido Dagan
VLM
75
33
0
02 Jan 2021
Which Linguist Invented the Lightbulb? Presupposition Verification for
  Question-Answering
Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering
Najoung Kim
Ellie Pavlick
Burcu Karagol Ayan
Deepak Ramachandran
159
48
0
02 Jan 2021
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and
  Improving Models
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models
Tongshuang Wu
Marco Tulio Ribeiro
Jeffrey Heer
Daniel S. Weld
142
251
0
01 Jan 2021
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource
  Language Understanding Evaluation in Bangla
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Kazi Samin Mubasshir
Md. Saiful Islam
Anindya Iqbal
M. Rahman
Rifat Shahriyar
SSLVLM
101
180
0
01 Jan 2021
On Explaining Your Explanations of BERT: An Empirical Study with
  Sequence Classification
On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification
Zhengxuan Wu
Desmond C. Ong
78
22
0
01 Jan 2021
WARP: Word-level Adversarial ReProgramming
WARP: Word-level Adversarial ReProgramming
Karen Hambardzumyan
Hrant Khachatrian
Jonathan May
AAML
349
354
0
01 Jan 2021
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets
Xiaohan Chen
Yu Cheng
Shuohang Wang
Zhe Gan
Zhangyang Wang
Jingjing Liu
131
100
0
31 Dec 2020
MiniLMv2: Multi-Head Self-Attention Relation Distillation for
  Compressing Pretrained Transformers
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers
Wenhui Wang
Hangbo Bao
Shaohan Huang
Li Dong
Furu Wei
MQ
139
275
0
31 Dec 2020
Making Pre-trained Language Models Better Few-shot Learners
Making Pre-trained Language Models Better Few-shot Learners
Tianyu Gao
Adam Fisch
Danqi Chen
435
1,987
0
31 Dec 2020
BinaryBERT: Pushing the Limit of BERT Quantization
BinaryBERT: Pushing the Limit of BERT Quantization
Haoli Bai
Wei Zhang
Lu Hou
Lifeng Shang
Jing Jin
Xin Jiang
Qun Liu
Michael Lyu
Irwin King
MQ
230
227
0
31 Dec 2020
Open Korean Corpora: A Practical Report
Open Korean Corpora: A Practical Report
Won Ik Cho
Sangwhan Moon
YoungSook Song
82
8
0
31 Dec 2020
Towards Zero-Shot Knowledge Distillation for Natural Language Processing
Towards Zero-Shot Knowledge Distillation for Natural Language Processing
Ahmad Rashid
Vasileios Lioutas
Abbas Ghaddar
Mehdi Rezagholizadeh
98
28
0
31 Dec 2020
CLEAR: Contrastive Learning for Sentence Representation
CLEAR: Contrastive Learning for Sentence Representation
Zhuofeng Wu
Sinong Wang
Jiatao Gu
Madian Khabsa
Fei Sun
Hao Ma
SSL
82
324
0
31 Dec 2020
Corrected CBOW Performs as well as Skip-gram
Corrected CBOW Performs as well as Skip-gram
Ozan Irsoy
Adrian Benton
K. Stratos
SyDa
34
12
0
30 Dec 2020
Out of Order: How Important Is The Sequential Order of Words in a
  Sentence in Natural Language Understanding Tasks?
Out of Order: How Important Is The Sequential Order of Words in a Sentence in Natural Language Understanding Tasks?
Thang M. Pham
Trung Bui
Long Mai
Anh Totti Nguyen
292
123
0
30 Dec 2020
Improving BERT with Syntax-aware Local Attention
Improving BERT with Syntax-aware Local Attention
Zhongli Li
Qingyu Zhou
Chao Li
Ke Xu
Yunbo Cao
102
45
0
30 Dec 2020
Accurate Word Representations with Universal Visual Guidance
Accurate Word Representations with Universal Visual Guidance
Zhuosheng Zhang
Haojie Yu
Hai Zhao
Rui Wang
Masao Utiyama
55
0
0
30 Dec 2020
CascadeBERT: Accelerating Inference of Pre-trained Language Models via
  Calibrated Complete Models Cascade
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
Lei Li
Yankai Lin
Deli Chen
Shuhuai Ren
Peng Li
Jie Zhou
Xu Sun
115
52
0
29 Dec 2020
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust
  Task-oriented Dialog Systems
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
Baolin Peng
Chunyuan Li
Zhu Zhang
Chenguang Zhu
Jinchao Li
Jianfeng Gao
69
50
0
29 Dec 2020
BURT: BERT-inspired Universal Representation from Learning Meaningful
  Segment
BURT: BERT-inspired Universal Representation from Learning Meaningful Segment
Yian Li
Hai Zhao
SSL
37
0
0
28 Dec 2020
ALP-KD: Attention-Based Layer Projection for Knowledge Distillation
ALP-KD: Attention-Based Layer Projection for Knowledge Distillation
Peyman Passban
Yimeng Wu
Mehdi Rezagholizadeh
Qun Liu
87
124
0
27 Dec 2020
SG-Net: Syntax Guided Transformer for Language Representation
SG-Net: Syntax Guided Transformer for Language Representation
Zhuosheng Zhang
Yuwei Wu
Junru Zhou
Sufeng Duan
Hai Zhao
Rui Wang
125
38
0
27 Dec 2020
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic
Muhammad Abdul-Mageed
AbdelRahim Elmadany
El Moatez Billah Nagoudi
VLM
131
466
0
27 Dec 2020
Previous
123...737475...878889
Next