Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1804.07461
Cited By
v1
v2
v3 (latest)
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
20 April 2018
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"
50 / 4,447 papers shown
Title
RomeBERT: Robust Training of Multi-Exit BERT
Shijie Geng
Peng Gao
Zuohui Fu
Yongfeng Zhang
81
28
0
24 Jan 2021
WangchanBERTa: Pretraining transformer-based Thai Language Models
Lalita Lowphansirikul
Charin Polpanumas
Nawat Jantrakulchai
Sarana Nutanong
58
76
0
24 Jan 2021
Debiasing Pre-trained Contextualised Embeddings
Masahiro Kaneko
Danushka Bollegala
269
143
0
23 Jan 2021
The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT
Madhura Pande
Aakriti Budhraja
Preksha Nema
Pratyush Kumar
Mitesh M. Khapra
66
19
0
22 Jan 2021
Distilling Large Language Models into Tiny and Effective Students using pQRNN
P. Kaliamoorthi
Aditya Siddhant
Edward Li
Melvin Johnson
MQ
60
17
0
21 Jan 2021
Adv-OLM: Generating Textual Adversaries via OLM
Vijit Malik
A. Bhat
Ashutosh Modi
134
6
0
21 Jan 2021
Evaluating Multilingual Text Encoders for Unsupervised Cross-Lingual Retrieval
Robert Litschko
Ivan Vulić
Simone Paolo Ponzetto
Goran Glavaš
74
23
0
21 Jan 2021
Zero-shot Generalization in Dialog State Tracking through Generative Question Answering
Shuyang Li
Jin Cao
Mukund Sridhar
Henghui Zhu
Shang-Wen Li
Wael Hamza
Julian McAuley
BDL
72
46
0
20 Jan 2021
Classifying Scientific Publications with BERT -- Is Self-Attention a Feature Selection Method?
Andrés García-Silva
José Manuél Gómez-Pérez
43
11
0
20 Jan 2021
Learning to Augment for Data-Scarce Domain BERT Knowledge Distillation
Lingyun Feng
Minghui Qiu
Yaliang Li
Haitao Zheng
Ying Shen
90
10
0
20 Jan 2021
Situation and Behavior Understanding by Trope Detection on Films
Chen-Hsi Chang
Hung-Ting Su
Jui-Heng Hsu
Yu-Siang Wang
Yu-Cheng Chang
Zhe-Yu Liu
Ya-Liang Chang
Wen-Feng Cheng
Ke-Jyun Wang
Winston H. Hsu
110
7
0
19 Jan 2021
Leveraging Local Variation in Data: Sampling and Weighting Schemes for Supervised Deep Learning
Paul Novello
Gaël Poëtte
D. Lugato
P. Congedo
90
0
0
19 Jan 2021
Teach me how to Label: Labeling Functions from Natural Language with Text-to-text Transformers
Yannis Papanikolaou
30
0
0
18 Jan 2021
Joint Energy-based Model Training for Better Calibrated Natural Language Understanding Models
Tianxing He
Bryan McCann
Caiming Xiong
Ehsan Hosseini-Asl
60
22
0
18 Jan 2021
What Makes Good In-Context Examples for GPT-
3
3
3
?
Jiachang Liu
Dinghan Shen
Yizhe Zhang
Bill Dolan
Lawrence Carin
Weizhu Chen
AAML
RALM
400
1,400
0
17 Jan 2021
Understanding in Artificial Intelligence
S. Maetschke
D. M. Iraola
Pieter Barnard
Elaheh Shafieibavani
Peter Zhong
Ying Xu
Antonio Jimeno Yepes
ELM
VLM
46
0
0
17 Jan 2021
Transformer-Based Models for Question Answering on COVID19
Hillary Ngai
Yoona Park
John Chen
Mahboobeh Parsapoor
OOD
48
21
0
16 Jan 2021
To Understand Representation of Layer-aware Sequence Encoders as Multi-order-graph
Sufeng Duan
Hai Zhao
MILM
64
0
0
16 Jan 2021
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements
Lukas Stappen
Alice Baird
Lea Schumann
Björn Schuller
96
62
0
15 Jan 2021
KDLSQ-BERT: A Quantized Bert Combining Knowledge Distillation with Learned Step Size Quantization
Jing Jin
Cai Liang
Tiancheng Wu
Li Zou
Zhiliang Gan
MQ
59
27
0
15 Jan 2021
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm
Akshay Krishna Sheshadri
Anvesh Rao Vijjini
S. Kharbanda
43
8
0
14 Jan 2021
Benchmarking Simulation-Based Inference
Jan-Matthis Lueckmann
Jan Boelts
David S. Greenberg
P. J. Gonçalves
Jakob H. Macke
267
198
0
12 Jan 2021
Of Non-Linearity and Commutativity in BERT
Sumu Zhao
Damian Pascual
Gino Brunner
Roger Wattenhofer
105
17
0
12 Jan 2021
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
W. Fedus
Barret Zoph
Noam M. Shazeer
MoE
156
2,247
0
11 Jan 2021
BERT & Family Eat Word Salad: Experiments with Text Understanding
Ashim Gupta
Giorgi Kvernadze
Vivek Srikumar
260
73
0
10 Jan 2021
Applying Transfer Learning for Improving Domain-Specific Search Experience Using Query to Question Similarity
Ankush Chopra
S. Agrawal
Sohom Ghosh
RALM
21
4
0
07 Jan 2021
I-BERT: Integer-only BERT Quantization
Sehoon Kim
A. Gholami
Z. Yao
Michael W. Mahoney
Kurt Keutzer
MQ
179
354
0
05 Jan 2021
CDLM: Cross-Document Language Modeling
Avi Caciularu
Arman Cohan
Iz Beltagy
Matthew E. Peters
Arie Cattan
Ido Dagan
VLM
75
33
0
02 Jan 2021
Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering
Najoung Kim
Ellie Pavlick
Burcu Karagol Ayan
Deepak Ramachandran
159
48
0
02 Jan 2021
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models
Tongshuang Wu
Marco Tulio Ribeiro
Jeffrey Heer
Daniel S. Weld
142
251
0
01 Jan 2021
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Kazi Samin Mubasshir
Md. Saiful Islam
Anindya Iqbal
M. Rahman
Rifat Shahriyar
SSL
VLM
101
180
0
01 Jan 2021
On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification
Zhengxuan Wu
Desmond C. Ong
78
22
0
01 Jan 2021
WARP: Word-level Adversarial ReProgramming
Karen Hambardzumyan
Hrant Khachatrian
Jonathan May
AAML
349
354
0
01 Jan 2021
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets
Xiaohan Chen
Yu Cheng
Shuohang Wang
Zhe Gan
Zhangyang Wang
Jingjing Liu
131
100
0
31 Dec 2020
MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers
Wenhui Wang
Hangbo Bao
Shaohan Huang
Li Dong
Furu Wei
MQ
139
275
0
31 Dec 2020
Making Pre-trained Language Models Better Few-shot Learners
Tianyu Gao
Adam Fisch
Danqi Chen
435
1,987
0
31 Dec 2020
BinaryBERT: Pushing the Limit of BERT Quantization
Haoli Bai
Wei Zhang
Lu Hou
Lifeng Shang
Jing Jin
Xin Jiang
Qun Liu
Michael Lyu
Irwin King
MQ
230
227
0
31 Dec 2020
Open Korean Corpora: A Practical Report
Won Ik Cho
Sangwhan Moon
YoungSook Song
82
8
0
31 Dec 2020
Towards Zero-Shot Knowledge Distillation for Natural Language Processing
Ahmad Rashid
Vasileios Lioutas
Abbas Ghaddar
Mehdi Rezagholizadeh
98
28
0
31 Dec 2020
CLEAR: Contrastive Learning for Sentence Representation
Zhuofeng Wu
Sinong Wang
Jiatao Gu
Madian Khabsa
Fei Sun
Hao Ma
SSL
82
324
0
31 Dec 2020
Corrected CBOW Performs as well as Skip-gram
Ozan Irsoy
Adrian Benton
K. Stratos
SyDa
34
12
0
30 Dec 2020
Out of Order: How Important Is The Sequential Order of Words in a Sentence in Natural Language Understanding Tasks?
Thang M. Pham
Trung Bui
Long Mai
Anh Totti Nguyen
292
123
0
30 Dec 2020
Improving BERT with Syntax-aware Local Attention
Zhongli Li
Qingyu Zhou
Chao Li
Ke Xu
Yunbo Cao
102
45
0
30 Dec 2020
Accurate Word Representations with Universal Visual Guidance
Zhuosheng Zhang
Haojie Yu
Hai Zhao
Rui Wang
Masao Utiyama
55
0
0
30 Dec 2020
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
Lei Li
Yankai Lin
Deli Chen
Shuhuai Ren
Peng Li
Jie Zhou
Xu Sun
115
52
0
29 Dec 2020
RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems
Baolin Peng
Chunyuan Li
Zhu Zhang
Chenguang Zhu
Jinchao Li
Jianfeng Gao
69
50
0
29 Dec 2020
BURT: BERT-inspired Universal Representation from Learning Meaningful Segment
Yian Li
Hai Zhao
SSL
37
0
0
28 Dec 2020
ALP-KD: Attention-Based Layer Projection for Knowledge Distillation
Peyman Passban
Yimeng Wu
Mehdi Rezagholizadeh
Qun Liu
87
124
0
27 Dec 2020
SG-Net: Syntax Guided Transformer for Language Representation
Zhuosheng Zhang
Yuwei Wu
Junru Zhou
Sufeng Duan
Hai Zhao
Rui Wang
125
38
0
27 Dec 2020
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic
Muhammad Abdul-Mageed
AbdelRahim Elmadany
El Moatez Billah Nagoudi
VLM
131
466
0
27 Dec 2020
Previous
1
2
3
...
73
74
75
...
87
88
89
Next