1804.07461
v1
v2
v3 (latest)
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
20 April 2018
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
Papers citing
"GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"
50 / 4,447 papers shown
Title
Inserting Information Bottlenecks for Attribution in Transformers
Zhiying Jiang
Raphael Tang
Ji Xin
Jimmy J. Lin
55
6
0
27 Dec 2020
Pre-Training a Language Model Without Human Language
Cheng-Han Chiang
Hung-yi Lee
71
13
0
22 Dec 2020
Undivided Attention: Are Intermediate Layers Necessary for BERT?
S. N. Sridhar
Anthony Sarah
66
15
0
22 Dec 2020
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning
Armen Aghajanyan
Luke Zettlemoyer
Sonal Gupta
110
577
1
22 Dec 2020
RealFormer: Transformer Likes Residual Attention
Ruining He
Anirudh Ravula
Bhargav Kanagal
Joshua Ainslie
76
110
0
21 Dec 2020
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Hanrui Wang
Zhekai Zhang
Song Han
156
399
0
17 Dec 2020
Continual Lifelong Learning in Natural Language Processing: A Survey
Magdalena Biesialska
Katarzyna Biesialska
Marta R. Costa-jussá
KELM
CLL
100
222
0
17 Dec 2020
MASKER: Masked Keyword Regularization for Reliable Text Classification
S. Moon
Sangwoo Mo
Kimin Lee
Jaeho Lee
Jinwoo Shin
120
38
0
17 Dec 2020
Assessing COVID-19 Impacts on College Students via Automated Processing of Free-form Text
Ravi Sharma
Srivyshnavi Pagadala
Pratool Bharti
Sriram Chellappan
Trine Schmidt
Raj Goyal
28
7
0
17 Dec 2020
Revisiting Linformer with a modified self-attention with linear complexity
Madhusudan Verma
37
8
0
16 Dec 2020
Learning from Mistakes: Using Mis-predictions as Harm Alerts in Language Pre-Training
Chen Xing
Wenhao Liu
Caiming Xiong
31
0
0
16 Dec 2020
Pre-Training Transformers as Energy-Based Cloze Models
Kevin Clark
Minh-Thang Luong
Quoc V. Le
Christopher D. Manning
77
80
0
15 Dec 2020
Modeling Heterogeneous Statistical Patterns in High-dimensional Data by Adversarial Distributions: An Unsupervised Generative Framework
Han Zhang
Wenhao Zheng
C. L. Philip Chen
Kevin Gao
Yao Hu
Ling Huang
Wenyuan Xu
AAML
30
1
0
15 Dec 2020
Writing Polishment with Simile: Task, Dataset and A Neural Approach
Jiayi Zhang
Zhi Cui
Xiaoqiang Xia
Yalong Guo
Yanran Li
Chen Wei
Jianwei Cui
71
18
0
15 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning
Demi Guo
Alexander M. Rush
Yoon Kim
92
406
0
14 Dec 2020
WILDS: A Benchmark of in-the-Wild Distribution Shifts
Pang Wei Koh
Shiori Sagawa
Henrik Marklund
Sang Michael Xie
Marvin Zhang
...
A. Kundaje
Emma Pierson
Sergey Levine
Chelsea Finn
Percy Liang
OOD
328
1,452
0
14 Dec 2020
LRC-BERT: Latent-representation Contrastive Knowledge Distillation for Natural Language Understanding
Hao Fu
Shaojun Zhou
Qihong Yang
Junjie Tang
Guiquan Liu
Kaikui Liu
Xiaolong Li
119
60
0
14 Dec 2020
Mask-Align: Self-Supervised Neural Word Alignment
Chi Chen
Maosong Sun
Yang Liu
46
34
0
13 Dec 2020
Reinforced Multi-Teacher Selection for Knowledge Distillation
Fei Yuan
Linjun Shou
J. Pei
Wutao Lin
Ming Gong
Yan Fu
Daxin Jiang
71
124
0
11 Dec 2020
Improving Task-Agnostic BERT Distillation with Layer Mapping Search
Xiaoqi Jiao
Huating Chang
Yichun Yin
Lifeng Shang
Xin Jiang
Xiao Chen
Linlin Li
Fang Wang
Qun Liu
49
12
0
11 Dec 2020
Infusing Finetuning with Semantic Dependencies
Zhaofeng Wu
Hao Peng
Noah A. Smith
71
37
0
10 Dec 2020
Data and its (dis)contents: A survey of dataset development and use in machine learning research
Amandalynne Paullada
Inioluwa Deborah Raji
Emily M. Bender
Emily L. Denton
A. Hanna
133
532
0
09 Dec 2020
What Meaning-Form Correlation Has to Compose With
Timothee Mickus
Timothée Bernard
Denis Paperno
58
4
0
07 Dec 2020
Reference Knowledgeable Network for Machine Reading Comprehension
Yilin Zhao
Zhuosheng Zhang
Hai Zhao
67
5
0
07 Dec 2020
An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data
Lili Wang
Chongyang Gao
Jason W. Wei
Weicheng Ma
Ruibo Liu
Soroush Vosoughi
34
15
0
07 Dec 2020
Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning
Daniel Grießhaber
J. Maucher
Ngoc Thang Vu
85
46
0
04 Dec 2020
What Makes a Star Teacher? A Hierarchical BERT Model for Evaluating Teacher's Performance in Online Education
Wen Wang
Honglei Zhuang
Michael X. Zhou
Hanyu Liu
Beibei Li
26
7
0
03 Dec 2020
Circles are like Ellipses, or Ellipses are like Circles? Measuring the Degree of Asymmetry of Static and Contextual Embeddings and the Implications to Representation Learning
Wei Zhang
Murray Campbell
Yang Yu
Yara Rizk
42
0
0
03 Dec 2020
DERAIL: Diagnostic Environments for Reward And Imitation Learning
Pedro Freire
Adam Gleave
Sam Toyer
Stuart J. Russell
OffRL
65
6
0
02 Dec 2020
Learning from others' mistakes: Avoiding dataset biases without modeling them
Victor Sanh
Thomas Wolf
Yonatan Belinkov
Alexander M. Rush
96
116
0
02 Dec 2020
EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference
Thierry Tambe
Coleman Hooper
Lillian Pentecost
Tianyu Jia
En-Yu Yang
...
Victor Sanh
P. Whatmough
Alexander M. Rush
David Brooks
Gu-Yeon Wei
112
126
0
28 Nov 2020
Transformer Query-Target Knowledge Discovery (TEND): Drug Discovery from CORD-19
Leo K. Tam
Xiaosong Wang
Daguang Xu
MedIm
45
2
0
28 Nov 2020
An Investigation of Language Model Interpretability via Sentence Editing
Samuel Stevens
Yu-Chuan Su
LRM
39
9
0
28 Nov 2020
Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup
Cheng Yang
Shengnan Wang
Chao Yang
Yuechuan Li
Ru He
Jingqiao Zhang
85
25
0
27 Nov 2020
Two Stage Transformer Model for COVID-19 Fake News Detection and Fact Checking
Rutvik Vijjali
Prathyush Potluri
S. Kumar
Sundeep Teki
MedIm
74
75
0
26 Nov 2020
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
113
66
0
24 Nov 2020
A Sweet Rabbit Hole by DARCY: Using Honeypots to Detect Universal Trigger's Adversarial Attacks
Thai Le
Noseong Park
Dongwon Lee
167
24
0
20 Nov 2020
Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks
Ileana Rugina
Rumen Dangovski
L. Jing
Preslav Nakov
Marin Soljacic
63
0
0
20 Nov 2020
EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications
Minghui Qiu
Peng Li
Chengyu Wang
Hanjie Pan
Yaliang Li
...
Jun Yang
Yaliang Li
Jun Huang
Deng Cai
Wei Lin
VLM
SyDa
109
20
0
18 Nov 2020
A Definition and a Test for Human-Level Artificial Intelligence
Deokgun Park
Md Ashaduzzaman Rubel Mondol
Aishwarya Pothula
Mazharul Islam
VLM
62
4
0
18 Nov 2020
Out-of-Task Training for Dialog State Tracking Models
Michael Heck
Carel van Niekerk
Nurul Lubis
Christian Geishauser
Hsien-Chin Lin
Marco Moresi
Milica Gašić
44
3
0
18 Nov 2020
Predictions For Pre-training Language Models
Tonglei Guo
16
0
0
18 Nov 2020
Learning from Task Descriptions
Orion Weller
Nicholas Lourie
Matt Gardner
Matthew E. Peters
113
91
0
16 Nov 2020
Comparative Probing of Lexical Semantics Theories for Cognitive Plausibility and Technological Usefulness
António Branco
João Rodrigues
M. Salawa
Ruben Branco
Chakaveh Saedi
51
6
0
16 Nov 2020
doc2dial: A Goal-Oriented Document-Grounded Dialogue Dataset
Song Feng
H. Wan
R. Chulaka Gunasekara
S. Patel
Sachindra Joshi
Luis A. Lastras
85
122
0
12 Nov 2020
Towards Preemptive Detection of Depression and Anxiety in Twitter
David Owen
Jose Camacho-Collados
Luis Espinosa-Anke
32
25
0
10 Nov 2020
When Do You Need Billions of Words of Pretraining Data?
Yian Zhang
Alex Warstadt
Haau-Sing Li
Samuel R. Bowman
73
141
0
10 Nov 2020
Natural Language Inference in Context -- Investigating Contextual Reasoning over Long Texts
Hanmeng Liu
Leyang Cui
Jian Liu
Yue Zhang
ReLM
LRM
78
44
0
10 Nov 2020
An Analysis of Dataset Overlap on Winograd-Style Tasks
Ali Emami
Adam Trischler
Kaheer Suleman
Jackie C.K. Cheung
81
22
0
09 Nov 2020
Low-Resource Adaptation of Neural NLP Models
Farhad Nooralahzadeh
85
0
0
09 Nov 2020