v1v2v3 (latest)

GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

20 April 2018

Amanpreet Singh

Papers citing "GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding"

50 / 4,447 papers shown

Title
Inserting Information Bottlenecks for Attribution in Transformers Zhiying Jiang Raphael Tang Ji Xin Jimmy J. Lin 55 6 0 27 Dec 2020
Pre-Training a Language Model Without Human Language Cheng-Han Chiang Hung-yi Lee 71 13 0 22 Dec 2020
Undivided Attention: Are Intermediate Layers Necessary for BERT? S. N. Sridhar Anthony Sarah 66 15 0 22 Dec 2020
Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning Armen Aghajanyan Luke Zettlemoyer Sonal Gupta 110 577 1 22 Dec 2020
RealFormer: Transformer Likes Residual Attention Ruining He Anirudh Ravula Bhargav Kanagal Joshua Ainslie 76 111 0 21 Dec 2020
SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning Hanrui Wang Zhekai Zhang Song Han 156 399 0 17 Dec 2020
Continual Lifelong Learning in Natural Language Processing: A Survey Magdalena Biesialska Katarzyna Biesialska Marta R. Costa-jussá KELM CLL 100 222 0 17 Dec 2020
MASKER: Masked Keyword Regularization for Reliable Text Classification S. Moon Sangwoo Mo Kimin Lee Jaeho Lee Jinwoo Shin 120 38 0 17 Dec 2020
Assessing COVID-19 Impacts on College Students via Automated Processing of Free-form Text Ravi Sharma Srivyshnavi Pagadala Pratool Bharti Sriram Chellappan Trine Schmidt Raj Goyal 28 7 0 17 Dec 2020
Revisiting Linformer with a modified self-attention with linear complexity Madhusudan Verma 37 8 0 16 Dec 2020
Learning from Mistakes: Using Mis-predictions as Harm Alerts in Language Pre-Training Chen Xing Wenhao Liu Caiming Xiong 31 0 0 16 Dec 2020
Pre-Training Transformers as Energy-Based Cloze Models Kevin Clark Minh-Thang Luong Quoc V. Le Christopher D. Manning 77 80 0 15 Dec 2020
Modeling Heterogeneous Statistical Patterns in High-dimensional Data by Adversarial Distributions: An Unsupervised Generative Framework Han Zhang Wenhao Zheng C. L. Philip Chen Kevin Gao Yao Hu Ling Huang Wenyuan Xu AAML 30 1 0 15 Dec 2020
Writing Polishment with Simile: Task, Dataset and A Neural Approach Jiayi Zhang Zhi Cui Xiaoqiang Xia Yalong Guo Yanran Li Chen Wei Jianwei Cui 71 18 0 15 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning Demi Guo Alexander M. Rush Yoon Kim 92 406 0 14 Dec 2020
WILDS: A Benchmark of in-the-Wild Distribution Shifts Pang Wei Koh Shiori Sagawa Henrik Marklund Sang Michael Xie Marvin Zhang ... A. Kundaje Emma Pierson Sergey Levine Chelsea Finn Percy Liang OOD 331 1,452 0 14 Dec 2020
LRC-BERT: Latent-representation Contrastive Knowledge Distillation for Natural Language Understanding Hao Fu Shaojun Zhou Qihong Yang Junjie Tang Guiquan Liu Kaikui Liu Xiaolong Li 119 60 0 14 Dec 2020
Mask-Align: Self-Supervised Neural Word Alignment Chi Chen Maosong Sun Yang Liu 46 34 0 13 Dec 2020
Reinforced Multi-Teacher Selection for Knowledge Distillation Fei Yuan Linjun Shou J. Pei Wutao Lin Ming Gong Yan Fu Daxin Jiang 71 124 0 11 Dec 2020
Improving Task-Agnostic BERT Distillation with Layer Mapping Search Xiaoqi Jiao Huating Chang Yichun Yin Lifeng Shang Xin Jiang Xiao Chen Linlin Li Fang Wang Qun Liu 49 12 0 11 Dec 2020
Infusing Finetuning with Semantic Dependencies Zhaofeng Wu Hao Peng Noah A. Smith 71 37 0 10 Dec 2020
Data and its (dis)contents: A survey of dataset development and use in machine learning research Amandalynne Paullada Inioluwa Deborah Raji Emily M. Bender Emily L. Denton A. Hanna 133 532 0 09 Dec 2020
What Meaning-Form Correlation Has to Compose With Timothee Mickus Timothée Bernard Denis Paperno 58 4 0 07 Dec 2020
Reference Knowledgeable Network for Machine Reading Comprehension Yilin Zhao Zhuosheng Zhang Hai Zhao 67 5 0 07 Dec 2020
An Empirical Survey of Unsupervised Text Representation Methods on Twitter Data Lili Wang Chongyang Gao Jason W. Wei Weicheng Ma Ruibo Liu Soroush Vosoughi 34 15 0 07 Dec 2020
Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning Daniel Grießhaber J. Maucher Ngoc Thang Vu 85 46 0 04 Dec 2020
What Makes a Star Teacher? A Hierarchical BERT Model for Evaluating Teacher's Performance in Online Education Wen Wang Honglei Zhuang Michael X. Zhou Hanyu Liu Beibei Li 26 7 0 03 Dec 2020
Circles are like Ellipses, or Ellipses are like Circles? Measuring the Degree of Asymmetry of Static and Contextual Embeddings and the Implications to Representation Learning Wei Zhang Murray Campbell Yang Yu Yara Rizk 42 0 0 03 Dec 2020
DERAIL: Diagnostic Environments for Reward And Imitation Learning Pedro Freire Adam Gleave Sam Toyer Stuart J. Russell OffRL 65 6 0 02 Dec 2020
Learning from others' mistakes: Avoiding dataset biases without modeling them Victor Sanh Thomas Wolf Yonatan Belinkov Alexander M. Rush 96 116 0 02 Dec 2020
EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference Thierry Tambe Coleman Hooper Lillian Pentecost Tianyu Jia En-Yu Yang ... Victor Sanh P. Whatmough Alexander M. Rush David Brooks Gu-Yeon Wei 112 126 0 28 Nov 2020
Transformer Query-Target Knowledge Discovery (TEND): Drug Discovery from CORD-19 Leo K. Tam Xiaosong Wang Daguang Xu MedIm 45 2 0 28 Nov 2020
An Investigation of Language Model Interpretability via Sentence Editing Samuel Stevens Yu-Chuan Su LRM 39 9 0 28 Nov 2020
Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup Cheng Yang Shengnan Wang Chao Yang Yuechuan Li Ru He Jingqiao Zhang 85 25 0 27 Nov 2020
Two Stage Transformer Model for COVID-19 Fake News Detection and Fact Checking Rutvik Vijjali Prathyush Potluri S. Kumar Sundeep Teki MedIm 74 75 0 26 Nov 2020
GLGE: A New General Language Generation Evaluation Benchmark Dayiheng Liu Yu Yan Yeyun Gong Weizhen Qi Hang Zhang ... Jiancheng Lv Ruofei Zhang Winnie Wu Ming Zhou Nan Duan ELM 113 66 0 24 Nov 2020
A Sweet Rabbit Hole by DARCY: Using Honeypots to Detect Universal Trigger's Adversarial Attacks Thai Le Noseong Park Dongwon Lee 167 24 0 20 Nov 2020
Data-Informed Global Sparseness in Attention Mechanisms for Deep Neural Networks Ileana Rugina Rumen Dangovski L. Jing Preslav Nakov Marin Soljacic 63 0 0 20 Nov 2020
EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications Minghui Qiu Peng Li Chengyu Wang Hanjie Pan Yaliang Li ... Jun Yang Yaliang Li Jun Huang Deng Cai Wei Lin VLM SyDa 109 20 0 18 Nov 2020
A Definition and a Test for Human-Level Artificial Intelligence Deokgun Park Md Ashaduzzaman Rubel Mondol Aishwarya Pothula Mazharul Islam VLM 62 4 0 18 Nov 2020
Out-of-Task Training for Dialog State Tracking Models Michael Heck Carel van Niekerk Nurul Lubis Christian Geishauser Hsien-Chin Lin Marco Moresi Milica Gavsić 44 3 0 18 Nov 2020
Predictions For Pre-training Language Models Tonglei Guo 16 0 0 18 Nov 2020
Learning from Task Descriptions Orion Weller Nicholas Lourie Matt Gardner Matthew E. Peters 113 91 0 16 Nov 2020
Comparative Probing of Lexical Semantics Theories for Cognitive Plausibility and Technological Usefulness António Branco João Rodrigues M. Salawa Ruben Branco Chakaveh Saedi 51 6 0 16 Nov 2020
doc2dial: A Goal-Oriented Document-Grounded Dialogue Dataset Song Feng H. Wan R. Chulaka Gunasekara S. Patel Sachindra Joshi Luis A. Lastras 85 122 0 12 Nov 2020
Towards Preemptive Detection of Depression and Anxiety in Twitter David Owen Jose Camacho-Collados Luis Espinosa-Anke 32 25 0 10 Nov 2020
When Do You Need Billions of Words of Pretraining Data? Yian Zhang Alex Warstadt Haau-Sing Li Samuel R. Bowman 73 141 0 10 Nov 2020
Natural Language Inference in Context -- Investigating Contextual Reasoning over Long Texts Hanmeng Liu Leyang Cui Jian Liu Yue Zhang ReLM LRM 78 44 0 10 Nov 2020
An Analysis of Dataset Overlap on Winograd-Style Tasks Ali Emami Adam Trischler Kaheer Suleman Jackie C.K. Cheung 81 22 0 09 Nov 2020
Low-Resource Adaptation of Neural NLP Models Farhad Nooralahzadeh 85 0 0 09 Nov 2020