Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.03771
Cited By
v1
v2
v3
v4
v5 (latest)
HuggingFace's Transformers: State-of-the-art Natural Language Processing
9 October 2019
Thomas Wolf
Lysandre Debut
Victor Sanh
Julien Chaumond
Clement Delangue
Anthony Moi
Pierric Cistac
Tim Rault
Rémi Louf
Morgan Funtowicz
Joe Davison
Sam Shleifer
Patrick von Platen
Clara Ma
Yacine Jernite
J. Plu
Canwen Xu
Teven Le Scao
Sylvain Gugger
Mariama Drame
Quentin Lhoest
Alexander M. Rush
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Github (144926★)
Papers citing
"HuggingFace's Transformers: State-of-the-art Natural Language Processing"
50 / 515 papers shown
Title
Embed2Detect: Temporally Clustered Embedded Words for Event Detection in Social Media
Hansi Hettiarachchi
Mariam Adedoyin-Olowe
Jagdev Bhogal
M. Gaber
40
34
0
10 Jun 2020
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
Marius Mosbach
Maksym Andriushchenko
Dietrich Klakow
187
363
0
08 Jun 2020
BERT Loses Patience: Fast and Robust Inference with Early Exit
Wangchunshu Zhou
Canwen Xu
Tao Ge
Julian McAuley
Ke Xu
Furu Wei
63
343
0
07 Jun 2020
DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations
John Giorgi
Osvald Nitski
Bo Wang
Gary D. Bader
SSL
149
499
0
05 Jun 2020
Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2
V. Kieuvongngam
Bowen Tan
Yiming Niu
AI4MH
56
96
0
03 Jun 2020
CausaLM: Causal Model Explanation Through Counterfactual Language Models
Amir Feder
Nadav Oved
Uri Shalit
Roi Reichart
CML
LRM
161
162
0
27 May 2020
Living Machines: A study of atypical animacy
Mariona Coll Ardanuy
F. Nanni
K. Beelen
Kasra Hosseini
R. Ahnert
J. Lawrence
Katherine McDonough
Giorgia Tolfo
Daniel C. S. Wilson
Barbara McGillivray
56
20
0
22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
L. Rasmy
Yang Xiang
Z. Xie
Cui Tao
Degui Zhi
AI4MH
LM&MA
104
704
0
22 May 2020
On the Robustness of Language Encoders against Grammatical Errors
Fan Yin
Quanyu Long
Tao Meng
Kai-Wei Chang
81
35
0
12 May 2020
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
Douwe Kiela
Hamed Firooz
Aravind Mohan
Vedanuj Goswami
Amanpreet Singh
Pratik Ringshia
Davide Testuggine
109
611
0
10 May 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Marco Tulio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
ELM
219
1,110
0
08 May 2020
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi Zadeh
Isak Edo
Omar Mohamed Awad
Andreas Moshovos
MQ
72
190
0
08 May 2020
Blind Backdoors in Deep Learning Models
Eugene Bagdasaryan
Vitaly Shmatikov
AAML
FedML
SILM
160
310
0
08 May 2020
Moving Down the Long Tail of Word Sense Disambiguation with Gloss-Informed Biencoders
Terra Blevins
Luke Zettlemoyer
68
167
0
06 May 2020
Neural CRF Model for Sentence Alignment in Text Simplification
Chao Jiang
Mounica Maddela
Wuwei Lan
Yang Zhong
Wenyuan Xu
105
162
0
05 May 2020
Quantifying Attention Flow in Transformers
Samira Abnar
Willem H. Zuidema
177
808
0
02 May 2020
ProtoQA: A Question Answering Dataset for Prototypical Common-Sense Reasoning
Michael Boratko
Xiang Lorraine Li
Rajarshi Das
Timothy J. O'Gorman
Daniel Le
Andrew McCallum
122
58
0
02 May 2020
Contrastive Self-Supervised Learning for Commonsense Reasoning
T. Klein
Moin Nabi
LRM
SSL
81
63
0
02 May 2020
When BERT Plays the Lottery, All Tickets Are Winning
Sai Prasanna
Anna Rogers
Anna Rumshisky
MILM
88
187
0
01 May 2020
Self-supervised Knowledge Triplet Learning for Zero-shot Question Answering
Pratyay Banerjee
Chitta Baral
90
65
0
01 May 2020
MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer
Jonas Pfeiffer
Ivan Vulić
Iryna Gurevych
Sebastian Ruder
146
631
0
30 Apr 2020
Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society
Firoj Alam
Shaden Shaar
Fahim Dalvi
Hassan Sajjad
Alex Nikolov
...
Tommaso Caselli
Gijs Danoe
Friso Stolk
Britt Bruntink
Preslav Nakov
118
160
0
30 Apr 2020
Fact or Fiction: Verifying Scientific Claims
David Wadden
Shanchuan Lin
Kyle Lo
Lucy Lu Wang
Madeleine van Zuylen
Arman Cohan
Hannaneh Hajishirzi
HAI
197
465
0
30 Apr 2020
Question Rewriting for Conversational Question Answering
Svitlana Vakulenko
Shayne Longpre
Zhucheng Tu
R. Anantha
92
180
0
30 Apr 2020
Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning
Alexandre Tamborrino
Nicola Pellicanò
B. Pannier
Pascal Voitot
Louise Naudin
LRM
81
63
0
29 Apr 2020
MAVEN: A Massive General Domain Event Detection Dataset
Xiaozhi Wang
Ziqi Wang
Xu Han
Wangyi Jiang
Rong Han
Zhiyuan Liu
Juan-Zi Li
Peng Li
Yankai Lin
Jie Zhou
69
189
0
28 Apr 2020
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models
Mengjie Zhao
Tao R. Lin
Fei Mi
Martin Jaggi
Hinrich Schütze
77
121
0
26 Apr 2020
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLM
AI4CE
CLL
207
2,452
0
23 Apr 2020
The Ivory Tower Lost: How College Students Respond Differently than the General Public to the COVID-19 Pandemic
Viet-An Duong
Phu Pham
Tongyu Yang
Yu Wang
Jiebo Luo
AI4CE
45
94
0
21 Apr 2020
Knowledge-Driven Distractor Generation for Cloze-style Multiple Choice Questions
Siyu Ren
Kenny Q. Zhu
48
48
0
21 Apr 2020
Pretrained Transformers Improve Out-of-Distribution Robustness
Dan Hendrycks
Xiaoyuan Liu
Eric Wallace
Adam Dziedzic
R. Krishnan
Basel Alomair
OOD
221
436
0
13 Apr 2020
CLUE: A Chinese Language Understanding Evaluation Benchmark
Liang Xu
Hai Hu
Xuanwei Zhang
Lu Li
Chenjie Cao
...
Cong Yue
Xinrui Zhang
Zhen-Yi Yang
Kyle Richardson
Zhenzhong Lan
ELM
110
388
0
13 Apr 2020
Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned
Edwin Zhang
Nikhil Gupta
Rodrigo Nogueira
Kyunghyun Cho
Jimmy J. Lin
55
58
0
10 Apr 2020
Asking and Answering Questions to Evaluate the Factual Consistency of Summaries
Alex Jinpeng Wang
Kyunghyun Cho
M. Lewis
HILM
94
486
0
08 Apr 2020
Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning
Zhaojiang Lin
Andrea Madotto
Pascale Fung
105
163
0
08 Apr 2020
A Sentence Cloze Dataset for Chinese Machine Reading Comprehension
Yiming Cui
Ting Liu
Ziqing Yang
Zhipeng Chen
Wentao Ma
Wanxiang Che
Shijin Wang
Guoping Hu
73
19
0
07 Apr 2020
DARE: Data Augmented Relation Extraction with GPT-2
Yannis Papanikolaou
Andrea Pierleoni
60
76
0
06 Apr 2020
Unsupervised Domain Clusters in Pretrained Language Models
Roee Aharoni
Yoav Goldberg
101
252
0
05 Apr 2020
Deep Entity Matching with Pre-Trained Language Models
Yuliang Li
Jinfeng Li
Yoshihiko Suhara
A. Doan
W. Tan
VLM
108
391
0
01 Apr 2020
Give your Text Representation Models some Love: the Case for Basque
Rodrigo Agerri
Iñaki San Vicente
Jon Ander Campos
Ander Barrena
X. Saralegi
Aitor Soroa Etxabe
Eneko Agirre
56
63
0
31 Mar 2020
Named Entities in Medical Case Reports: Corpus and Experiments
Sarah Schulz
Jurica vSeva
Samuel Rodriguez
Malte Ostendorff
Georg Rehm
44
9
0
29 Mar 2020
Mining Coronavirus (COVID-19) Posts in Social Media
Negin Karisani
Payam Karisani
38
24
0
28 Mar 2020
Calibration of Pre-trained Transformers
Shrey Desai
Greg Durrett
UQLM
344
302
0
17 Mar 2020
BERT as a Teacher: Contextual Embeddings for Sequence-Level Reward
Florian Schmidt
Thomas Hofmann
88
8
0
05 Mar 2020
jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models
Yada Pruksachatkun
Philip Yeres
Haokun Liu
Jason Phang
Phu Mon Htut
Alex Jinpeng Wang
Ian Tenney
Samuel R. Bowman
SSeg
36
94
0
04 Mar 2020
Med7: a transferable clinical natural language processing model for electronic health records
Andrey Kormilitzin
N. Vaci
Qiang Liu
A. Nevado-Holgado
97
120
0
03 Mar 2020
A Primer in BERTology: What we know about how BERT works
Anna Rogers
Olga Kovaleva
Anna Rumshisky
OffRL
137
1,511
0
27 Feb 2020
Predicting trends in the quality of state-of-the-art neural networks without access to training or testing data
Charles H. Martin
Tongsu Peng
Peng
Michael W. Mahoney
111
110
0
17 Feb 2020
TwinBERT: Distilling Knowledge to Twin-Structured BERT Models for Efficient Retrieval
Wenhao Lu
Jian Jiao
Ruofei Zhang
60
50
0
14 Feb 2020
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks
Carlos Aspillaga
Andrés Carvallo
Vladimir Araujo
ELM
69
31
0
14 Feb 2020
Previous
1
2
3
...
10
11
9
Next