v1v2v3 (latest)

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks

23 April 2020

Kyle Lo

Papers citing "Don't Stop Pretraining: Adapt Language Models to Domains and Tasks"

50 / 1,369 papers shown

Reference-based Weak Supervision for Answer Sentence Selection using Web DataConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Vivek Krishnamurthy

Thuy Vu

Alessandro Moschitti

175

18 Apr 2021

On the Influence of Masking Policies in Intermediate Pre-trainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Sinong Wang

Hao Ma

Xiang Ren

Madian Khabsa

218

18 Apr 2021

SciCo: Hierarchical Cross-Document Coreference for Scientific ConceptsConference on Automated Knowledge Base Construction (AKBC), 2021

328

18 Apr 2021

Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled CorpusConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Dirk Groeneveld

297

557

18 Apr 2021

Transductive Learning for Abstractive News Summarization

214

17 Apr 2021

Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue SystemsJournal of Artificial Intelligence Research (JAIR), 2021

481

17 Apr 2021

The challenges of temporal alignment on Twitter during crisesConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

217

17 Apr 2021

Moving on from OntoNotes: Coreference Resolution Model TransferConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Patrick Xia

Benjamin Van Durme

225

17 Apr 2021

Sequential Cross-Document Coreference ResolutionConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Emily Allaway

Shuai Wang

Miguel Ballesteros

149

17 Apr 2021

On the Importance of Effectively Adapting Pretrained Language Models for Active LearningAnnual Meeting of the Association for Computational Linguistics (ACL), 2021

Katerina Margatina

Loïc Barrault

Nikolaos Aletras

245

16 Apr 2021

Capturing Row and Column Semantics in Transformer Based Question Answering over TablesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Michael R. Glass

Mustafa Canim

A. Gliozzo

Saneem A. Chemmengath

Nicolas Rodolfo Fauceglia

LMTD

284

16 Apr 2021

AMMU : A Survey of Transformer-based Biomedical Pretrained Language ModelsJournal of Biomedical Informatics (JBI), 2021

Katikapalli Subramanyam Kalyan

A. Rajasekharan

S. Sangeetha

LM&MA MedIm

379

190

16 Apr 2021

What to Pre-Train on? Efficient Intermediate Task SelectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

246

106

16 Apr 2021

Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social MediaConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Paul Röttger

J. Pierrehumbert

203

16 Apr 2021

To Share or not to Share: Predicting Sets of Sources for Model Transfer LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

158

16 Apr 2021

A Million Tweets Are Worth a Few Points: Tuning Transformers for Customer Service TasksNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

143

16 Apr 2021

Probing Across Time: What Does RoBERTa Know and When?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021

318

16 Apr 2021

Towards Robust Neural Retrieval Models with Synthetic Pre-Training

Revanth Reddy Gangi Reddy

Vikas Yadav

Heng Ji

133

15 Apr 2021

Cross-Domain Label-Adaptive Stance DetectionConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

281

15 Apr 2021

Pseudo Zero Pronoun Resolution Improves Zero Anaphora ResolutionConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Ryuto Konno

Shun Kiyono

Yuichiroh Matsubayashi

Hiroki Ouchi

Kentaro Inui

146

15 Apr 2021

Multitasking Inhibits Semantic DriftNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Athul Paul Jacob

M. Lewis

Jacob Andreas

182

15 Apr 2021

Modeling Human Mental States with an Entity-based Narrative GraphNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

I-Ta Lee

Maria Leonor Pacheco

Dan Goldwasser

126

14 Apr 2021

UDALM: Unsupervised Domain Adaptation through Language ModelingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Constantinos F. Karouzos

Georgios Paraskevopoulos

Alexandros Potamianos

163

14 Apr 2021

TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding LearningConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

Kexin Wang

Nils Reimers

Iryna Gurevych

358

214

14 Apr 2021

Detoxifying Language Models Risks Marginalizing Minority VoicesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

246

136

13 Apr 2021

Semantic maps and metrics for science Semantic maps and metrics for science using deep transformer encoders

Brendan Chambers

James A. Evans

MedIm

175

13 Apr 2021

SpartQA: : A Textual Question Answering Benchmark for Spatial ReasoningNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Roshanak Mirzaee

Hossein Rajaby Faghihi

Qiang Ning

Parisa Kordjmashidi

179

101

12 Apr 2021

On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic DependenciesNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Tianyi Zhang

Tatsunori Hashimoto

AI4CE

196

12 Apr 2021

Fine-tuning Encoders for Improved Monolingual and Zero-shot Polylingual Neural Topic ModelingNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

Aaron Mueller

Mark Dredze

152

11 Apr 2021

TAPAS at SemEval-2021 Task 9: Reasoning over tables with intermediate pre-trainingInternational Workshop on Semantic Evaluation (SemEval), 2021

Thomas Müller

Julian Martin Eisenschlos

Syrine Krichene

LMTD

255

02 Apr 2021

CURIE: An Iterative Querying Approach for Reasoning About Situations

Abhilasha Ravichander

Peter Clark

Eduard H. Hovy

ReLM LRM

209

01 Apr 2021

CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning

Lei Zhang

189

01 Apr 2021

Self-Supervised Pretraining Improves Self-Supervised PretrainingIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2021

...

Shanghang Zhang

308

124

23 Mar 2021

Improving and Simplifying Pattern Exploiting TrainingConference on Empirical Methods in Natural Language Processing (EMNLP), 2021

243

154

22 Mar 2021

MasakhaNER: Named Entity Recognition for African LanguagesTransactions of the Association for Computational Linguistics (TACL), 2021

David Ifeoluwa Adelani

Graham Neubig

...

305

226

22 Mar 2021

AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive SummarizationNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

310

21 Mar 2021

Self-Supervised Test-Time Learning for Reading ComprehensionNorth American Chapter of the Association for Computational Linguistics (NAACL), 2021

194

20 Mar 2021

Structure Inducing Pre-TrainingNature Machine Intelligence (Nat. Mach. Intell.), 2021

Matthew B. A. McDermott

Brendan Yap

Peter Szolovits

Marinka Zitnik

337

18 Mar 2021

Modeling the Second Player in Distributionally Robust OptimizationInternational Conference on Learning Representations (ICLR), 2021

Paul Michel

Tatsunori Hashimoto

Graham Neubig

227

18 Mar 2021

Does the Magic of BERT Apply to Medical Code Assignment? A Quantitative Study

Shaoxiong Ji

M. Holtta

Pekka Marttinen

284

11 Mar 2021

Self-supervised Text-to-SQL Learning with Header Alignment Training

Donggyu Kim

Seanie Lee

SSL LMTD

117

11 Mar 2021

CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review

334

252

10 Mar 2021

Self-supervised Regularization for Text ClassificationTransactions of the Association for Computational Linguistics (TACL), 2021

Meng Zhou

Zechen Li

P. Xie

158

09 Mar 2021

Large Pre-trained Language Models Contain Human-like Biases of What is Right and Wrong to DoNature Machine Intelligence (Nat. Mach. Intell.), 2021

289

358

08 Mar 2021

"Sharks are not the threat humans are": Argument Component Segmentation in School Student EssaysWorkshop on Innovative Use of NLP for Building Educational Applications (UNBEA), 2021

Tariq Alhindi

Debanjan Ghosh

115

08 Mar 2021

Measuring Mathematical Problem Solving With the MATH Dataset

899

3,885

05 Mar 2021

OAG-BERT: Towards A Unified Backbone Language Model For Academic Knowledge ServicesKnowledge Discovery and Data Mining (KDD), 2021

Xiao Liu

Hongxia Yang

Yuxiao Dong

Jie Tang

VLM

230

03 Mar 2021

Gradual Fine-Tuning for Low-Resource Domain Adaptation

168

03 Mar 2021

ToxCCIn: Toxic Content Classification with InterpretabilityWorkshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA), 2021

239

01 Mar 2021

Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLPTransactions of the Association for Computational Linguistics (TACL), 2021

Timo Schick

Sahana Udupa

Hinrich Schütze

694

438

28 Feb 2021