v1v2v3v4 (latest)

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Journal of machine learning research (JMLR), 2019

23 October 2019

Sharan Narang

Papers citing "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

38 / 12,038 papers shown

Multilingual is not enough: BERT for Finnish

252

300

15 Dec 2019

WaLDORf: Wasteless Language-model Distillation On Reading-comprehension

169

13 Dec 2019

Extending Machine Language Models toward Human-Level Language Understanding

159

12 Dec 2019

FlauBERT: Unsupervised Language Model Pre-training for FrenchInternational Conference on Language Resources and Evaluation (LREC), 2019

350

431

11 Dec 2019

Zero-shot Text Classification With Generative Language Models

Raul Puri

Bryan Catanzaro

VLM

166

116

10 Dec 2019

Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art BaselineEuropean Conference on Computer Vision (ECCV), 2019

Devi Parikh

359

120

05 Dec 2019

12-in-1: Multi-Task Vision and Language Representation LearningComputer Vision and Pattern Recognition (CVPR), 2019

Devi Parikh

315

499

05 Dec 2019

BLiMP: The Benchmark of Linguistic Minimal Pairs for EnglishTransactions of the Association for Computational Linguistics (TACL), 2019

477

619

02 Dec 2019

What's Hidden in a Randomly Weighted Neural Network?Computer Vision and Pattern Recognition (CVPR), 2019

256

393

29 Nov 2019

Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQAComputer Vision and Pattern Recognition (CVPR), 2019

Ronghang Hu

Amanpreet Singh

Trevor Darrell

Marcus Rohrbach

361

224

14 Nov 2019

KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language RepresentationTransactions of the Association for Computational Linguistics (TACL), 2019

Xiaozhi Wang

Zhengyan Zhang

Jian Tang

386

771

13 Nov 2019

CamemBERT: a Tasty French Language ModelAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Louis Martin

Eric Villemonte de la Clergerie

Djamé Seddah

Benoît Sagot

540

1,056

10 Nov 2019

INSET: Sentence Infilling with INter-SEntential Transformer

248

10 Nov 2019

Learning to Few-Shot Learn Across Diverse Natural Language Classification TasksInternational Conference on Computational Linguistics (COLING), 2019

273

126

10 Nov 2019

The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational AgentsAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Jason Weston

264

09 Nov 2019

Sentence Meta-Embeddings for Unsupervised Semantic Textual SimilarityAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

474

09 Nov 2019

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized OptimizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Xiaodong Liu

643

590

08 Nov 2019

Contrastive Multi-document Question GenerationConference of the European Chapter of the Association for Computational Linguistics (EACL), 2019

Mengdi Wang

362

08 Nov 2019

BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performanceBlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP (BlackBoxNLP), 2019

R. Thomas McCoy

Junghyun Min

Tal Linzen

402

156

07 Nov 2019

Unsupervised Cross-lingual Representation Learning at ScaleAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Francisco Guzmán

Luke Zettlemoyer

492

7,725

05 Nov 2019

DialoGPT: Large-Scale Generative Pre-training for Conversational Response GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

650

1,658

01 Nov 2019

CCNet: Extracting High Quality Monolingual Datasets from Web Crawl DataInternational Conference on Language Resources and Evaluation (LREC), 2019

Francisco Guzmán

470

756

01 Nov 2019

Multi-Stage Document Ranking with BERT

317

461

31 Oct 2019

Discourse-Aware Neural Extractive Text SummarizationAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

334

292

30 Oct 2019

BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and ComprehensionAnnual Meeting of the Association for Computational Linguistics (ACL), 2019

Luke Zettlemoyer

834

12,121

29 Oct 2019

ZeRO: Memory Optimizations Toward Training Trillion Parameter ModelsInternational Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019

Yuxiong He

434

1,424

04 Oct 2019

ALBERT: A Lite BERT for Self-supervised Learning of Language RepresentationsInternational Conference on Learning Representations (ICLR), 2019

1.2K

7,141

26 Sep 2019

FreeLB: Enhanced Adversarial Training for Natural Language UnderstandingInternational Conference on Learning Representations (ICLR), 2019

686

490

25 Sep 2019

Portuguese Named Entity Recognition using BERT-CRF

Fábio Souza

Rodrigo Nogueira

R. Lotufo

266

280

23 Sep 2019

TinyBERT: Distilling BERT for Natural Language UnderstandingFindings (Findings), 2019

Xiaoqi Jiao

Yichun Yin

Lifeng Shang

Xin Jiang

Xiao Chen

Linlin Li

F. Wang

Qun Liu

VLM

604

2,161

23 Sep 2019

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

1.3K

2,442

17 Sep 2019

I-MAD: Interpretable Malware Detector Using Galaxy TransformerComputers & security (Comput. Secur.), 2019

299

15 Sep 2019

Conditional Text Generation for Harmonious Human-Machine Interaction

185

08 Sep 2019

Taming Momentum in a Distributed Asynchronous Environment

303

26 Jul 2019

Contextual Word Representations: A Contextual Introduction

Noah A. Smith

239

15 Feb 2019

Are All Layers Created Equal?

Chiyuan Zhang

Samy Bengio

Y. Singer

316

157

06 Feb 2019

Neural Abstractive Text Summarization with Sequence-to-Sequence Models

420

252

05 Dec 2018

Deep Learning for Genomics: A Concise Overview

289

02 Feb 2018